《淀粉樣變性病》PPT課件_第1頁
《淀粉樣變性病》PPT課件_第2頁
《淀粉樣變性病》PPT課件_第3頁
《淀粉樣變性病》PPT課件_第4頁
《淀粉樣變性病》PPT課件_第5頁
已閱讀5頁,還剩128頁未讀, 繼續(xù)免費閱讀

下載本文檔

版權說明:本文檔由用戶提供并上傳,收益歸屬內容提供方,若內容存在侵權,請進行舉報或認領

文檔簡介

1、NCBI FieldGuide A Field Guide Genome resources Sequence similarity NCBI FieldGuide Genome Resources NCBI FieldGuide Genomic Biology NCBI FieldGuide NCBI FieldGuide NCBI FieldGuide NCBI FieldGuide Genome Projects: microb NCBI FieldGuide NCBI FieldGuide NCBI FieldGuide Genome Resources NCBI FieldGuide

2、 A single query interface to SequencesSequences - RefSeqs- RefSeqs - GenBank- GenBank - Homologene- Homologene Maps MapViewerMaps MapViewer Entrez linksEntrez links LocusLink LocusLink will be replaced by Entrez Gene on MARCH 1, 2005. Check Gene FAQ for current information. NCBI FieldGuide Entrez Ge

3、ne LocusLink A single query interface to SequencesSequences - RefSeqs - RefSeqs - GenBank - GenBank - Homologene - Homologene Maps MapViewerMaps MapViewer Entrez linksEntrez links Entrez Gene More organisms - all RefSeq genomes Entrez integration NCBI FieldGuide Gsnsym 淀粉樣變性病 NCBI FieldGuide NCBI Fi

4、eldGuide Global Entrez: NADH2 nadh2 47 NCBI FieldGuide Entrez Gene: NADH2 26 records NCBI FieldGuide Gene Record for Pongo NADH2 Homo sapiens NCBI FieldGuide Display Exons/Introns: Gene Table NCBI FieldGuide Gene Table NCBI FieldGuide A Record With More Data: Human HFE 血色沉著病 NCBI FieldGuide Gene Gra

5、phic Links NM_NM_ NP_NP_ NCBI FieldGuide Introns/Exons: Gene Table links to sequence NCBI FieldGuide A Record With More Data: Human HFE NCBI FieldGuide Entrez SNP hfegene name AND humanorgn 52 血色沉著病 NCBI FieldGuide Linking to SNP 染色體定位 基因定位 序列定位 NCBI FieldGuide SNP in Structure NCBI FieldGuide Link

6、to OMIM NCBI FieldGuide Variants in OMIM NCBI FieldGuide Genome Resources NCBI FieldGuide Gene-oriented clusters of expressed sequences Automatic clustering using MegaBlast Each cluster represents a unique gene Informed by genome hits Information on tissue types and map locations Useful for gene dis

7、covery and selection of mapping reagents UniGene NCBI FieldGuide A Cluster of ESTs query 5 EST hits 3 EST hits NCBI FieldGuide Unigene NCBI FieldGuide UniGene Collections NCBI FieldGuide Example UniGene Cluster NCBI FieldGuide Histogram of cluster sizes for UniGene Hs build 177 NCBI FieldGuide UniGe

8、ne NCBI FieldGuide UniGene NCBI FieldGuide UniGene Cluster Hs.95351: expression NCBI FieldGuide UniGene Cluster Hs.95351: seqs NCBI FieldGuide Download sequences web page ftp site NCBI FieldGuide Genome Resources NCBI FieldGuide The New Homologene Automated detection of homologs among the annotated

9、genes of completely sequenced eukaryotic genomes. No longer UniGene based Protein similarities first Guided by taxonomic tree Includes orthologs and paralogs NCBI FieldGuide Orthologs 和 Paralogs 是同源序列的兩種類型。 Orthologs(垂直同源基因)是指來自于不同物種的 由垂直家系(物種形成)進化而來的蛋白,并且典型的保 留與原始蛋白有相同的功能。 Paralogs(平行同源基因)是那些在一定物種中

10、的 來源于基因復制的蛋白,可能會進化出新的與原來有關的 功能。請參考文獻獲得更多的信息。 NCBI FieldGuide gene duplication Paralogs vs Orthologs early globin gene A-chain gene B-chain gene frog A chick A mouse Amouse B chick B frog B paralogs orthologs orthologs NCBI FieldGuide The New Homologene Homologene Build 37.2 Species Number of genes i

11、nput grouped groups NCBI FieldGuide RAG1 Homologene rag1 12 recombination activating gene NCBI FieldGuide RAG1 Homolgene RAG1 Amniota NCBI FieldGuide Homolgene: RAG1 NCBI FieldGuide NCBI FieldGuide Homolgene: RAG1 NCBI FieldGuide Genome Resources NCBI FieldGuide NCBI FieldGuide NCBI FieldGuide NCBI

12、FieldGuide MapViewer NCBI FieldGuide List View NCBI FieldGuide Human MapViewer adar 腺甙脫氨酶 NCBI FieldGuide MapViewer: Human ADAR 4 NCBI FieldGuide MV Hs ADAR 3 UTR 5 UTR NCBI FieldGuide Maps uses as query very sensitive protein search RPS BLAST searches a database of PSSMs tool for conserved domain s

13、earches NCBI FieldGuide Widely used similarity search tool Heuristic approach based on Smith Waterman algorithm Finds best local alignments Provides statistical significance All combinations (DNA/Protein) query and database. . DNA vs DNA blastn DNA translation vs Protein blastx Protein vs Protein bl

14、astp Protein vs DNA translation tblastn DNA translation vs DNA translation tblastx www, standalone, and network clients Basic Local Alignment Search Tool NCBI FieldGuide Translated BLAST QueryQueryDatabaseDatabaseProgramProgram NPucleotide rotein N N N N P P blastx tblastn tblastx PPPP PP PPPP PP PP

15、PP PP PPPP PP Particularly useful for nucleotide sequences without protein annotations, such as ESTs or genomic DNA NCBI FieldGuide How BLAST Works Make lookup table of “words” for query Scan database for hits Ungapped extensions of hits (initial HSPs) Gapped extensions (no traceback) Gapped extensi

16、ons (traceback; alignment details) NCBI FieldGuide Nucleotide Words GTACTGGACATGGACCCTACAGGAAQuery: GTACTGGACAT TACTGGACATG ACTGGACATGG CTGGACATGGA TGGACATGGAC GGACATGGACC GACATGGACCC ACATGGACCCT Make a lookup table of words 11-mer . . . 828megablast 711blastn minimumdefaultWORD SIZE NCBI FieldGuide

17、 Protein Words GTQITVEDLFYNIATRRKALKNQuery: Neighborhood Words LTV, MTV, ISV, LSV, etc. GTQ TQI QIT ITV TVE VED EDL DLF . Make a lookup table of words Word size = 3 (default) Word size can only be 2 or 3 -f 11 = blastp default NCBI FieldGuide Minimum Requirements for a Hit Nucleotide BLAST requires

18、one exact match Protein BLAST requires two neighboring matches within 40 aa GTQITVEDLFYNI SEI YYN ATCGCCATGCTTAATTGGGCTT CATGCTTAATT neighborhood words one exact match two matches -A 40 = blastp default NCBI FieldGuide BLASTP Summary YLS HFL Sbjct 287 LEETYAKYLHKGASYFVYLSLNMSPEQLDVNVHPSKRIVHFLYDQEI

19、333 Query 1 IETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESI 47 Gapped extension with trace backGapped extension with trace back Query 1 IETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESI-LEV 50 +E YA YL K F+YLSL +SP+ +DVNVHP+K VHFL+ I + + Sbjct 287 LEETYAKYLHKGASYFVYLSLNMSPEQLDVNVHPSKRIVHFLYDQEIATSI 33

20、7 Final HSPFinal HSP +E YA YL K F+ L +SP+ +DVNVHP+K V + I High-scoring pair (HSP)High-scoring pair (HSP) HFL 18 HFV 15 HFS 14 HWL 13 NFL 13 DFL 12 HWV 10 etc YLS 15 YLT 12 YVS 12 YIT 10 etc Neighborhood Neighborhood wordswords Neighborhood Neighborhood score thresholdscore threshold T (-f) =11T (-f)

21、 =11 Query: IETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESILEV example query wordsexample query words NCBI FieldGuide Scoring Systems - Nucleotides A G C T A +1 3 3 -3 G 3 +1 3 -3 C 3 3 +1 -3 T 3 3 3 +1 Identity matrix CAGGTAGCAAGCTTGCATGTCA | | | raw score = 19-9 = 10 CACGTAGCAAGCTTG-GTGTCA -r 1 -q

22、 -3 NCBI FieldGuide Scoring Systems - Proteins Position Independent Matrices PAM Matrices (Percent Accepted Mutation) Derived from observation; small dataset of alignments Implicit model of evolution All calculated from PAM1 PAM250 widely used BLOSUM Matrices (BLOck SUbstitution Matrices) Derived fr

23、om observation; large dataset of highly conserved blocks Each matrix derived separately from blocks with a defined percent identity cutoff BLOSUM62 - default matrix for BLAST Position Specific Score Matrices (PSSMs) PSI- and RPS-BLAST NCBI FieldGuide A 4 R -1 5 N -2 0 6 D -2 -2 1 6 C 0 -3 -3 -3 9 Q

24、-1 1 0 0 -3 5 E -1 0 0 2 -4 2 5 G 0 -2 0 -1 -3 -2 -2 6 H -2 0 1 -1 -3 0 0 -2 8 I -1 -3 -3 -3 -1 -3 -3 -4 -3 4 L -1 -2 -3 -4 -1 -2 -3 -4 -3 2 4 K -1 2 0 -1 -3 1 1 -2 -1 -3 -2 5 M -1 -1 -2 -3 -1 0 -2 -3 -2 1 2 -1 5 F -2 -3 -3 -3 -2 -3 -3 -3 -1 0 0 -3 0 6 P -1 -2 -2 -1 -3 -1 -1 -2 -2 -3 -3 -1 -2 -4 7 S

25、 1 -1 1 0 -1 0 0 0 -1 -2 -2 0 -1 -2 -1 4 T 0 -1 0 -1 -1 -1 -1 -2 -2 -1 -1 -1 -1 -2 -1 1 5 W -3 -3 -4 -4 -2 -2 -3 -2 -2 -3 -2 -3 -1 1 -4 -3 -2 11 Y -2 -2 -2 -3 -2 -1 -2 -3 2 -1 -1 -2 -1 3 -3 -2 -2 2 7 V 0 -3 -3 -3 -1 -2 -2 -3 -3 3 1 -2 1 -1 -2 -2 0 -3 -1 4 X 0 -1 -1 -1 -2 -1 -1 -1 -1 -1 -1 -1 -1 -1 -

26、2 0 0 -2 -1 -1 -1 A R N D C Q E G H I L K M F P S T W Y V X BLOSUM62 D F Negative for less likely substitutions D Y F Positive for more likely substitutions NCBI FieldGuide Position-Specific Score Matrix DAF-1 Serine/Threonine protein kinases catalytic loop 174PSSM scores54 NCBI FieldGuide A R N D C

27、 Q E G H I L K M F P S T W Y V 435 K -1 0 0 -1 -2 3 0 3 0 -2 -2 1 -1 -1 -1 -1 -1 -1 -1 -2 436 E 0 1 0 2 -1 0 2 -1 0 -1 -1 0 0 0 -1 0 0 -1 -1 -1 437 S 0 0 -1 0 1 1 0 1 1 0 -1 0 0 0 2 0 -1 -1 0 -1 438 N -1 0 -1 -1 1 0 -1 3 3 -1 -1 1 -1 0 0 -1 -1 1 1 -1 439 K -2 1 1 -1 -2 0 -1 -2 -2 -1 -2 5 1 -2 -2 -1

28、-1 -2 -2 -1 440 P -2 -2 -2 -2 -3 -2 -2 -2 -2 -1 -2 -1 0 -3 7 -1 -2 -3 -1 -1 441 A 3 -2 1 -2 0 -1 0 1 -2 -2 -2 0 -1 -2 3 1 0 -3 -3 0 442 M -3 -4 -4 -4 -3 -4 -4 -5 -4 7 0 -4 1 0 -4 -4 -2 -4 -1 2 443 A 4 -4 -4 -4 0 -4 -4 -3 -4 4 -1 -4 -2 -3 -4 -1 -2 -4 -3 4 444 H -4 -2 -1 -3 -5 -2 -2 -4 10 -6 -5 -3 -4

29、-3 -2 -3 -4 -5 0 -5 445 R -4 8 -3 -4 0 -1 -2 -3 -2 -5 -4 0 -3 -2 -4 -3 -3 0 -4 -5 446 D -4 -4 -1 8 -6 -2 0 -3 -3 -5 -6 -3 -5 -6 -4 -2 -3 -7 -5 -5 447 I -4 -5 -6 -6 -3 -4 -5 -6 -5 3 5 -5 1 1 -5 -5 -3 -4 -3 1 448 K 0 0 1 -3 -5 -1 -1 -3 -3 -5 -5 7 -4 -5 -3 -1 -2 -5 -4 -4 449 S 0 -3 -2 -3 0 -2 -2 -3 -3

30、-4 -4 -2 -4 -5 2 6 2 -5 -4 -4 450 K 0 3 0 1 -5 0 0 -4 -1 -4 -3 4 -3 -2 2 1 -1 -5 -4 -4 451 N -4 -3 8 -1 -5 -2 -2 -3 -1 -6 -6 -2 -4 -5 -4 -1 -2 -6 -4 -5 452 I -3 -5 -5 -6 0 -5 -5 -6 -5 6 2 -5 2 -2 -5 -4 -3 -5 -3 3 453 M -4 -4 -6 -6 -3 -4 -5 -6 -5 0 6 -5 1 0 -5 -4 -3 -4 -3 0 454 V -3 -3 -5 -6 -3 -4 -5

31、 -6 -5 3 3 -4 2 -2 -5 -4 -3 -5 -3 5 455 K -2 1 1 4 -5 0 -1 -2 1 -4 -2 4 -3 -2 -3 0 -1 -5 -2 -3 456 N 1 1 3 0 -4 -1 1 0 -3 -4 -4 3 -2 -5 -2 2 -2 -5 -4 -4 457 D -3 -2 5 5 -1 -1 1 -1 0 -5 -4 0 -2 -5 -1 0 -2 -6 -4 -5 458 L -3 -1 0 -3 0 -3 -2 3 -4 -2 3 0 1 1 -2 -2 -3 5 -1 -3 Position-Specific Score Matri

32、x catalytic loop ./blastpgp -i NP_499868.2 -d nr -j 3 -Q NP_499868.pssm NCBI FieldGuide Local Alignment Statistics High scores of local alignments between two random sequences follow the Extreme Value Distribution Score (S) Alignments (applies to ungapped alignments) E = Kmne-S or E = mn2-S K = scal

33、e for search space = scale for scoring system S = bitscore = (S - lnK)/ln2 Expect ValueExpect Value E = number of database hits you expect to find by chance, S your score expected number of random hits More info: NCBI FieldGuide Advanced BLAST Options: Nucleotide Example Entrez Queries nucleotide al

34、lFilter NOT mammaliaOrganism green plantsOrganism biomol mrnaProperties gbdiv estProperties AND ratorganism Other Advanced e 10000 expect value -v 2000 descriptions -b 2000 alignments NCBI FieldGuide Advanced BLAST Options: Protein Matrix Selection PAM30 - most stringent BLOSUM45 - least stringent E

35、xample Entrez Queries proteins allFilter NOT mammaliaOrganism green plantsOrganism srcdb refseqProperties Other Advanced e 10000 expect value -v 2000 descriptions -b 2000 alignments Limit by taxon Mus musculusOrganism MammaliaOrganism ViridiplantaeOrganism NCBI FieldGuide sp|P27476|NSR1_YEAST NUCLEA

36、R LOCALIZATION SEQUENCE BINDING PROTEIN (P67) Length = 414 Identities = 35/131 (26%), Positives = 56/131 (42%), Gaps = 4/131 (3%) Query: 362 STTSLTSSSTSGSSDKVYAHQMVRTDSREQKLDAFLQPLSKPLS-SQPQAIVTEDKTD 418 S+S SSS+S SS + + +S + + S S S+ + E K Sbjct: 29 SSSSSESSSSSSSSSESESESESESESSSSSSSSDSESSSSSSSDSESE

37、AETKKEESKDS 88 Filtered Unfiltered Low Complexity Filtering NCBI FieldGuide Other BLAST Algorithms Megablast Discontiguous Megablast PSI-BLAST PHI-BLAST NCBI FieldGuide Megablast: NCBIs Genome Annotator Long alignments of similar DNA sequences Greedy algorithm Concatenation of query sequences Faster

38、 than blastn; less sensitive NCBI FieldGuide MegaBLAST 18(3):440-5 W = word size; # matches in template t = template length NCBI FieldGuide Discontiguous (Cross-species) MegaBLAST NCBI FieldGuide Discontiguous Word Options NCBI FieldGuide MegaBLAST vs Discontiguous MegaBLAST NM_017460 Homo sapiens c

39、ytochrome P450, family 3, subfamily A, polypeptide 4 (CYP3A4), transcript variant 1, mRNA (2768 letters) vs Drosophila NCBI FieldGuide MegaBLAST vs Discontiguous MegaBLAST MegaBLAST = “No significant similarity found.” Discontiguous megaBLAST = NCBI FieldGuide Another Example . . . Discontiguous meg

40、aBLAST = numerous hits . . . Query: NM_078651 Drosophila melanogaster CG18582-PA (mbt) mRNA, (3244 bp) /note= mushroom bodies tiny; synonyms: Pak2, STE20, dPAK2 MegaBLAST = “No significant similarity found.” Database: nr (nt), Mammaliaorgn NCBI FieldGuide Ex: Discontiguous MegaBLAST NCBI FieldGuide

41、Ex: BLASTN NCBI FieldGuide PSI-BLAST Example: Confirming relationships of purine nucleotide metabolism proteins Position-specific Iterated BLAST NCBI FieldGuide gi|113340|sp|P03958|ADA_MOUSE ADENOSINE DEAMINASE (ADENOSINE MAQTPAFNKPKVELHVHLDGAIKPETILYFGKKRGIALPADTVEELRNIIGMDKPLSLPGF VIAGCREAIKRIAYEF

42、VEMKAKEGVVYVEVRYSPHLLANSKVDPMPWNQTEGDVTPDDVVD EQAFGIKVRSILCCMRHQPSWSLEVLELCKKYNQKTVVAMDLAGDETIEGSSLFPGHVEAY RTVHAGEVGSPEVVREAVDILKTERVGHGYHTIEDEALYNRLLKENMHFEVCPWSSYLTGA VRFKNDKANYSLNTDDPLIFKSTLDTDYQMTKKDMGFTEEEFKRLNINAAKSSFLPEEEKK PSI-BLAST 0.005 E value cutoff for PSSM NCBI FieldGuide RESULTS: Ini

43、tial BLASTP Same results as protein-protein BLAST; different format NCBI FieldGuide Results of First PSSM Search Other purine nucleotide metabolizing enzymes not found by ordinary BLAST NCBI FieldGuide Tenth PSSM Search: Convergence Just below threshold, another nucleotide metabolism enzyme Check to

44、 add to PSSM NCBI FieldGuide Reverse PSI-BLAST (RPS)-BLAST NCBI FieldGuide Adenosine/AMP Deaminase Domain AMP Deaminases . . . . . . NCBI FieldGuide PHI-BLAST gi|231729|sp|P30429|CED4_CAEEL CELL DEATH PROTEIN 4 MLCEIECRALSTAHTRLIHDFEPRDALTYLEGKNIFTEDHSELISKMSTRLERIANFLRIYRRQASE LIDFFNYNNQSHLADFLEDYI

45、DFAINEPDLLRPVVIAPQFSRQMLDRKLLLGNVPKQMTCYIREYHV IKKLDEMCDLDSFFLFLHGRAGSGKSVIASQALSKSDQLIGINYDSIVWLKDSGTAPKSTFDLFTDI LKSEDDLLNFPSVEHVTSVVLKRMICNALIDRPNTLFVFDDVVQEETIRWAQELRLRCLVTTRDVEI ASQTCEFIEVTSLEIDECYDFLEAYGMPMPVGEKEEDVLNKTIELSSGNPATLMMFFKSCEPKTFEK GAxxxxGKST NCBI FieldGuide Whats New?Whats New? N

46、CBI FieldGuide BLAST Databases Nucleotide refseq_rna = NM_*, XM_* refseq_genomic = NC_*, NG_* env_nt environmental samplefilter, e.g., 16S rRNA Protein refseq = NP_*, XP_* env_nr NCBI FieldGuide New Formatter Select lower caseSelect red NCBI FieldGuide New Formatter gray line = same database hit hsps color-coded independently NCBI FieldGuide BLAST Output: Alignments & Filter low complexity sequence filteredlow complexity sequence filtered NCBI FieldGuide Advanced Options Limit to Organism protein allfilter Nprotein allfilter N Example Entre

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯系上傳者。文件的所有權益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網頁內容里面會有圖紙預覽,若沒有圖紙預覽就沒有圖紙。
  • 4. 未經權益所有人同意不得將文件中的內容挪作商業(yè)或盈利用途。
  • 5. 人人文庫網僅提供信息存儲空間,僅對用戶上傳內容的表現方式做保護處理,對用戶上傳分享的文檔內容本身不做任何修改或編輯,并不能對任何下載內容負責。
  • 6. 下載文件中如有侵權或不適當內容,請與我們聯系,我們立即糾正。
  • 7. 本站不保證下載資源的準確性、安全性和完整性, 同時也不承擔用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。

評論

0/150

提交評論