1 / 27

Finding detailed relationships between proteins specific to phenotypes among microbial organisms

Finding detailed relationships between proteins specific to phenotypes among microbial organisms. Daniel Park Molecular Biology Institute, UCLA Yeates lab SoCalBSI August 24, 2006. OUTLINE. Phylogenetic profiles Ternary logic analysis Building COG & phenotype profiles

Download Presentation

Finding detailed relationships between proteins specific to phenotypes among microbial organisms

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Finding detailed relationships between proteins specific to phenotypes among microbial organisms Daniel Park Molecular Biology Institute, UCLA Yeates lab SoCalBSI August 24, 2006

  2. OUTLINE • Phylogenetic profiles • Ternary logic analysis • Building COG & phenotype profiles • Results of logic analysis

  3. OUTLINE • Phylogenetic profiles • Ternary logic analysis • Building COG & phenotype profiles • Results of logic analysis

  4. PHYLOGENETIC PROFILES • Turning an earlier question on its side: • From, “What proteins are found in a genome?” • To, “What genomes contain a given protein?”

  5. VARIATIONS OF PHYLOGENETIC PROFILES • Relationships between protein families • Relationships between protein family profile and given target ‘phenotype’ profile

  6. OUTLINE • Phylogenetic profiles • Ternary logic analysis • Building COG & phenotype profiles • Results of logic analysis

  7. COMPLEXITY OF CELLULAR PROCESSES

  8. B A HIGHER ORDER RELATIONSHIPS:TERNARY LOGIC ANALYSIS

  9. 8 LOGIC TYPES FOR PHYLOGENETIC PROFILE TRIPLETS

  10. MEASURING MUTAL INFORMATION BETWEEN TWO PROFILES Where Uis the uncertainty coefficient relating profiles x and y His the Shannon entropy of the probability distributions Range of U: [0,1] Ex. U = 0.88  88% decrease in uncertainty High value of U indicates high mutual information between x and y

  11. MEASURING MUTAL INFORMATION AMONG THREE PROFILES U(c | f(a,b))where f(a,b) is the logical combination of a and b Constraints: U(c|a) < x U(c|b) < x U(c|f(a,b)) > y

  12. OUTLINE • Phylogenetic profiles • Ternary logic analysis • Building COG & phenotype profiles • Results of logic analysis

  13. COGs: CLUSTERS OF ORTHOLOGOUS GROUPS Set of orthologous proteins from at least three different lineages Cluster  Functional group

  14. COMBINATIONS OF COG PROFILES MATCHING A PHENOTYPE

  15. ASSOCIATING MORE GENOMES WITH COGS

  16. ` BUILDING COG PROFILES • 81,480 proteins • 354 bacterial genomes • 4,613 COGs

  17. BUILDING PHENOTYPE PROFILES http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi

  18. OUTLINE • Phylogenetic profiles • Ternary logic analysis • Building COG & phenotype profiles • Results of logic analysis

  19. Cumulative no. of protein triplets recovered at an uncertainty coefficient score greater than a given threshold

  20. Frequency for each of the eight logic function types observed

  21. CORRELATIONS WITH PHENOTYPES:TEMPERATURE RANGE • For U> 0.8, one relationship between proteins was found: Hyperthermophilicity = and( COG0432, !COG0225 ) U ( Hyp.| COG0432 ) = 0.26 U( Hyp. | COG0225 ) = 0.29 U( Hyp. | and( COG0432, !COG0225 ) ) = 0.71 [S] COG0432: Uncharacterized conserved protein [O] COG0225: Peptide methionine sulfoxide reductase

  22. LOGICAL COMBINATION OF COG PROFILES MATCHING A PHENOTYPE PROFILE c = hyperthermophilicity f = and( COG0432, !COG0225 ) a = COG0432 (Uncharacterized conserved protein) b = !COG0225 (Peptide methionine sulfoxide reductase)

  23. CONCLUSIONS • There may be a correlation between the absence of methionine sulfoxide reductase and the presence of an uncharacterized conserved protein in hyperthermophiles.

  24. CONCLUSIONS • Classified ~80,000 proteins from 354 bacterial genomes into ~4,600 COGs • Built COG and phenotype profile matrices for 354 fully sequenced bacterial genomes • Support that ternary relationships among COGs are biologically significant • Support that some logic types are seen in biology more than others: 1 (and) 5 7 (xor)

  25. FUTURE DIRECTIONS • Build a richer database of phenotype profiles • Investigate relationships at lower cutoffs • Experimentally characterize the unknown COG0432 by crystallography

  26. Todd Yeates Matteo Pellegrini Yeates lab Morgan Beeby Brian O’Connor Rest of the lab SoCalBSI 2006 Jamil Momand Wendie Johnston Sandra Sharp Nancy Warter-Perez Ronnie Cheng Fellow participants ACKNOWLEDGEMENTS

More Related