160 likes | 335 Views
Bacteriophage Gene Clustering and Phylogeny. Nicholas Celms San Diego State University Funded in part by NSF 0827278 UBM Interdisciplinary Training in Biology and Mathematics. Goal.
E N D
Bacteriophage Gene Clustering and Phylogeny Nicholas Celms San Diego State University Funded in part by NSF 0827278 UBM Interdisciplinary Training in Biology and Mathematics.
Goal • Build a method for making taxonomic groupings of bacteriophages based on sets of protein-encoding genes (PEGs)
Bacteriophages • Viruses that infect bacteria • ~24-200 nm long • Horizontal gene transfer • Short, highly-diverse genomes
Goal • Build a method for making taxonomic groupings of bacteriophages based on sets of protein-encoding genes (PEGs) • Define clusters of protein-encoding genes that differentiate strains of bacteriophages into subdivisions called clans • Define super-groups of clans called components • Examine components and clans for: • phylogeny • Classification of new strains • Use PEG clusters to: • Improve functional annotations • Define lifestyle indicators • Find horizontally-transferred groups of genes
Image Cluster vs. Clusters Image cluster: generic PEG set associated with a clan Cluster: a phage’s specific set of PEG orthologs of the image cluster
Data 120 phages, 8558 PEGs Filtered: 2512 contributive PEGs (appear in a cluster) 335 clans (interrelated groupings of phages) forming 14 components
Future • Broadening our analysis to all available phages • ~700 phages, ~55,000 PEGs • Phage phylogeny • Experimentally-validating suggested functional annotations