1 / 15

New Scalable Coalescent-Based Species Tree Estimation Methods: BBCA, ASTRAL, and ASTRID

Learn about new scalable coalescent-based methods for species tree estimation: BBCA, ASTRAL, and ASTRID. These methods provide accurate analysis of large datasets with high efficiency and statistical consistency in the presence of ILS.

rronda
Download Presentation

New Scalable Coalescent-Based Species Tree Estimation Methods: BBCA, ASTRAL, and ASTRID

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. New Scalable Coalescent-Based Species Tree Estimation Methods: BBCA, ASTRAL, and ASTRID Tandy Warnow The University of Illinois

  2. BBCA, ASTRAL, and ASTRID • BBCA is a simple way of making *BEAST scalable to large numbers of genes (but doesn’t address large numbers of species) • ASTRAL and ASTRID: • summary methods that are statistically consistent in the presence of ILS, and that run in polynomial time. • Both can analyze very large datasets (1000 species and 1000 genes – or more) with high accuracy. • The relative accuracy depends on the model condition – sometimes ASTRAL is better, sometimes ASTRID is better.

  3. Main competing approaches gene 1gene 2 . . . gene k . . . Analyze separately . . . Summary Method Species Concatenation

  4. Incomplete Lineage Sorting (ILS) is a dominant cause of gene tree heterogeneity

  5. *BEAST • Heled and Drummond, MBE 2010 • Input: set of multiple sequence alignments for collection of genes • Techique: Uses MCMC to co-estimate gene trees and species trees • Highly accurate • Limited in practice to small numbers of genes and species, due to convergence issues

  6. BBCA: improving *BEAST Zimmermann, Mirarab, and Warnow, BMC Genomics 2014: • Randomly partition genes into bins of at most 25 genes • Apply *BEAST to each bin, and take the gene trees it computes • Apply favored summary method to the gene trees • Matches accuracy of *BEAST • Improves scalability to large # genes

  7. ASTRAL • Mirarab and Warnow, Bioinformatics 2014 • https://github.com/smirarab/ASTRAL Tutorial in Species Tree Workshop

  8. ASTRID • ASTRID: Accurate species trees using internode distances, Vachaspati and Warnow, RECOMB-CG 2015 and BMC Genomics 2015 • Algorithmic design: Computes a matrix of average leaf-to-leaf topological distances, and then computes a tree using FastME (more accurate than neighbor Joining and faster, too). • Related to NJst (Liu and Yu, 2010), which computes the same matrix but then computes the tree using neighbor joining (NJ). • Statistically consistent under the MSC • O(kn2 + n3) time where there are k gene trees and n species

  9. BBCA, ASTRAL, and ASTRID • BBCA is a simple way of making *BEAST scalable to large numbers of genes (but doesn’t address large numbers of species) • ASTRAL and ASTRID: • summary methods that are statistically consistent in the presence of ILS, and that run in polynomial time. • Both can analyze very large datasets (1000 species and 1000 genes – or more) with high accuracy. • The relative accuracy depends on the model condition – sometimes ASTRAL is better, sometimes ASTRID is better.

  10. Acknowledgments Software ASTRAL: Available at https://github.com/smirarab ASTRID: Available at https://github.com/pranjalv123 Others at http://tandy.cs.illinois.edu/software.html NSF grant DBI-1461364 (joint with Noah Rosenberg at Stanford and LuayNakhleh at Rice): http://tandy.cs.illinois.edu/PhylogenomicsProject.html NSF graduate fellowship to PranjalVachaspati HHMI graduate fellowship to Siavash Mirarab Papers available at http://tandy.cs.illinois.edu/papers.html

More Related