1 / 21

GSR The Gene Sequence evolution model with iid Rate variation over tree

GSR The Gene Sequence evolution model with iid Rate variation over tree. Örjan Åkerborg, KTH Lars Arvestad, KTH Jens Lagergren, KTH Bengt Sennblad. What?. Gene evolution through duplication and loss Sequence evolution. Why?. Base reconciliation analysis directly on data

Download Presentation

GSR The Gene Sequence evolution model with iid Rate variation over tree

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. GSRThe Gene Sequence evolution model with iid Rate variation over tree Örjan Åkerborg, KTH Lars Arvestad, KTH Jens Lagergren, KTH Bengt Sennblad

  2. What? • Gene evolution through duplication and loss • Sequence evolution

  3. Why? • Base reconciliation analysis directly on data • Avoid information loss • Addresses uncertainty better • Gene tree reconstruction should mirror generation

  4. When? • Arvestad et al. 2003, • MrBayes + GEM • Flawed model

  5. 60s ribosomal data

  6. When? • Arvestad et al. 2004 • Intergrated GEM + Substitution model • Mathematically correct model • Molecular clock • Sampling algorithm - slow • Arvestad et al. 2003, • MrBayes + GEM • Flawed model

  7. MHC revisited

  8. When? • Arvestad et al. 2003, • MrBayes + GEM • Flawed model • Arvestad et al. 2004 • Intergrated GEM + Substitution model • Mathematically correct model • Molecular clock • Sampling algorithm - slow • Åkerborg et al. Submitted (GSR) • Integrated GEM + SRT model

  9. How? • GSR • GEM • Reconciled trees – duplication and loss • SRT • Relaxed clock model (iid) • Substitution model • Fast algorithm • Discretized time space

  10. Pr[D,T] Sequence data, F Trees, T Self-consistency

  11. Self-consistency – the X%-test

  12. Application to Yeast data • Compare to prev. Results • YGOB • Orthogrups (SYNERGY) • Both synteny-based

  13. Synteny -- gene order

  14. Application to Yeast data • Compare to prev. Results • YGOB • Orthogrups (SYNERGY) • Both synteny-based • Whole genome duplication • Challenge! • Genome-wide analysis! • 4809 gene families (orthogroups)

  15. Comparison YGOB results

  16. Molecular clock?

  17. Comparison SYNERGY results

  18. Sequence vs. synteny data • No sequence diff • 36% of data sets are >85% similar • Strong divergence • 25% of data set are <40% similar • Long-branch attraction • Conflicting sequence-synteny signal

  19. Sequence vs. synteny data

  20. 31.6 % 21.6 % Orthogroup 3176 42.4 % Single best SYNERGY tree

  21. Summary • primeGSR • Integrated model • Reconciliation – gene duplication loss • Relaxed clock • Sequence evolution • Efficient algorithms • Improved gene tree reconstruction • Future prospects • Divergence time estimates (MAP) • Species tree reconstruction • Include synteny

More Related