Phylogenetic Analysis



Presentation Transcript


  1. Phylogenetic Analysis • Greek: phylon = race, genetic = birth

  2. Phylogenetic Analysis • The evolutionary relationship among a set of species is called a phylogeny, represented by a phylogenetic tree. • Goal: infer the phylogenetic tree (reconstruction) from observations of existing organisms • Then: morphological characters • Now: molecular sequences! (Zuckerkandl & Pauling [1962])

  3. Relationship to MSA • Multiple alignment of sequences should take account of their evolutionary relationship. (Some multiple alignment algorithms do use a “guide tree”) • Alignment and tree-building can proceed simultaneously

  4. Phylogeny of … • … Orthologues: divergence from a common ancestor through speciation (found in different species) • … Paralogues: divergence through gene duplication (found within the same species/organism) [Figure: gene tree showing a gene duplication followed by speciation, with leaves 1A, 2A, 3A, 1B, 2B, 3B]

  5. Elements of a Tree • Leaves/Nodes: sequences • Taxa (singular: taxon): outer leaves • Edges: edge lengths correspond to evolutionary time periods • Roots:

  6. Molecular Clock Theory

  7. Molecular Clock Theory I (Zuckerkandl & Pauling, early 1960s) • For any given protein, accepted mutations in its amino acid sequence occur at a constant rate • Implications: • The number of accepted mutations is proportional to the length of the time interval • All proteins/species observed today have the same “molecular age” • Works well for closely related species

  8. Molecular Clock Theory II • The rate of accepted mutations may be different for different proteins (depending on their tolerance for mutations) • Different parts of a protein may evolve at different rates [Figure: counting mutations on two trees with leaves 1, 2, 3, 4]

  9. Distance-based Methods We assume that the “distance” between each pair of sequences is proportional to the evolutionary time between them.

  10. How to Collect Distance Data • Lab methods: mix single strands of DNA from different species, measure how tightly they associate. • Sequence analysis methods: estimate number of mutations based on sequence comparisons
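As a rough illustration of the sequence-analysis route, the sketch below estimates pairwise distances as the fraction of differing positions in an alignment (often called the p-distance) and fills a symmetric matrix with them. The function names and the toy alignment are hypothetical and not taken from the slides; the p-distance also undercounts repeated mutations at the same site, so it is only a first approximation to the number of mutations.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Fraction of gap-free aligned columns at which two sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    # Ignore columns where either sequence has a gap character.
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no gap-free columns to compare")
    return sum(x != y for x, y in pairs) / len(pairs)


def distance_matrix(seqs: dict) -> dict:
    """Symmetric pairwise distances keyed by (name, name)."""
    d = {(n, n): 0.0 for n in seqs}
    for m, n in combinations(seqs, 2):
        d[m, n] = d[n, m] = p_distance(seqs[m], seqs[n])
    return d


# Hypothetical toy alignment (assumption, not data from the presentation).
toy = {"s1": "ACGTACGT", "s2": "ACGTACGA", "s3": "ACGAACTA"}
D = distance_matrix(toy)
for m, n in combinations(toy, 2):
    print(m, n, D[m, n])
```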

  11. Fill Out A Distance Matrix

  12. Ultrametric Distance Matrices • D is an ultrametric distance matrix if and only if, for every three indices i, j and k, there is a tie for the maximum of D(i,j), D(i,k) and D(j,k). That is, the maximum is not unique.
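The three-point condition above can be checked directly by looping over all triples of indices. This is a minimal sketch; the function name, the tolerance parameter and both example matrices are assumptions made for illustration.

```python
from itertools import combinations


def is_ultrametric(D, tol=1e-9):
    """True if, for every triple, the largest of the three distances is tied."""
    n = len(D)
    for i, j, k in combinations(range(n), 3):
        _, second, largest = sorted((D[i][j], D[i][k], D[j][k]))
        if largest - second > tol:  # the maximum is unique for this triple
            return False
    return True


# Hypothetical clock-like data: every triple has a tied maximum.
clock_like = [
    [0, 2, 6, 6],
    [2, 0, 6, 6],
    [6, 6, 0, 4],
    [6, 6, 4, 0],
]

# Hypothetical data from a tree with unequal leaf edges: additive, not ultrametric.
not_clock_like = [
    [0, 3, 7, 8],
    [3, 0, 6, 7],
    [7, 6, 0, 3],
    [8, 7, 3, 0],
]

print(is_ultrametric(clock_like))
print(is_ultrametric(not_clock_like))
```

The check visits every triple, so it takes time cubic in the number of sequences, which is usually acceptable for small data sets.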

  13. Test if the data is ultrametric • If it is, Molecular Clock Theory I is valid for this group of sequences

  14. Ultrametric vs. not ultrametric [Figure: two trees with leaves 1, 2, 3, 4, one ultrametric and one not ultrametric; label: constant mutation rate]

  15. When Molecular Clock Theory I fails • The distance matrix is no longer ultrametric

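  16. When distance is additive • Inferring an inner node: if leaves i and j are neighbors joined at inner node k, then for any other leaf m, d(k,m) = 1/2 [d(i,m) + d(j,m) − d(i,j)], where d denotes the additive distance [Figure: tree with leaves i, j, m and inner node k]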
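A small numeric check may help here. The sketch assumes the standard additive-tree relation d(k,m) = 1/2 [d(i,m) + d(j,m) − d(i,j)] for neighboring leaves i and j with parent k. The matrix is a hypothetical example built from a four-leaf tree with leaf edges 2, 1, 1, 2 and an inner edge of length 4; it is not data from the presentation.

```python
def inner_node_distance(d, i, j, m):
    """Distance from the parent k of neighboring leaves i, j to another leaf m."""
    return 0.5 * (d[i][m] + d[j][m] - d[i][j])


# Hypothetical additive distances: leaves 0 and 1 are neighbors, as are 2 and 3.
d = [
    [0, 3, 7, 8],
    [3, 0, 6, 7],
    [7, 6, 0, 3],
    [8, 7, 3, 0],
]

# Parent of leaves 0 and 1 to leaf 2: inner edge (4) + leaf edge of 2 (1).
print(inner_node_distance(d, 0, 1, 2))
# Parent of leaves 0 and 1 to leaf 3: inner edge (4) + leaf edge of 3 (2).
print(inner_node_distance(d, 0, 1, 3))
```

Both results match the path lengths read off the assumed tree, which is what additivity guarantees.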

  17. Neighbor Joining • Can we use this fact to construct trees? • Infer inner nodes • Gradually strip off leaves (outer nodes)

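  18. Finding Neighboring Leaves • Let D(i,j) = d(i,j) − (r(i) + r(j)), where r(i) = (1 / (|L| − 2)) Σ d(i,k), summed over all leaves k in the leaf set L, and d is the additive distance • Theorem: if D(i,j) is minimal (among all pairs of leaves), then i and j are neighbors in the tree [Figure: tree with leaves g, h, i, j]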
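The sketch below shows why a corrected distance is needed. It assumes the standard neighbor-joining definition D(i,j) = d(i,j) − (r(i) + r(j)) with r(i) = Σ d(i,k) / (|L| − 2). The matrix is a hypothetical example built from a tree in which leaves 0 and 1 are neighbors and leaves 2 and 3 are neighbors, with leaf edges 1, 4, 1, 4 and an inner edge of length 1.

```python
from itertools import combinations


def corrected_distances(d):
    """Neighbor-joining criterion D(i,j) = d(i,j) - (r(i) + r(j))."""
    n = len(d)
    r = [sum(row) / (n - 2) for row in d]  # average distance to the other leaves
    return {(i, j): d[i][j] - (r[i] + r[j])
            for i, j in combinations(range(n), 2)}


# Hypothetical additive distances (assumption, not data from the slides).
d = [
    [0, 5, 3, 6],
    [5, 0, 6, 9],
    [3, 6, 0, 5],
    [6, 9, 5, 0],
]

# The pair with the smallest raw distance is NOT a pair of neighbors here.
raw_closest = min(combinations(range(4), 2), key=lambda p: d[p[0]][p[1]])

# The pair with the smallest corrected distance is.
D = corrected_distances(d)
nj_pair = min(D, key=D.get)

print("closest by raw distance:", raw_closest)
print("closest by corrected distance:", nj_pair)
```

In this example the two short leaf edges make leaves 0 and 2 look close even though they sit on opposite sides of the inner edge; subtracting the average distances removes that effect.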

  19. Neighbor Joining • Initialization: set L to contain all leaves • Iteration: • Choose i, j such that D(i,j) is minimal (neighbors) • Create new node k, and set d(k,m) = 1/2 [d(i,m) + d(j,m) − d(i,j)] for all other m in L • Remove i, j from L, and add k • Termination: when |L| = 2, connect the two remaining nodes
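The iteration above can be sketched in a few lines of Python. This is an illustrative version under stated assumptions, not the presenter's implementation: it uses the standard corrected distance D(i,j) = d(i,j) − (r(i) + r(j)), the update d(k,m) = 1/2 [d(i,m) + d(j,m) − d(i,j)], and the usual edge-length rule d(i,k) = 1/2 [d(i,j) + r(i) − r(j)]. The function name, node labels and test matrix are hypothetical.

```python
from itertools import combinations


def neighbor_joining(d, names):
    """Return the edges (node, node, length) of an unrooted tree."""
    # Working distances as a dict of dicts keyed by node label.
    dist = {a: {b: float(d[i][j]) for j, b in enumerate(names)}
            for i, a in enumerate(names)}
    L = list(names)
    edges = []
    next_id = 0
    while len(L) > 2:
        n = len(L)
        r = {a: sum(dist[a][b] for b in L) / (n - 2) for a in L}
        # Choose the pair minimizing the corrected distance D(i,j).
        i, j = min(combinations(L, 2),
                   key=lambda p: dist[p[0]][p[1]] - r[p[0]] - r[p[1]])
        k = f"inner{next_id}"
        next_id += 1
        # Edge lengths from i and j to the new inner node k.
        d_ik = 0.5 * (dist[i][j] + r[i] - r[j])
        d_jk = dist[i][j] - d_ik
        edges += [(i, k, d_ik), (j, k, d_jk)]
        # Distances from k to every remaining node m.
        dist[k] = {k: 0.0}
        for m in L:
            if m not in (i, j):
                dist[k][m] = dist[m][k] = 0.5 * (
                    dist[i][m] + dist[j][m] - dist[i][j])
        L = [m for m in L if m not in (i, j)] + [k]
    a, b = L
    edges.append((a, b, dist[a][b]))  # connect the two remaining nodes
    return edges


# Hypothetical additive matrix: leaf edges A=1, B=4, C=1, D=4, inner edge 1.
d = [
    [0, 5, 3, 6],
    [5, 0, 6, 9],
    [3, 6, 0, 5],
    [6, 9, 5, 0],
]
tree = neighbor_joining(d, ["A", "B", "C", "D"])
for edge in tree:
    print(edge)
```

On this additive input the sketch recovers the leaf edges and the inner edge of the assumed tree. Each iteration scans all pairs, so the whole run is cubic in the number of leaves.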

  20. Neighbor Joining constructs the correct tree if the distances are additive
