460 likes | 1.15k Views
Maximum Parsimony. Character-based vs Distance-based. Character-based Trees. NO explicit measure of distance Parsimony is widely used on character-based trees Trees constructed on the basis of change of characters (or traits)
E N D
Maximum Parsimony Character-based vs Distance-based
Character-based Trees • NO explicit measure of distance • Parsimony is widely used on character-based trees • Trees constructed on the basis of change of characters (or traits) • Explains the observed sequences with a minimum number of substitutions • Best for small sets of sequences with high similarity
Simple Example • Suppose we have five species, such that three have ‘C’ and two ‘T’ at a specified position • Minimal tree has one evolutionary change: C T C T C C C T T C
2 steps to Maximum Parsimony • Parsimony: for each possible tree topology, calculate parsimonious cost (involve filling in the inner nodes such that there is minimum # substitutions) • Maximum: pick the tree whose cost is the least
W W W X Y Z Y X X Z Z Y Possible Trees Sequence W:A C G C GT TG G GSequence X:A C G C GT TG G GSequence Y:A C G C A ATGA ASequence Z:A C A C A G G GA A
T T A G T A T G T G T A Sequence W:A C G C GT TG G GSequence X:A C G C GT TG G GSequence Y:A C G C A ATGA ASequence Z:A C A C A G G GA A
T T T T T T T T A A A A G G G G T T A G T T A T C A G A G T A G Some Possible Evolutionary Paths
T T A G ATGC ATGC ATGC All Possible Evolutionary Paths # of Possible Paths / OTU / Position: (Number of States)(Number of Nodes) = (Number of States)(Number of OTU -1) = 43 = 64
Step1. Given a Tree • How do we compute the Parsimony score? 1 for substitution, 0 no. • Weighted Parsimony • Each change of character a to b is weighted by the score c(a,b)
T T A G ATGC ATGC ATGC Calculate Parsimony Scores • From leaves to the root S(r, X) = cost of whole tree. r: root S(i, X) = cost of tree rooted at node iif igets residue X
Calculate Pars. Score Iteration: • if node k has children i and j, then S(k,X) = minY1(S(i,Y1)+c(X,Y1)) + minY2(S(j,Y2)+c(X,Y2)) Termination: • cost of tree is minxS(r,X) where r is the root
Calculate Parsimony Scores Initialization: • For each outer leaf i, for all X, • If X is given by the sequence, S(i,X) = 0 only possibility • Otherwise, S(i,X) = impossible
T T A G ATGC ATGC ATGC
Evaluate Parsimony Score for The Whole Sequence • Score is evaluated at each position independently. • Then scores are summed over all positions.
Step 2. Pick the Tree • With the lowest total parsimony score
A Worked Example 1 2 3 4 5 6 7 8 9 10 Species 1 - A G G G T A A C T G Species 2 - A C G A T T A T T A Species 3 - A T A A T T G T C T Species 4 - A A T G T T G T C G How many possible unrooted trees? (tree topologies)
How Many Possible Trees? 1 2 3 4 5 6 7 8 9 10 Species 1 - A G G G T A A C T G Species 2 - A C G A T T A T T A Species 3 - A T A A T T G T C T Species 4 - A A T G T T G T C G
0 0 0 Compute Pars. Score for Each 1 2 3 4 5 6 7 8 9 10 1 - A G G G T A A C T G 2 - A C G A T T A T T A 3 - A T A A T T G T C T 4 - A A T G T T G T C G
G T C A G C T A G T A C Calculate Parsimony Score 1 3 3 4 1 - G 2 - C 3 - T 4 - A 2 4 3 3
0 3 0 3 0 3 Maximum Parsimony 1 2 3 4 5 6 7 8 9 10 1 - A G G G T A A C T G 2 - A C G A T T A T T A 3 - A T A A T T G T C T 4 - A A T G T T G T C G
0 3 2 0 3 2 0 3 2 Maximum Parsimony 1 2 3 4 5 6 7 8 9 10 1 - A G G G T A A C T G 2 - A C G A T T A T T A 3 - A T A A T T G T C T 4 - A A T G T T G T C C
G A 2 A G G A 2 A G 1 Maximum Parsimony 4 1 - G 2 - A 3 - A 4 - G A G A G
0 3 2 2 0 3 2 2 0 3 2 1 Maximum Parsimony 1 2 3 4 5 6 7 8 9 10 1 - A G G G T A A C T G 2 - A C G A T T A T T A 3 - A T A A T T G T C T 4 - A A T G T T G T C G
0 3 2 2 0 1 1 1 1 2 13 0 3 2 2 0 1 2 1 2 2 15 0 3 2 1 0 1 2 1 2 2 14 Maximum Parsimony
Pro and Con • Guaranteed to find the most parsimonious tree • Misleading when rates of mutation in the different branches differ
Searching for the Optimal Tree • Exhaustive Search • Very intensive • Branch and Bound • A compromise • Heuristic • Fast • Usually starts with NJ
How to evaluate confidence/uncertainty of a tree? Bootstrap methods