
Likelihood methods

Likelihood methods: Trees. “What is the probability that a proposed model of sequence evolution and a particular tree would give rise to the observed data?” “What tree and model would maximize the probability of observing the observed data?” P(data | tree, model).

arden

Presentation Transcript


  1. Likelihood methods: Trees. “What is the probability that a proposed model of sequence evolution and a particular tree would give rise to the observed data?” “What tree and model would maximize the probability of observing the observed data?”

  2. P(data | tree, model). In practice, the data are “given,” the tree is a hypothesis, and the model of the evolutionary process is usually unknown, but with parameters either “given” based on external knowledge or estimated from the data set. Therefore, we search for the hypothesis (tree) that gives the highest probability of obtaining the observed data.

  3. Potential Benefits of Likelihood
     • Improved compensation for superimposed changes using explicit models
     • Method is consistent
     • Usually minimizes variance of model parameters
     • Often robust to violations of assumptions
     • Estimation and testing of evolutionary models and hypotheses are a natural outcome

  4. Likelihood of a tree

  5. Likelihood of a tree II. [Figure labels: “Fixed”; “Tree-dependent”.] 4 bases x 4 bases = 16 possible combinations. Some are much more probable than others.

  6. Likelihood of a tree III. If we can assume that nucleotide sites evolve independently, the likelihood of the full tree is the product of the likelihoods at each site. Because these values are vanishingly small, we usually log-transform them, so the log likelihood of the tree is the sum of the log likelihoods of the sites.

  7. E.g.,
     if L(tree1) = 0.0000002, ln L = -15.4
     if L(tree2) = 0.0000004, ln L = -14.7
     if L(tree3) = 0.0000008, ln L = -14.0
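The log transform above can be checked with a few lines of Python. This is only a sketch: the three tree likelihoods are the ones quoted on the slide, while `tree_log_likelihood` and the 200 per-site values of 0.001 are hypothetical, chosen to show why the raw product is impractical.

```python
import math

def tree_log_likelihood(site_likelihoods):
    """Log likelihood of a tree: the sum of per-site log likelihoods.

    Assumes sites evolve independently, as stated on the slide.
    """
    return sum(math.log(p) for p in site_likelihoods)

# Tree likelihoods quoted on the slide, with their natural logs.
slide_values = {"tree1": 2e-7, "tree2": 4e-7, "tree3": 8e-7}
slide_logs = {name: round(math.log(L), 1) for name, L in slide_values.items()}
print(slide_logs)

# Hypothetical alignment: 200 sites, each with likelihood 0.001.
site_likelihoods = [1e-3] * 200
raw_product = math.prod(site_likelihoods)       # too small for a float
log_l = tree_log_likelihood(site_likelihoods)   # stays well-behaved
print(raw_product, log_l)
```

The raw product underflows in double precision, whereas the summed log likelihood remains an ordinary number that can be compared across trees.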

  8. Likelihood of a tree IV. The likelihood is built up as a product of terms:
     0. Prior probability of an “A”
     1. × P(retaining A)
     2. × P(A to C)
     3. × P(A to C)
     4. × P(retaining A)
     5. × P(A to G)
     Probabilities are a function of: substitution model, base frequencies, branch lengths.

  9. Calculation of probability of substitution or retention. Probabilities are a function of: substitution model, base frequencies, branch lengths.
     * See example in Mount, p. 277
     * Formal analysis uses the model (JC, HKY, etc.) to generate explicit probabilities

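  10. E.g., probability of a substitution. [Figure: tree with labels a, b, c, d, e, f and state C.] Under Jukes-Cantor: $P_{C} = (1 + 3e^{-4\alpha t})/4$ and $P_{\text{not}C} = \tfrac{3}{4}(1 - e^{-4\alpha t})$, where $\alpha$ is the rate of change to each alternative base and $t$ is the time along the branch.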
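A minimal sketch of the two Jukes-Cantor expressions on this slide. It assumes the exponent reads -4αt, with `alpha` the rate of change to each alternative base and `t` the time along the branch; the function name and the numeric inputs are hypothetical.

```python
import math

def jc_probabilities(alpha, t):
    """Return (P_same, P_different) under Jukes-Cantor.

    P_same      = (1 + 3*exp(-4*alpha*t)) / 4   probability the base is retained
    P_different = 3/4 * (1 - exp(-4*alpha*t))   probability of ending in any other base
    The exponent -4*alpha*t is an assumed reading of the slide.
    """
    decay = math.exp(-4.0 * alpha * t)
    p_same = (1.0 + 3.0 * decay) / 4.0
    p_different = 0.75 * (1.0 - decay)
    return p_same, p_different

# Hypothetical rate and time, for illustration only.
p_c, p_not_c = jc_probabilities(alpha=0.1, t=1.0)
print(p_c, p_not_c)
```

The two probabilities always sum to 1. At t = 0 the base is retained with certainty, and for very long branches the values approach 1/4 and 3/4, the point at which the starting base carries no information.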

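  11. Conditional likelihood at an internal node A with descendants B and C: $L_A^{(j)}(i) = \left[\sum_k P_{ik}(t_B)\, L_B^{(j)}(k)\right] \left[\sum_k P_{ik}(t_C)\, L_C^{(j)}(k)\right]$. Here $L_A^{(j)}(i)$ is the likelihood of state i at position j in A. The first bracket is the likelihood that i could give rise to the state in B: $P_{ik}(t_B)$ is the probability of state i changing to state k, $t_B$ is the branch length, and $L_B^{(j)}(k)$ is the likelihood that B has state k. The second bracket is the similar term for the outcome in C. I.e., the conditional likelihood that A has state i is the product of the likelihoods that i could have given rise to the outcomes in B and C.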
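The conditional-likelihood calculation described on this slide can be sketched for a single site and a single internal node A with two tips, B and C. Several things here are assumptions rather than slide content: the Jukes-Cantor transition probabilities (in the -4αt form), the equal base frequencies of 1/4, the helper names, and the branch lengths.

```python
import math

BASES = "ACGT"

def jc_matrix(alpha, t):
    """4x4 Jukes-Cantor transition probabilities P[i][k] over time t (assumed model)."""
    decay = math.exp(-4.0 * alpha * t)
    same = (1.0 + 3.0 * decay) / 4.0
    diff = (1.0 - decay) / 4.0   # probability of one specific other base
    return [[same if i == k else diff for k in range(4)] for i in range(4)]

def tip_likelihoods(base):
    """A tip with an observed base has likelihood 1 for that state, 0 otherwise."""
    return [1.0 if b == base else 0.0 for b in BASES]

def conditional_likelihoods(l_b, t_b, l_c, t_c, alpha=1.0):
    """Likelihood of each state i at node A, given descendants B and C."""
    p_b = jc_matrix(alpha, t_b)
    p_c = jc_matrix(alpha, t_c)
    result = []
    for i in range(4):
        # Likelihood that state i gives rise to what is seen below B, and below C.
        to_b = sum(p_b[i][k] * l_b[k] for k in range(4))
        to_c = sum(p_c[i][k] * l_c[k] for k in range(4))
        result.append(to_b * to_c)
    return result

# Hypothetical site: tip B shows A, tip C shows C, both branches of length 0.1.
l_a = conditional_likelihoods(tip_likelihoods("A"), 0.1, tip_likelihoods("C"), 0.1)
# Site likelihood: weight each state at A by its prior frequency (1/4 here).
site_likelihood = sum(0.25 * value for value in l_a)
print(dict(zip(BASES, l_a)), site_likelihood)
```

On a larger tree the same function would be applied from the tips toward the root, with the output for one node serving as `l_b` or `l_c` for its parent.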

  12. Likelihood Ratio test
     • $\Lambda = \dfrac{\max[L(\text{null hypothesis} \mid \text{data})]}{\max[L(\text{alternative hypothesis} \mid \text{data})]}$
     • Huelsenbeck et al (1997) Science. 276:227
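In practice the ratio on this slide is usually evaluated on the log scale. The sketch below computes the statistic 2(ln L_alt - ln L_null), which is -2 times the log of the ratio, and compares it with a chi-square distribution with one degree of freedom. That comparison is an assumption not stated on the slide: it is the usual approximation when the null model is nested within the alternative and differs by one free parameter. The log likelihoods are hypothetical.

```python
import math

def likelihood_ratio_statistic(lnl_null, lnl_alt):
    """Return 2 * (ln L_alt - ln L_null), i.e. -2 * ln(ratio)."""
    return 2.0 * (lnl_alt - lnl_null)

def chi2_sf_df1(x):
    """Upper-tail probability of a chi-square variable with 1 degree of freedom."""
    if x < 0:
        raise ValueError("statistic must be non-negative for nested models")
    return math.erfc(math.sqrt(x / 2.0))

# Hypothetical maximized log likelihoods for a nested pair of models.
stat = likelihood_ratio_statistic(lnl_null=-1405.2, lnl_alt=-1400.1)
p_value = chi2_sf_df1(stat)
print(stat, p_value)
```

A small p-value suggests the additional parameter in the alternative model improves the fit by more than chance alone would explain. For non-nested hypotheses the chi-square approximation does not apply and the null distribution is typically obtained by simulation instead.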

  13. Potential Benefits of Likelihood
     • Improved compensation for superimposed changes using explicit models
     • Method is consistent
     • Usually minimizes variance of model parameters
     • Often robust to violations of assumptions
     • Estimation and testing of evolutionary models and hypotheses are a natural outcome
     **** Effective likelihood analysis requires a large dataset, and full ML analysis is computationally intensive.

  14. Likelihood of a tree - review

