1 / 32

Genetic evaluation under parental uncertainty

Genetic evaluation under parental uncertainty. Robert J. Tempelman Michigan State University, East Lansing, MI National Animal Breeding Seminar Series December 6, 2004. Key papers from our lab:.

marjean
Download Presentation

Genetic evaluation under parental uncertainty

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Genetic evaluation under parental uncertainty Robert J. Tempelman Michigan State University, East Lansing, MI National Animal Breeding Seminar Series December 6, 2004.

  2. Key papers from our lab: • Cardoso, F.F., and R.J. Tempelman. 2003. Bayesian inference on genetic merit under uncertain paternity. Genetics, Selection, Evolution 35:469-487. • Cardoso, F.F., and R.J. Tempelman. 2004. Genetic evaluation of beef cattle accounting for uncertain paternity. Livestock Production Science 89: 109-120.

  3. Multiple sires – The situation • Cows are mated with a group of bulls under pasture conditions • Common in large beef cattle populations raised on extensive pasture conditions • Accounts for up to 50% of calves in some herds under genetic evaluation in Brazil (~25-30% on average) • Multiple sires group sizes range from 2 to 12+ (Breeding cows group size range from 50 to 300+) • Common in commercial U.S. herds. • Potential bottleneck for genetic evaluations beyond the seedstock level (Pollak, 2003).

  4. ? ? Who is the sire? Multiple sires – The situation x x

  5. The tabular method for computing genetic relationships • Recall basis tabular method for computing the numerator relationship matrix: • Henderson, C.R. 1976. A simple method for computing the inverse of a numerator relationship matrix used in prediction of breeding values. Biometrics 32:69. • A = {aij} where aijis the genetic relationship between animals i and j. Let parents of j be sjand dj.

  6. The average numerator relationship matrix (ANRM) • Henderson, C.R. 1988. Use of an average numerator relationship matrix for multiple-sire joining. Journal of Animal Science 66:1614-1621. • aijis the genetic relationship between animals i and j. Suppose dam of j be known to be dj whereas there are vj different candidate sires (s1,s2,…svj) with probabilities (p1,p2,…pvj) of being the true sire:

  7. Pedigree file example from Henderson (1988) 0 = unknown Could be determined using genetic markers

  8. Numerator relationship matrix: Rest provided in Henderson, 1988 symmetric Note if true sire of 7 is 3, a77 = 1.25; otherwise a77 = 1.1875

  9. How about inferring upon what might be the correct sire? • Empirical Bayes Strategy: • Foulley, J.L., D. Gianola, and D. Planchenault. 1987. Sire evaluation with uncertain paternity. Genetics, Selection, Evolution. 19: 83-102. • Sire model implementation.

  10. Simple sire model y =Xb+ Zs + e

  11. One possibility: Substitute sire probabilities for elements of Z.

  12. Strategy of Foulley et al. (1987) : Posterior probabilities using provided sire probabilities as “prior” probabilities and y to estimate elements of Z. - computed iteratively Limitation: Can only be used for sire models.

  13. Inferring upon elements of design matrix • Where else is this method currently used? • Segregation analysis • Estimating allelic frequencies and genotypic effects for a biallelic locus WITHOUT molecular marker information. • Prior probabilities based on HW equilibrium for base population. • Posterior probabilities based on data. • Reference: Janss, L.L.G., R. Thompson., J.A.M. Van Arendonk. 1995. Application of Gibbs sampling for inference in a mixed major gene-polygenic inheritance model in animal populations. Theoretical and Applied Genetics 91: 1137-1147.

  14. Another strategy (most commonly used) • Use phantom groups (Westell et al., 1988; Quaas et al., 1988). Used commonly in genetic evaluation systems having incomplete ancestral pedigrees in order to mitigate bias due to genetic trend. • Limitations (applied to multiple sires): • Assumes the number of candidate sires is effectively infinite within a group. • None of the phantom parents are related. • Potential confounding problems for small groups (Quaas, 1988).

  15. The ineffectiveness of phantom grouping for genetic evaluations in multiple sire pastures: • Perez-Enciso, M. and R.L. Fernando. 1992. Genetic evaluation with uncertain parentage: A comparison of methods. Theoretical and Applied Genetics 84:173-179. • Sullivan, P.G. 1995. Alternatives for genetic evaluation with uncertain paternity. Canadian Journal of Animal Science 75:31-36. • Greater selection response using Henderson’s ANRM relative to phantom grouping (simulation studies). • Excluding animals with uncertain paternity reduces expected selection response by as much as 37%.

  16. Uncertain paternity - objectives • To propose a hierarchical Bayes animal model for genetic evaluation of individuals having uncertain paternity • To estimate posterior probabilities of each bull in the group being the correct sire of the individual • To compare the proposed method with Henderson’s ANRM via • Simulation study • Application to Hereford PWG and WW data.

  17. Non-genetic effects - b(Contemporary groups, age of dam, age of calf, gender) Residual terms - e (assumed to be normal) Animal genetic values – a Uncertain paternity -hierarchical Bayes model 1st stage Data - y (Performance records) y = Xb + Za + e; e ~N (0,Ise2)

  18. Uncertain paternity -hierarchical Bayes model 2nd stage Residual Variance Non-genetic effects Animal genetic values b ~N (bo,Vb) a|s ~N (0,Assa2) se2 ~ se2cn-2 (Co)variances based on relationship (A), sire assignments (s) and genetic variance (sa2) Prior means based on literature information Variance based on the reliability of prior information Prior knowledge based on literature information

  19. Uncertain paternity -hierarchical Bayes model 3rd stage sire assignments genetic variance sa2 ~ sa2cn(a)-2 Probability for sire assignments (pj) Prior knowledge based on literature information Could be based on marker data.

  20. Uncertain paternity -hierarchical Bayes model 4th stage Specifying uncertainty for probability of sire assignments Dirichlet prior e.g. How sure are you about the prior probabilities of 0.6 and 0.4 for Sires 3 and 5, respectively, being the correct sire? Assessment based on how much you trust the genotype based probabilities. Could also model genotyping error rates explicitly (Rosa, G.J.M, Yandell, B.S., Gianola, D. A Bayesian approach for constructing genetics maps when markers are miscoded. Genetics, Selection, Evolution 34:353-369)

  21. Uncertain paternity -joint posterior density Data 2nd stage 1st stage Residual error Non-genetic fixed effects Genetic effects Prior knowledge based on literature information Prior means (literature information)Variance (reliability of priors) (Co)variances(relationship, sire assignments and genetic variances) 3rd stage Prior probability for sire assignments Prior knowledge based on literature information 4th stage Reliability of priors Markov chain Monte Carlo (MCMC)

  22. Generation 0 Base population Selection (20 sires & 100 dams) Breeding population Selection (5 sires& 25 dams) Selection (5 sires & 25 dams) Random mating (inbreeding avoided) 1 Offspring (500 animals) Selection (15 sires & 75 dams) Selection (15 sires& 75 dams) Breeding population Breeding population Random mating (inbreeding avoided) 2 Offspring (500 animals) . . . . . . 5 Offspring (360 animals) Simulation Study (Cardoso and Tempelman, 2003) Totals: 80 sires, 400 dams, 2000 non-parents.

  23. Offspring Random Assignment to Paternity Condition .7 .3 Certain Uncertain Assignment to Multiple Sire Groups .2 .3 .2 .1 .1 .1 2 3 4 6 8 10 Within the assigned group one of the sire is picked to be the true sire (with equal or unequal probabilities) Sire Record: Paternity assignment Sires averaged 23.6 progeny, Dams averaged 5.9 progeny

  24. Simulated traits: • Ten datasets generated from each of two different types of traits: • Trait 1 (WW): • Trait 2 (PWG): Naïve prior assignments: i.e. equal prior probabilities to each candidate sire (i.e. no information based on genetic markers available)

  25. Posterior probabilities of sire assignments being equal to true sires

  26. Rank correlation of predicted genetic effects ANRM = Henderson’s ANRM HIER = proposed model TRUE = all sires known Sidenote: Model fit criteria was clearly in favor of HIER over ANRM

  27. Uncertain paternity -application to field data • Data set • 3,402 post-weaning gain records on Hereford calves raised in southern Brazil (from 1991-1999) • 4,703 animals • Paternity (57% certain; 15% uncertain & 28% unknown-base animals) • Group sizes 2, 3, 4, 5, 6, 10, 12 & 17 • Methods • ANRM (average relationship) • HIER (uncertain paternity hierarchical Bayes model)

  28. Posterior inference for PWG genetic parameters under ANRM versus HIER models

  29. Uncertain paternity -Results summary • Model choice criteria (DIC and PBF) decisively favored HIER over ANRM • Very high rank correlations between genetic evaluations using ANRM versus HIER • Some non-trivial differences on posterior means of additive genetic value for some animals

  30. Uncertain paternity -assessment of accuracy (PWG) Standard deviation of additive genetic effects i.e. accuracies are generally slightly overstated with Henderson’s ANRM Sire with 9 progeny Sire with 50 progeny

  31. Conclusions • Uncertain paternity modeling complements genetic marker information (as priors) • Reliability on prior information can be expressed (via Dirichlet). • Little advantage over the use of Henderson’s ANRM. • However, accuracies of EPD’s overstated using ANRM. • Power of inference may improve with better statistical assumptions (i.e. heterogeneous residual variances)

  32. Implementation issues • Likely require a non-MCMC approach to providing genetic evaluations. • Some hybrid with phantom grouping may be likely needed. • Candidate sires are not simply known for some animals. • Bob Weaber’s talk.

More Related