210 likes | 369 Views
Software for Incorporating Marker Data in Genetic Evaluations. Kathy Hanford U.S. Meat Animal Research Center Agricultural Research Service U.S. Department of Agriculture. Outline. Introduction Mixed Models Incorporating Random QTL Effects Current/Future Modification to MTDFREMLQ
E N D
Software for Incorporating Marker Data in Genetic Evaluations Kathy Hanford U.S. Meat Animal Research Center Agricultural Research Service U.S. Department of Agriculture
Outline • Introduction • Mixed Models Incorporating Random QTL Effects • Current/Future Modification to MTDFREMLQ • Practical Limitations of MTDFREMLQ • Applications
Introduction • Genetic evaluation • genetic improvement of quantitative traits through selection • currently use polygenic model • genes at many loci each with a small effect • measure the cumulative effect • analysis with mixed models –software available • add genomic information
Introduction • Two phases in application of genomic data to livestock improvement 1) Statistical analysis of genomic information to determine the potential importance of that information (i.e. use of genetic markers to quantify the effects of QTL on traits of economic importance) 2) Include marker information in the genetic evaluation of potential parents to determine which will have the best progeny (Marker Assisted Selection)
Introduction • QTL Identification • methods needed for outbred populations • daughter and granddaughter designs • many half-sib families with QTL effects being estimated for each half-sib family • Fernando and Grossman • works with the outbred population as a whole, using both pedigree and marker information. Need complete marker data • other methods • such as MCMC, primarily been used only in simulations
Mixed Model Incorporating Random QTL Effects v2nx1 vector of QTL alleleic effects (ai=vp i+vm i+u i, v i=[vp i,vm i]’ )
Numerator Relationship Matrix (A) • The probability that alleles are IBD • Probability between two half sibs is .25 • Need the inverse of A • depends on pedigree information • Computed directly (Henderson, 1976) • Relatively few nonzero elements (sparse)
Gametic (QTL) Correlation Matrix (M) • The probability that alleles are IBD • Need the inverse of M • depends on pedigree information • depends on probabilities QTL alleles are IBD • Computed directly if complete marker information (Abdel-Azim and Freeman, 2001)
Practical Issues in Calculating the QTL Correlation Matrix • Outbred population • Sparse marker information • Individuals with missing or incomplete marker data • Some of which will be incorrect • Large complex pedigrees (inbreeding and loops)
Complex Pedigree A B A1A2 ?? D C A1A2 • Software • MCMC • LOKI • DET • Pong-Wong,et al. • Allelic Peeling • GenoProb E A1A2
Size Considerations • Each additional QTL increases the number of equations by 2 times the number of animals in the pedigree • Sparse matrix storage • Only store nonzero elements • Polygenic (A-1) grows by 4 times the number of animals • Gametic (M-1) grows by 15 times the number of animals
MTDFREML • Multiple Trait Derivative-free Restricted (or residual) Maximum Likelihood • A set of programs to obtain estimates of variances and covariances • USDA/ARS – Dale Van Vleck • Keith Boldman, Lisa Kriese, Curt Van Tassell, Steve Kachman, Joerg Dodenhoff
MTDFREML • MTDFNRM – Calculate and output the inverse of the numerator relationship matrix • MTDFPREP – Set up the model for the analysis • MTDFRUN – Run the analysis using the files produced by MTDFNRM and MTDFPREP to obtain (co)variance estimates and breeding values
Current Modifications to MTDFREML to Incorporate QTL effects (MTDFREMLQ) • MTDFNRMQ – modified to calculate inverse of QTL correlation matrix (M-1) from IBD probability file (produced by Genoprob, Loki, etc) • Non-inbred pedigree when marker data are incomplete • Inbred pedigree when marker data are complete • Genetic groups arising from different populations with different prior selection
Current Modifications to MTDFREMLQ (cont.) • MTDFPRPQ – modified to include multiple QTL in the model (validated for single QTL) • Multiple trait • Gametic imprinting (coded, not validated)
Current Modifications to MTDFREMLQ (cont.) • MTDFRUNQ – modified to include M-1 and associated between trait (co)variances for each QTL (V-1) • Assumes independence between two QTLs
Further Modifications to MTDFREMLQ • Include inbred pedigree when marker data are incomplete (approximate M-1) • Calculate standard errors for the parameters using the delta method • Currently in MTDFREML • In the testing/debugging stage • appears to work for single-trait, single-QTL and two-trait, single-QTL cases. • still need to test for multiple-QTL
Practical Limitations of MTDFREMLQ • Memory Limitations/animal/traits/qtl Time Limitations
Applications • QTL detection • Find and utilize QTL in a breed and include that information in national genetic evaluation. • Marker Assisted Selection • Experimental herds • The twinning herd at MARC • Currently producing about 50% twin calving compared to a normal range of 1-3% • ~6000 in genetic evaluation, marker data from 1994 on over 3000 animals in regions of 3 QTLs