1 / 18

Combined Analysis of Experiments

Combined Analysis of Experiments. Basic Research Researcher makes hypothesis and conducts a single experiment to test it The hypothesis is modified and another experiment is conducted Combined analysis of experiments is seldom required Experiments may be repeated to

jenaya
Download Presentation

Combined Analysis of Experiments

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Combined Analysis of Experiments • Basic Research • Researcher makes hypothesis and conducts a single experiment to test it • The hypothesis is modified and another experiment is conducted • Combined analysis of experiments is seldom required • Experiments may be repeated to • Provide greater precision (increased replication) • Validate results from initial experiment • Applied Research • Recommendations to producers must be based on multiple locations and seasons that represent target environments (soil types, weather patterns)

  2. Multilocational trials • Often called MET = multi-environment trials • How do treatment effects change in response to differences in soil and weather throughout a region? • What is the range of responses that can be expected? • Detect and quantify interactions of treatments and locations and interactions of treatments and seasons in the recommendation domain • Combined estimates are valid only if locations are randomly chosen within target area • Experiments often carried out on experiment stations • Generally use sites that are most accessible or convenient • Can still analyze the data, but consider possible bias due to restricted site selection when making interpretations

  3. Preliminary Analysis • Complete ANOVA for each experiment • Do we have good data from each site? • Examine residual plots for validity of ANOVA assumptions, outliers • Examine experimental errors from different locations for heterogeneity • Perform F Max test or Levene’s test for homogeneity of variance • If homogeneous, perform a combined analysis across sites • If heterogeneous, may need to use a transformation or break sites into homogeneous groups and analyze separately • Differences in means across sites are often greater than treatment effects • Does not prevent a combined analysis, but may contribute to error heterogeneity if there are associations between means and variances

  4. MET Linear Model (for an RBD) Yijk =  + i + j(i) + k + ()ik + ijk  = mean effect i = ith location effect j(i) = jth block effect within the ith location k= kth treatment effect ik = interaction of the kth treatment in the ith location ijk = pooled error • Environments = Locations = Sites • Blocks are nested in locations • SS for blocks is pooled across locations

  5. Treatment x Environment Interaction • Obtain a preliminary estimate of interaction of treatment with environment or season • Will we be able to make general recommendations about the treatments or should they be specific for each site? • Error degrees of freedom are pooled across sites, so it is relatively easy to detect interactions • Consider the relative magnitude of variation due to the treatments compared to the interaction MS • Are there rank changes in treatments across environments (crossover interactions)?

  6. Source df SS MS Expected MS Location l-1 SSL M1 Blocks in Loc. l(r-1) SSB(L) M2 Treatment t-1 SST M3 Loc. X Treatment (l-1)(t-1) SSLT M4 Pooled Error l(r-1)(t-1) SSEM5 Treatments and locations are random F for Locations = (M1+M5)/(M2+M4) Satterthwaite’s approximate df N1’ = (M1+M5)2/[(M12/(l-1))+M52/(l)(r-1)(t-1)] N2’ = (M2+M4)2/[(M22/(l-1))+M42/(l)(r-1)(t-1)] F for Treatments = M3/M4 F for Loc. x Treatments = M4/M5

  7. Source df SS MS Expected MS Location l-1 SSL M1 Blocks in Loc. l(r-1) SSB(L) M2 Treatment t-1 SST M3 Loc. X Treatment (l-1)(t-1) SSLT M4 Pooled Error l(r-1)(t-1) SSE M5 Treatments and locations are fixed • Fixed Locations • constitute the entire population of environments • OR • represent specific environmental conditions (rainfall, elevation, etc.) F for Locations = M1/M2 F for Treatments = M3/M5 F for Loc. x Treatments = M4/M5

  8. Source df SS MS Expected MS Location l-1 SSL M1 Blocks in Loc. l(r-1) SSB(L) M2 Treatment t-1 SST M3 Loc. X Treatment (l-1)(t-1) SSLT M4 Pooled Error l(r-1)(t-1) SSE M5 Treatments are fixed, Locations are random F for Locations = M1/M2 F for Treatments = M3/M4 F for Loc. x Treatments = M4/M5 • SAS uses slightly different rules for determining Expected MS  No direct test for Locations for this model

  9. SAS Expected Mean Squares Varieties fixed, Locations random PROCGLM; Class Location Rep Variety; Model Yield = Location Rep(Location) Variety Location*Variety; Random Location Rep(Location) Location*Variety/Test; Source Type III Expected Mean Square Location Var(Error) + 3 Var(Location*Variety) + 7 Var(Rep(Location)) + 21 Var(Location) Dependent Variable: Yield Source DF Type III SS Mean Square F Value Pr > F Location 1 0.505125 0.505125 0.20 0.6745 Error 5.8098 15.027788 2.586644 Error: MS(Rep(Location)) + MS(Location*Variety) - MS(Error)

  10. Source df SS MS Expected MS Years l-1 SSY M1 Blocks in Years l(r-1) SSB(Y) M2 Treatment t-1 SST M3 Years X Treatment (l-1)(t-1) SSYT M4 Pooled Error l(r-1)(t-1) SSE M5 Treatments are fixed, Years are random F for Years = M1/M2 F for Treatments = M3/M4 F for Years x Treatments = M4/M5

  11. Locations and Years in the same trial • Can analyze as a factorial • Can determine the magnitude of the interactions between treatments and environments • TxY, TxL, TxYxL • For a simpler interpretation, consider all year and location combinations as “sites” and use one of the models presented for multilocational trials

  12. Source df SS MS Expected MS Trial l-1 SSL M1 Treatment t-1 SST M2 Trial x Treatment (l-1)(t-1) SSLT M3 Pooled Error lt(r-1) SSE M4 Combined Lab or Greenhouse Study (CRD) • Assume Treatments are fixed, Trials are random • A “trial” is a repetition of a replicated experiment • If there are no interactions, consider pooling SSLT and SSE • Use a conservative P value to pool (e.g. >0.25 or >0.5) F for Trials = M1/M4 (SAS would say M1/M3) F for Treatments = M2/M3 F for Trials x Treatments = M3/M4

  13. Preliminary ANOVA • Assumptions for this example: • locations and blocks are random • Treatments are fixed • If Loc. x Treatment interactions are significant, must be cautious in interpreting main effects combined across all locations Source df SS MS F Total lrt-1 SSTot Location l-1 SSL M1 M1/M2 Blocks in Loc. l(r-1) SSB(L) M2 Treatment t-1 SST M3 M3/M4 Loc. X Treatment (l-1)(t-1) SSLT M4 M4/M5 Pooled Error l(r-1)(t-1) SSE M5

  14. Genotype by Environment Interactions (GEI) • When the relative performance of varieties differs from one location or year to another… • how do you make selections? • how do you make recommendations to farmers?

  15. P = G + E + GE P is phenotype of an individualG is genotypeE is environment GE is the interaction 70-20-10 rule E: GE: G 20% of the observed variation among genotypes is due to interaction of genotype and environment Genotype x Environment Interactions (GEI) • How much does GEI contribute to variation among varieties or breeding lines? DeLacey et al., 1990 – summary of results from many crops and locations

  16. Stability • Many approaches for examining GEI have been suggested since the 1960’s • Characterization of GEI is closely related to the concept of stability. “Stability” has been interpreted in different ways. • Static – performance of a genotype does not change under different environmental conditions (relevant for disease resistance, quality factors) • Dynamic – genotype performance is affected by the environment, but its relative performance is consistent across environments. It responds to environmental factors in a predictable way.

  17. Measures of stability • CV of individual genotypes across locations • Regression of genotypes on an environmental index • Eberhart and Russell, 1966 • Ecovalence • Wricke, 1962 • Superiority measure of cultivars • Lin and Binns, 1988 • Many others…

  18. Analysis of GEI – other approaches • Rank sum index (nonparametric approach) • Cluster analysis • Factor analysis • Principal component analysis • AMMI • Pattern analysis • Analysis of crossovers • Partial Least Squares Regression • Factorial Regression

More Related