140 likes | 294 Views
Analysis of Variance (ANOVA). When ANOVA is used. All the explanatory variables are categorical (factors) Each factor has two or more levels
E N D
When ANOVA is used.. • All the explanatory variables are categorical (factors) • Each factor has two or more levels • Example:You have 60 DNA samples from 3 plants: A, B, and C. And you measured DNA concentration once for each sample. Explanatory variable will be plants.
One-way ANOVA • A single factor with three or more levels. • Example: see previous example
Multi-way ANOVA • Two or more factors. • Example:You measured concentration of DNA samples from 3 plants (A, B, and C) that were grown in four different soils (K, M, N, P) - two-way ANOVA
Multi-way ANOVA • Null hypotheses: The results of a two-way anova include tests of two null hypotheses: that the means of observations grouped by one factor are the same; that the means of observations grouped by the other factor are the same; • Model=concentration~plant+soils
Factorial ANOVA • When there is replication at each combination of levels in a multi-way ANOVA. • Example: You measured 3 times concentration of DNA samples from 3 plants (A, B, and C) that were grown in four different soils (K, M, N, P) - two-way ANOVA with replication
Factorial ANOVA (cont.) • Null hypotheses: The results of a two-way anova with replication include tests of three null hypotheses: that the means of observations grouped by one factor are the same; that the means of observations grouped by the other factor are the same; and that there is no interaction between the two factors. The interaction test tells you whether the effects of one factor depend on the other factor. • Model=concentration~plant+soils+plant:soils
ANOVA ANOVA compares the mean values by comparing variances. It calculates the total variation (SSY) and partitioning it into two components: explained variation (SSA) and unexplained variation (SSE) SSY SSA SSE
Total variation • SSY- the total sum of squares is the sum of squares of the differences between the data points and the overall mean, n is number of samples per treatment, k is the number of treatments
Unexplained variation • SSE- error sum of squares is the sum of the squares of the differences between the data points and their individual treatments mean
Explained variation • SSA- treatment sum of squares is the sum of the squares of the differences between the individual treatment means and the overall mean • The amount of the variation explained by differences between the treatment means
Explained variation (cont.) • SSA=SSY-SSE • The larger difference between total variation and unexplained variation (SSY-SSE) the larger explained variation (SSA) • the greater the deference between treatment means
Before starting ANOVA 2. Test homogeneity of variance Is Fligner-Killeen test showing significant p-value? 1. Check for constancy of variance Is the variances differ by more than factor of 2?
Analysis of sample Assumptions • Independence of samples elements • Normality • Homogeneity • Sufficient sample sizes, equal sample sizes is the best