Chapter

Chapter 13 Comparing Three or More Means

Section 13.1 Comparing Three or More Means (One-Way Analysis of Variance)

Objectives • Verify the requirements to perform a one-way ANOVA • Test a hypothesis regarding three or more means using one-way ANOVA

Analysis of Variance (ANOVA) is an inferential method used to test the equality of three or more population means.

CAUTION! Do not test H0: 1=2=3 by conducting three separate hypothesis tests, because the probability of making a Type I error will be much higher than .

Objective 1 • Verify the requirements to perform a one-way ANOVA

Requirements of a One-Way ANOVA Test • There are k simple random samples; one from each of k populations. • The k samples are independent of each other; that is, the subjects in one group cannot be related in any way to subjects in a second group. • The populations are normally distributed. • The populations have the same variance; that is, each treatment group has the population variance 2.

Verifying the Requirement of Equal Population Variance The one-way ANOVA procedures may be used provided that the largest sample standard deviation is no more than twice the smallest sample standard deviation.

Parallel Example 1: Verifying the Requirements of ANOVA The following data represent the weight (in grams) of pennies minted at the Denver mint in 1990,1995, and 2000. Verify that the requirements in order to perform a one-way ANOVA are satisfied.

1990 1995 2000 2.50 2.52 2.50 2.50 2.54 2.48 2.49 2.50 2.49 2.53 2.48 2.50 2.46 2.52 2.48 2.50 2.50 2.52 2.47 2.49 2.51 2.53 2.53 2.49 2.51 2.48 2.51 2.49 2.55 2.50 2.48 2.49 2.52

Solution • The 3 samples are simple random samples. • The samples were obtained independently. • Normal probability plots for the 3 years follow. All of the plots are roughly linear so the normality assumption is satisfied.

Solution 4. The sample standard deviations are computed for each sample using Minitab and shown on the following slide. The largest standard deviation is not more than twice the smallest standard deviation (2*0.0141 =0.0282 > 0.02430) so the requirement of equal population variances is considered satisfied.

Descriptive Statistics Variable N Mean Median TrMean StDev SE Mean 1990 11 2.4964 2.5000 2.4967 0.0220 0.0066 1995 11 2.5091 2.5000 2.5078 0.0243 0.0073 2000 11 2.5000 2.5000 2.5000 0.0141 0.0043 Variable Minimum Maximum Q1 Q3 1990 2.4600 2.5300 2.4800 2.5100 1995 2.4800 2.5500 2.4900 2.5300 2000 2.4800 2.5200 2.4900 2.5100

Objective 2 • Test a Hypothesis Regarding Three or More Means Using One-Way ANOVA

The basic idea in one-way ANOVA is to determine if the sample data could come from populations with the same mean, , or if the sample evidence suggests that at least one sample comes from a population whose mean is different from the others. To make this decision, we compare the variability among the sample means to the variability within each sample.

We call the variability among the sample means the between-sample variability, and the variability of each sample the within-sample variability. If the between-sample variability is large relative to the within-sample variability, we have evidence to suggest that the samples come from populations with different means.

ANOVA F-Test Statistic The analysis of variance F-test statistic is given by

Computing the F-Test Statistic Step 1: Compute the sample mean of the combined data set by adding up all the observations and dividing by the number of observations. Call this value . Step 2: Find the sample mean for each sample (or treatment). Let represent the sample mean of sample 1, represent the sample mean of sample 2, and so on. Step 3: Find the sample variance for each sample (or treatment). Let represent the sample variance for sample 1, represent the sample variance for sample 2, and so on.

Computing the F-Test Statistic Step 4: Compute the sum of squares due to treatments, SST, and the sum of squares due to error, SSE. Step 5: Divide each sum of squares by its corresponding degrees of freedom (k-1 and n-k, respectively) to obtain the mean squares MST and MSE. Step 6: Compute the F-test statistic:

Parallel Example 2: Computing the F-Test Statistic Compute the F-test statistic for the penny data.

Solution Step 1: Compute the overall mean. Step 2: Find the sample variance for each treatment (year).

Solution Step 3: Find the sample variance for each treatment (year).

Solution Step 4: Compute the sum of squares due to treatment, SST, and the sum of squares due to error, SSE. SST =11(2.4964-2.5018)2 + 11(2.5091-2.5018)2 + 11(2.5-2.5018)2= 0.0009 SSE =(11-1)(0.0005) + (11-1)(0.0006) + (11-1)(0.0002) = 0.013

Solution Step 5: Compute the mean square due to treatment, MST, and the mean square due to error, MSE.

Solution Step 6: Compute the F-statistic.

Solution ANOVA Table:

Decision Rule in the One-Way ANOVA Test If the P-value is less than the level of significance, , reject the null hypothesis.

Section 13.2 Post Hoc Tests on One-Way Analysis of Variance

Objectives • Perform the Tukey Test

When the results from a one-way ANOVA lead us to conclude that at least one population mean is different from the others, we can make additional comparisons between the means to determine which means differ significantly. The procedures for making these comparisons are called multiple comparison methods.

Objective 1 • Perform the Tukey Test

The computation of the test statistic for Tukey’s test for comparing pairs of means follows the same logic as the test for comparing two means from independent sampling. However, the standard error that is used is where s2 is the mean square error estimate (MSE) of 2 from the one-way ANOVA, n1 is the sample size from population 1, and n2 is the sample size from population 2.

Test Statistic for Tukey’s Test The test statistic for Tukey’s test when testing H0: 1= 2 versus H1: 1≠ 2 is given by where s2is the mean square estimate of 2 (MSE) from ANOVA n1 is the sample size from population 1 n2 is the sample size from population 2

Critical Value for Tukey’s Test The critical value for Tukey’s test using a familywise error rate  is given by where  is the degrees of freedom due to error (n-k) k is the total number of means being compared

Parallel Example 1: Finding the Critical Value from the Studentized Range Distribution • Find the critical value from the Studentized range distribution with v = 13 degrees of freedom and k=4 degrees of freedom with a familywise error rate =0.01. • Find the critical value from the Studentized range distribution with v = 64 degrees of freedom and k=6 degrees of freedom with a familywise error rate =0.05.

Solution

Tukey’s Test After rejecting the null hypothesis H0: 1= 2= ··· = k the following steps can be used to compare pairs of means for significant differences, provided that • There are k simple random samples from k populations. • The k samples are independent of each other. • The populations are normally distributed. • The populations have the same variance. Step 1:Arrange the sample means in ascending order.

Tukey’s Test Step 2: Compute the pairwise differences where . Step 3: Compute the test statistic, for each pairwise difference.

Tukey’s Test Step 4: Determine the critical value, , where  is the level of significance (the familywise error rate). Step 5: If , reject the null hypothesis that H0: 1= 2 and conclude that the means are significantly different. Step 6: Compare all pairwise differences to identify which means differ.

Parallel Example 2: Performing Tukey’s Test Suppose that there is sufficient evidence to reject H0: 1= 2= 3 using a one-way ANOVA. The mean square error from ANOVA is found to be 28.7. The sample means are , and , with n1= n2= n3=9. Use Tukey’s test to determine which pairwise means are significantly different using a familywise error rate of  = 0.05.

Solution Step 1: The means, in ascending order, are Step 2: We next compute the pairwise differences for each pair, subtracting the smaller sample mean from the larger sample mean:

Solution Step 3: Compute the test statistic q0 for each pairwise difference. 2-1: 2-3:

Solution 3-1: Step 4: Find the critical value using an  = 0.05 familywise error rate with  = n-k =27-3=24 and k=3. Then q0.05,24,3=3.532.

Solution Step 5: Since 6.22 and 4.42 are greater than 3.532, but 1.79 is less than 3.532, we reject H0: 1=2 and H0: 2=3 but not H0: 1=3. Step 6: The conclusions of Tukey’s test are

Section 13.3 The Randomized Complete Block Design

Objectives • Conduct analysis of variance on the randomized complete block design • Perform the Tukey test

In the completely randomized design, the researcher manipulates a single factor and fixes it at two or more levels and then randomly assigns experimental units to a treatment. This design is not always sufficient because the researcher may be aware of additional factors that cannot be fixed at a single level throughout the experiment.

Chapter

Chapter

Presentation Transcript

Chapter

Chapter

Chapter

chapter:

Chapter

Chapter # - Chapter Title

Chapter

chapter:

Chapter 6 Chapter Review

Chapter # - Chapter Title

CHAPTER 3 CHAPTER REVIEW

Chapter 1. Chapte r 2. Chapter 3. Chapter 4. Chapter 5. Chapter 6. Chapter 7.

Chapter Number Chapter Title

CHAPTER

chapter

Chapter

CHAPTER

Chapter # - Chapter Title

Chapter # - Chapter Title

CHAPTER 5 CHAPTER 2

Chapter 17 Chapter Review

Chapter 18 Chapter 18