320 likes | 501 Views
M&Ms Two-way Tables. Ellen Gundlach STAT 301 Course Coordinator Purdue University. M&Ms Color Distribution % according to their website. Skittles Color Distribution % according to their hotline. My M&Ms data in counts. My M&Ms data: joint % (divide counts by total = 76).
E N D
M&Ms Two-way Tables Ellen Gundlach STAT 301 Course Coordinator Purdue University
Conditional distribution of flavor for color • We know the color of our M&M already, but now how is flavor distributed for this color?
Conditional distribution example • We know we have a red M&M, so what is the probability it is a plain M&M?
Conditional distribution of color for flavor • We know the flavor of our M&M already, but now how is color distributed for this color?
Conditional distribution example • We know we have a peanut M&M, so what is the probability it is green?
Conditional distributions in general Conditional distribution of X for Y (we know Y for sure already, but we want to know the probability or % of having X be true as well):
Bar graphs for conditional distribution of color for both flavors
Chi-squared hypothesis test H0: There is no association between color distribution and flavor for M&Ms. Ha: There is association between color distribution and flavor for M&Ms. Use an = 0.01 for this story.
Full-class M&Ms data in counts(large sample size necessary for test)
Chi-squared test conclusions • Test statistic = 14.396 and P-value = 0.013 • Since P-value is > our of 0.01, we do not reject H0. • We do not have enough evidence to say there is association between color distribution and flavor for M&Ms.
Skittles vs. M&Ms • Now we will compare the proportion of yellow candies for Skittles and for M&Ms. • The previous two-way table with plain and peanut M&Ms was of size 2 x 6. • This table will be of size 2x2 because we only care about whether a candy is yellow or non-yellow.
Full-class M&Ms and Skittles data in counts(large sample size necessary for test)
Chi-squared hypothesis test H0: There is no association between color distribution and flavor for these candies. Ha: There is association between color distribution and flavor for these candies. Use an = 0.01 for this story.
Chi-squared test conclusions • Test statistic = 11.839 and P-value = 0.001 • Since P-value is < our of 0.01, we reject H0. • We have evidence that there is association between color distribution and flavor for these candies.
Another way to do this test Since this is a 2x2 table, and if we are only interested in a 2-sided () hypothesis test, we can use the 2-sample proportions test here.
2-sample proportion test hypotheses H0: pM&Ms = pSkittles Ha: pM&Ms pSkittles
Results from the proportion test • Sample proportions: • Test statistic Z = -3.44 • P-value = 2(0.0003) = 0.0006 • Since P-value < our of 0.01, we reject H0.
Conclusion to the proportion test • We have evidence the proportion of yellow M&Ms is not the same as the proportion of yellow Skittles. • In other words, the type of candy makes a difference to the color distribution.
How do our results from the 2 tests compare? • The X2 test statistic = 11.839, which is actually the (Z test statistic = -3.44)2. • If you take into account the rounding, the P-values for both tests are 0.001. • We rejected H0 in both tests.
When do you use which test? • Chi-squared tests are best for: two-sided hypothesis tests only 2x2 or bigger tables • Proportion (Z) tests are best for: one- or two-sided hypothesis tests only 2x2 tables