490 likes | 613 Views
MGMT 276: Statistical Inference in Management Spring , 2013. Welcome. . Please hand in your homework – they must be stapled. Careful: Be sure your data are your own If not sure, do not hand in your homework. Please hand in your homework. Statistical Inference in Management.
E N D
MGMT 276: Statistical Inference in ManagementSpring, 2013 Welcome
. Please hand in your homework – they must be stapled Careful: Be sure your data are your own If not sure, do not hand in your homework Please hand in your homework
Statistical Inference in Management Instructor:Suzanne Delaney, Ph.D. Office:405 “N” McClelland Hall Phone:621-2045 Email:delaney@u.arizona.edu Office hours:2:00 – 3:30Mondays and Fridays and by appointment
Homework due – Thursday (April 4th) On class website: Please print and complete homework #17 Hypothesis testing with ANOVAs Please click in My last name starts with a letter somewhere between A. A – D B. E – L C. M – R D. S – Z Please double check – All cell phones other electronic devices are turned off and stowed away
Please read: Chapters 10 – 12 in Lind book and Chapters 2 – 4 in Plous book: (Before the next exam – April 9th) Lind Chapter 10: One sample Tests of Hypothesis Chapter 11: Two sample Tests of Hypothesis Chapter 12: Analysis of Variance Plous Chapter 2: Cognitive Dissonance Chapter 3: Memory and Hindsight Bias Chapter 4: Context Dependence
Use this as your study guide By the end of lecture today4/2/13 • Logic of hypothesis testing • Steps for hypothesis testing • Hypothesis testing with analysis of variance (ANOVA) • Interpreting excel output of hypothesis tests • Constructing brief, complete summary statements
“Between Groups”Variability . Difference between means Difference between means Difference between means Variabilityof curve(s) “Within Groups”Variability Variabilityof curve(s) Variabilityof curve(s)
Five steps to hypothesis testing Step 1: Identify the research problem (hypothesis) Describe the null and alternative hypotheses Step 2: Decision rule • Alpha level? (α= .05 or .01)? Still, difference between means • Critical statistic (e.g. z or t or F or r) value? Step 3: Calculations MSBetween F = MSWithin Still, variabilityof curve(s) Step 4: Make decision whether or not to reject null hypothesis If observed t (or F) is bigger then critical t (or F) then reject null Step 5: Conclusion - tie findings back in to research problem
One-way ANOVA Number of cookies sold Bike None Hawaii trip Incentives • One-way ANOVAs test only one independent variable • - although there may be many levels • “Factor” = one independent variable • “Level” = levels of the independent variable • treatment • condition • groups • “Main Effect” of independent variable = difference between levels • Note: doesn’t tell you which specific levels (means) differ from each other A multi-factor experiment would be a multi-independent variables experiment
Comparing ANOVAs with t-tests Similarities still include: Using distributions to make decisions about common and rare events Using distributions to make inferences about whether to reject the null hypothesis or not The same 5 steps for testing an hypothesis Tells us generally about number of participants / observations Tells us generally about number of groups / levels of IV • The three primary differences between t-tests and ANOVAS are: • 1. ANOVAs can test more than two means • 2. We are comparing sample means indirectly by • comparing sample variances • 3. We now will have two types of degrees of freedom • t(16) = 3.0; p < 0.05 F(2, 15) = 3.0; p < 0.05 Tells us generally about number of participants / observations
One way analysis of varianceVariance is divided Remember, one-way = one IV Total variability Between group variability (only one factor) Within group variability (error variance) Remember, 1 factor = 1 independent variable(this will be our numerator – like difference between means) Remember, error variance = random error(this will be our denominator – like within group variability
Three different types of variance Between groups Within groups Total Between Groups Variability Total Variability Variability between groups F = Within Groups Variability Variability within groups
ANOVA Variability between groups F = Variability within groups Variability Between Groups “Between” variability bigger than “within” variability so should get a big (significant) F Variability Within Groups Variability Within Groups Variability Between Groups “Between” variability getting smaller “within” variability staying same so, should get a smaller F Variability Within Groups Variability Within Groups Variability Between Groups “Between” variability getting very small “within” variability staying same so, should get a very small F Variability Within Groups Variability Within Groups
ANOVA Variability between groups F = Variability within groups Variability Between Groups “Between” variability bigger than “within” variability so should get a big (significant) F Variability Within Groups Variability Within Groups Variability Between Groups “Between” variability getting smaller “within” variability staying same so, should get a smaller F Variability Within Groups “Between” variability getting very small “within” variability staying same so, should get a very small F (equal to 1)
. Effect size is considered relativeto variability of distributions Treatment Effect x Variability between groups Treatment Effect x Variabilitywithin groups
x = 10 x = 14 x = 12 What if we want to compare 3 means? One independent variable with 3 means A girlscout troop leader wondered whether providing an incentive to whomever sold the most girlscout cookies would have an effect on the number cookies sold. She provided a big incentive to one troop (trip to Hawaii), a lesser incentive to a second troop (bicycle), and no incentive to a third group, and then looked to see who sold more cookies. Troop 1 (Hawaii) 14 9 19 13 15 Troop 2 (bicycle) 12 14 10 11 13 Troop 3 (nada) 10 8 12 7 13 Note: 5 girls in each troop
A girl scout troop leader wondered whether providing an incentive to whomever sold the most girl scout cookies would have an effect on the number cookies sold. She provided a big incentive to one troop (trip to Hawaii), a lesser incentive to a second troop (bicycle), and no incentive to a third group, and then looked to see who sold more cookies. How many levels of the Independent Variable? What is Independent Variable? Troop 3 (Hawaii) 14 9 19 13 15 Troop 1 (nada) 10 8 12 7 13 Troop 2 (bicycle) 12 14 10 11 13 What is Dependent Variable? How many groups? n = 5 x = 10 n = 5 x = 12 n = 5 x = 14
Main effect of incentive: Will offering an incentive result in more girl scout cookies being sold? • If we have a “effect” of • incentive then the means • are significantly different • from each other • we reject the null • we have a significant F • p < 0.05 • To get an effect we want: • Large “F” - big effect and small variability • Small “p” - less than 0.05 (whatever our alpha is) We don’t know which means are different from which …. just that they are not all the same
Hypothesis testing: Step 1: Identify the research problem Is there a significant difference in the number of cookie boxes sold between the girlscout troops that were given the different levels of incentive? Describe the null and alternative hypotheses
Hypothesis testing: = .05 Decision rule Degrees of freedom (between) = number of groups - 1 = 3 - 1 = 2 Degrees of freedom (within) = # of scores - # of groups = (15-3) = 12* Critical F(2,12) = 3.98 *or = (5-1) + (5-1) + (5-1) = 12.
Appendix B.4 (pg.518) F (2,12) α= .05 Critical F(2,12) = 3.89
“SS” = “Sum of Squares”- will be given for exams- you can think of this as the numerator in a standard deviation formula ANOVA table F Source df MS SS Between ? ? ? ? Within ? ? ? Total ? ?
“SS” = “Sum of Squares”- will be given for exams ANOVA table F Source df MS SS 3-1=2 # groups - 1 Between 40 ? 2 ? ? ? 15-3=12 Within ? 88 ? 12 # scores - number of groups ? Total ? 128 ? 14 # scores - 1 15- 1=14
ANOVA table MSbetween MSwithin 40 88 SSbetween 12 2 ANOVA table dfbetween F Source df MS SS ? Between 40 2 ? 2.73 20 Within 88 12 ? 7.33 Total 128 14 SSwithin dfwithin 88 20 =2.73 =7.33 40 7.33 12 =20 2
Make decision whether or not to reject null hypothesis Observed F = 2.73 Critical F(2,12) = 3.89 2.73 is not farther out on the curve than 3.89 so, we do not reject the null hypothesis F(2,12) = 2.73; n.s. Conclusion: There appears to be no effect of type of incentive on number of girl scout cookies sold The average number of cookies sold for three different incentives were compared. The mean number of cookie boxes sold for the “Hawaii” incentive was 14 , the mean number of cookies boxes sold for the “Bicycle” incentive was 12, and the mean number of cookies sold for the “No” incentive was 10. An ANOVA was conducted and there appears to be no significant difference in the number of cookies sold as a result of the different levels of incentive F(2, 12) = 2.73; n.s.
Let’s do same problemUsing MS Excel A girlscout troop leader wondered whether providing an incentive to whomever sold the most girlscout cookies would have an effect on the number cookies sold. She provided a big incentive to one troop (trip to Hawaii), a lesser incentive to a second troop (bicycle), and no incentive to a third group, and then looked to see who sold more cookies. Troop 1 (Nada) 10 8 12 7 13 Troop 2 (bicycle) 12 14 10 11 13 Troop 3 (Hawaii) 14 9 19 13 15 n = 5 x = 10 n = 5 x = 12 n = 5 x = 14
MSbetween MSwithin 40 88 SSbetween 12 2 dfbetween 3-1=2 # groups - 1 SSwithin dfwithin # scores - number of groups 15-3=12 88 20 =2.73 =7.33 40 # scores - 1 7.33 12 =20 2 15- 1=14
No, so it is not significant Do not reject null No, so it is not significant Do not reject null F critical(is observed F greater than critical F?) P-value(is it less than .05?)
Make decision whether or not to reject null hypothesis Observed F = 2.73 Critical F(2,12) = 3.89 2.7 is not farther out on the curve than 3.89 so, we do not reject the null hypothesis Also p-value is not smaller than 0.05 so we do not reject the null hypothesis Step 6: Conclusion: There appears to be no effect of type of incentive on number of girl scout cookies sold
Make decision whether or not to reject null hypothesis Observed F = 2.72727272 F(2,12) = 2.73; n.s. Critical F(2,12) = 3.88529 2.7 is not farther out on the curve than 3.89 so, we do not reject the null hypothesis Conclusion: There appears to be no effect of type of incentive on number of girl scout cookies sold The average number of cookies sold for three different incentives were compared. The mean number of cookie boxes sold for the “Hawaii” incentive was 14 , the mean number of cookies boxes sold for the “Bicycle” incentive was 12, and the mean number of cookies sold for the “No” incentive was 10. An ANOVA was conducted and there appears to be no significant difference in the number of cookies sold as a result of the different levels of incentive F(2, 12) = 2.73; n.s.
Let’s try one In a one-way ANOVA we have three types of variability. Which picture best depicts the random error variability (also known as the within variability)? a. Figure 1 b. Figure 2 c. Figure 3 d. All of the above 1. 2. 3.
Variability between groups F = Let’s try one Variability within groups Which figure would depict the largest F ratio a. Figure 1 b. Figure 2 c. Figure 3 d. All of the above 1. 2. 3.
Let’s try one Winnie found an observed F ratio of .9, what should she conclude? a. Reject the null hypothesis b. Do not reject the null hypothesis c. Not enough info is given 1. 2. 3.
How many observations within each group? Let’s try one An ANOVA was conducted comparing different types of solar cells and there appears to be a significant difference in output of each (watts) F(4, 25) = 3.12; p < 0.05. In this study there were __ types of solar cells and __ total observations in the whole study? a. 4; 25 b. 5; 30 c. 4; 30 d. 5; 25 F(4, 25) = 3.12; p < 0.05 # groups - 1 # scores - # of groups # scores - 1
Let’s try one An ANOVA was conducted comparing different types of solar cells and there appears to be significant difference in output of each (watts) F(4, 25) = 3.12; p < 0.05. In this study ___ a. we rejected the null hypothesis b. we did not reject the null hypothesis F(4, 25) = 3.12; p < 0.05 Observed F bigger than Critical F p < .05
Let’s try one An ANOVA was conducted comparing different types of solar cells. The analysis was completed using an alpha of 0.05. But Julia now wants to know if she can reject the null with an alpha of at 0.01. In this study ___ a. we rejected the null hypothesis b. we did not reject the null hypothesis F(4, 25) = 3.12; p < 0.05 Comparison of the Observed F and Critical F Is no longer are helpful because the critical F is no longer correct. We must use the p value p < .05 p > .01
Let’s try one An ANOVA was conducted comparing home prices in four neighborhoods (Southpark, Northpark, Westpark, Eastpark) . For each neighborhood we measured the price of four homes. Please complete this ANOVA table. Degrees of freedom between is _____; degrees of freedom within is ____ a. 16; 4 b. 4; 16 c. 12; 3 d. 3; 12 .
Let’s try one An ANOVA was conducted comparing home prices in four neighborhoods (Southpark, Northpark, Westpark, Eastpark) . For each neighborhood we measured the price of four homes. Please complete this ANOVA table. Mean Square between is _____; Mean Square within is ____ a. 300, 300 b. 100, 100 c. 100, 25 d. 25, 100 .
Let’s try one An ANOVA was conducted comparing home prices in four neighborhoods (Southpark, Northpark, Westpark, Eastpark) . For each neighborhood we measured the price of four homes. Please complete this ANOVA table. The F ratio is: a. .25 b. 1 c. 4 d. 25 .
Let’s try one An ANOVA was conducted comparing home prices in four neighborhoods (Southpark, Northpark, Westpark, Eastpark) . For each neighborhood we measured the price of four homes. Please complete this ANOVA table. We should: a. reject the null hypothesis b. not reject the null hypothesis Observed F bigger than Critical F p < .05
Let’s try one An ANOVA was conducted comparing home prices in four neighborhoods (Southpark, Northpark, Westpark, Eastpark) . For each neighborhood we measured the price of four homes. The most expensive neighborhood was the ____ neighborhood a. Southpark b. Northpark c. Westpark d. Eastpark
An ANOVA was conducted comparing home prices in four neighborhoods (Southpark, Northpark, Westpark, Eastpark) . For each neighborhood we measured the price of four homes. Please complete this ANOVA table. The best summary statement is: a. F(3, 12) = 4.0; n.s. b. F(3, 12) = 4.0; p < 0.05 c. F(3, 12) = 3.49; n.s. d. F(3, 12) = 3.49; p < 0.05
Thank you! See you next time!!