1 / 40

Week 9

Week 9. Chapter 9 - Hypothesis Testing II: The Two-Sample Case . Chapter 9. Hypothesis Testing II: The Two-Sample Case. In This Presentation. The basic logic of the two-sample case Test of significance for sample means (large and small samples)

colm
Download Presentation

Week 9

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case

  2. Chapter 9 Hypothesis Testing II: The Two-Sample Case

  3. In This Presentation The basic logic of the two-sample case Test of significance for sample means (large and small samples) Tests of significance for sample proportions (large samples) The difference between “statistical significance” and “importance”

  4. In The Last Chapter Group 1 = All Education Majors RS of 100 Education Majors Population = Penn State University

  5. In This Chapter Group 1 = All Males in Population RS of 100 Males Population = Pennsylvania RS of 100 Females Group 2 = All Females in Population

  6. Basic Logic • We begin with a difference between sample statistics (means or proportions) • The question we test: • “Is the difference between the samples large enough to allow us to conclude (with a known probability of error) that the populationsrepresented by the samples are different?” (p. 212)

  7. Basic Logic • The Ho is that the populations are the same • There is no difference between the parameters of the two populations • If the difference between the sample statistics is large enough, or, if a difference of this size is unlikely, assuming that the Ho is true, we will rejectthe Ho and conclude there is a difference between the populations

  8. Basic Logic The Ho is a statement of “no difference” The 0.05 level will continue to be our indicator of a significant difference

  9. The Five Step Model Make assumptions and meet test requirements State the null hypothesis, Ho Select the sampling distribution and establish the critical region Calculate the test statistic Make a decision and interpret the results

  10. The Five Step Model Changes from the one-sample case: Step 1: In addition to samples selected according to EPSEM principles, samples must be selected independently of each other: independent random sampling Step 2: Null hypothesis statement will state that the two populations are not different Step 3: Sampling distribution refers to difference between the sample statistics

  11. Scenario #1: Dependent variable is measured at the interval/ratio level with large sample sizes and unknown population standard deviations

  12. Hypothesis Test for Means Two-Sample Test of Means (Large Samples) • Do men and women significantly differ on their support of gun control? • For men (sample 1): • Mean = 6.2 • Standard deviation = 1.3 • Sample size = 324 • For women (sample 2): • Mean = 6.5 • Standard deviation = 1.4 • Sample size = 317

  13. Step 1: Make Assumptions and Meet Test Requirements • Independent random sampling • The samples must be independent of each other • Level of measurement is interval-ratio • Support of gun control is assessed with an interval-ratio level scale, so the mean is an appropriate statistic • Sampling distribution is normal in shape • Total N ≥ 100 (N1 + N2 = 324 + 317 = 641) so the Central Limit Theorem applies and we can assume a normal shape

  14. Step 2: State the Null Hypothesis • Ho: μ1 = μ2 • The null hypothesis asserts there is no difference between the populations • H1: μ1 μ2 • The research hypothesis contradicts the Ho and asserts there is a difference between the populations

  15. Step 3: Select the Sampling Distribution and Establish the Critical Region Sampling Distribution = Z distribution Alpha (α) = 0.05 (two-tailed) Z (critical) = ±1.96

  16. Step 4: Compute the Test Statistic Calculate the pooled estimate of the standard error Calculate the obtained Z score

  17. Step 5: Make a Decision The obtained test statistic (Z = -2.80) falls in the critical region so reject the null hypothesis The difference between the sample means is so large that we can conclude (at alpha = 0.05) that a difference exists between the populations represented by the samples The difference between men’s and women’s support of gun control is significant

  18. Factors in Making a Decision The size of the difference (e.g., means of 6.2 and 6.5) The value of alpha – the higher the alpha, the more likely we are to reject the Ho The use of one- vs. two-tailed tests (we are more likely to reject with a one-tailed test) The size of the sample (N) – the larger the sample the more likely we are to reject the Ho

  19. Scenario #2: Dependent variable is measured at the interval/ratio level with small sample sizes and unknown population standard deviations

  20. Scenario #2 For small samples, s is too biased an estimator of σso do not use standard normal distribution Use Student’s t distribution

  21. Hypothesis Test for Means Two-Sample Test of Means (Small Samples) • Do families that reside in the center-city have more children than families that reside in the suburbs? • For suburbs (sample 1): • Mean = 2.37 • Standard deviation = 0.63 • Sample size = 42 • For center-city (sample 2): • Mean = 2.78 • Standard deviation = 0.95 • Sample size = 37

  22. Step 1: Make Assumptions and Meet Test Requirements • Independent random sampling • The samples must be independent of each other • Level of measurement is interval-ratio • Number of children can be treated as interval-ratio • Population variances are equal • As long as the two samples are approximately the same size, we can make this assumption • Sampling distribution is normal in shape • Because we have two small samples (N < 100), we have to add the previous assumption in order to meet this assumption

  23. Step 2: State the Null Hypothesis • Ho: μ1 = μ2 • The null hypothesis asserts there is no difference between the populations • H1: μ1 <μ2 • The research hypothesis contradicts the Ho and asserts there is a difference between the populations

  24. Step 3: Select the Sampling Distribution and Establish the Critical Region Sampling Distribution = t distribution Alpha (α) = 0.05 (one-tailed) Degrees of freedom = N1 + N2 -2 = 77 t (critical) = -1.671

  25. Step 4: Compute the Test Statistic Calculate the pooled estimate of the standard error Calculate the obtained Z score

  26. Step 4: Compute the Test Statistic

  27. Step 5: Make a Decision The obtained test statistic (t = -2.16) falls in the critical region so reject the null hypothesis The difference between the sample means is so large that we can conclude (at alpha = 0.05) that a difference exists between the populations represented by the samples The difference between the number of children in center-city families and the suburban families is significant

  28. Scenario #3: Dependent variable is measured at the nominal level with large sample sizes and unknown population standard deviation

  29. Hypothesis Test for Proportions Two-Sample Test of Proportions (Large Samples) • Do Black and White senior citizens differ in their number of memberships in clubs and organizations? (Using the proportion of each group classified as having a “high” level of membership) • For Black senior citizens (sample 1): • Proportion = 0.34 • Sample size = 83 • For White senior citizens (sample 2): • Proportion = 0.25 • Sample size = 103

  30. Step 1: Make Assumptions and Meet Test Requirements • Independent random sampling • The samples must be independent of each other • Level of measurement is nominal • We have measured the proportion of each group classified as having a “high” level of membership • Sampling distribution is normal in shape • Total N ≥ 100 (N1 + N2 = 83 + 103 = 186) so the Central Limit Theorem applies and we can assume a normal shape

  31. Step 2: State the Null Hypothesis • Ho: Pu1 = Pu2 • The null hypothesis asserts there is no difference between the populations • H1: Pu1 ≠ Pu2 • The research hypothesis contradicts the Ho and asserts there is a difference between the populations

  32. Step 3: Select the Sampling Distribution and Establish the Critical Region Sampling Distribution = Z distribution Alpha (α) = 0.05 (two-tailed) Z(critical) = ±1.96

  33. Step 4: Compute the Test Statistic Calculate the pooled estimate of the standard error

  34. Step 4: Compute the Test Statistic Calculate the obtained Z score

  35. Step 5: Make a Decision The obtained test statistic (Z = +1.29) does not fall in the critical region so fail to reject the null hypothesis The difference between the sample means is small enough that we can conclude (at alpha = 0.05) that no difference exists between the populations represented by the samples The difference between the memberships of Black and White senior citizens is not significant

  36. Limitations of Hypothesis Testing Significance Versus Importance • As long as we work with random samples, we must conduct a test of significance • Significance is not the same thing as importance • Differences that are otherwise trivial or uninteresting may be significant

  37. Conclusion • Scenario #1: Dependent variable is measured atthe interval/ratio level with large sample sizes and unknown population standard deviations • Use standard Z distribution formula • Scenario #2: Dependent variable is measured atthe interval/ratio level with small sample sizes and unknown population standard deviations • Use standard T distribution formula • Scenario #3: Dependent variable is measured atthe nominal level with large sample sizes and known population standard deviations • Use slightly modified Z distribution formula • Scenario #4: Dependent variable is measured atthe nominal level with small sample sizes • This is not covered in your text or in class

  38. USING SPSS • On the top menu, click on “Analyze” • Select “Compare Means” • Select “Independent Samples T Test OR Paired Samples T Test”

  39. Test on two means (independent samples) • Analyze / Compare means / Independent samples t-test Specify which groups you are comparing

  40. Output The null hypothesis is rejected(as the p-value is smaller than 0.05)

More Related