1 / 52

Comparing Populations

Comparing Populations. Proportions and means. The sampling distribution of differences of Normal Random Variables. If X and Y denote two independent normal random variables, then : D = X – Y is normal with. Comparing proportions. Situation We have two populations (1 and 2)

peay
Download Presentation

Comparing Populations

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Comparing Populations Proportions and means

  2. The sampling distribution of differences of Normal Random Variables If X and Y denote twoindependent normal random variables, then : D = X –Y is normal with

  3. Comparing proportions Situation • We have two populations (1 and 2) • Let p1 denote the probability (proportion) of “success” in population 1. • Let p2 denote the probability (proportion) of “success” in population 2. • Objective is to compare the two population proportions

  4. Consider the statistic: This statistic has a normal distribution

  5. Consider the statistic: This statistic has a normal distribution with

  6. Thus Has a standard normal distribution

  7. We want to test either: or or

  8. If p1 = p2 (p say) then the test statistic:

  9. has a standard normal distribution. where is an estimate of the common value of p1 and p2.

  10. Thus for comparing two binomial probabilities p1 and p2 The test statistic where

  11. The Critical Region

  12. Example • In a national study to determine if there was an increase in mortality due to pipe smoking, a random sample of n1 = 1067 male nonsmoking pensioners were observed for a five-year period. • In addition a sample of n2 = 402 male pensioners who had smoked a pipe for more than six years were observed for the same five-year period. • At the end of the five-year period, x1 = 117 of the nonsmoking pensioners had died while x2 = 54 of the pipe-smoking pensioners had died. • Is there a the mortality rate for pipe smokers higher than that for non-smokers

  13. We want to test: The test statistic:

  14. Note:

  15. The test statistic:

  16. We reject H0 if: Not true hence we accept H0. Conclusion: There is not a significant (a= 0.05) increase in the mortality rate due to pipe-smoking

  17. Estimating a difference proportions using confidence intervals Situation • We have two populations (1 and 2) • Let p1 denote the probability (proportion) of “success” in population 1. • Let p2 denote the probability (proportion) of “success” in population 2. • Objective is to estimate the difference in the two population proportions d = p1 – p2.

  18. Confidence Interval for d = p1 – p2 100P% = 100(1 – a) % :

  19. Example • Estimating the increase in the mortality rate for pipe smokers higher over that for non-smokers d = p2 – p1

  20. Comparing Means Situation • We have two normal populations (1 and 2) • Let m1and s1 denote the mean and standard deviation of population 1. • Let m2and s2 denote the mean and standard deviation of population 1. • Let x1, x2, x3 , … , xn denote a sample from a normal population 1. • Let y1, y2, y3 , … , ym denote a sample from a normal population 2. • Objective is to compare the two population means

  21. or or We want to test either:

  22. Consider the test statistic:

  23. If: • will have a standard Normal distribution • This will also be true for the approximation (obtained by replacing s1 by sx and s2 by sy) if the sample sizes n and m are large (greater than 30)

  24. Note:

  25. Example • A study was interested in determining if an exercise program had some effect on reduction of Blood Pressure in subjects with abnormally high blood pressure. • For this purpose a sample of n = 500 patients with abnormally high blood pressure were required to adhere to the exercise regime. • A second sample m = 400 of patients with abnormally high blood pressure were not required to adhere to the exercise regime. • After a period of one year the reduction in blood pressure was measured for each patient in the study.

  26. vs We want to test: The exercise group did not have a higher average reduction in blood pressure The exercise group did have a higher average reduction in blood pressure

  27. The test statistic:

  28. Suppose the data has been collected and:

  29. The test statistic:

  30. We reject H0 if: True hence we reject H0. Conclusion: There is a significant (a= 0.05) effect due to the exercise regime on the reduction in Blood pressure

  31. Estimating a difference means using confidence intervals Situation • We have two populations (1 and 2) • Let m1 denote the mean of population 1. • Let m2 denote the mean of population 2. • Objective is to estimate the difference in the two population proportions d = m1 – m2.

  32. Confidence Interval for d = m1 – m2

  33. Example • Estimating the increase in the average reduction in Blood pressure due to the excercize regime d = m1 – m2

  34. Comparing Means – small samples Situation • We have two normal populations (1 and 2) • Let m1and s1 denote the mean and standard deviation of population 1. • Let m2and s2 denote the mean and standard deviation of population 1. • Let x1, x2, x3 , … , xn denote a sample from a normal population 1. • Let y1, y2, y3 , … , ym denote a sample from a normal population 2. • Objective is to compare the two population means

  35. We want to test either: or or

  36. Consider the test statistic:

  37. If the sample sizes (m and n) are large the statistic will have approximately a standard normal distribution This will not be the case if sample sizes (m and n) are small

  38. The t test – for comparing means – small samples (equal variances) Situation • We have two normal populations (1 and 2) • Let m1and s denote the mean and standard deviation of population 1. • Let m2and s denote the mean and standard deviation of population 1. • Note: we assume that the standard deviation for each population is the same. s1 = s2 = s

  39. Let

  40. The pooled estimate of s. Note: both sxand syare estimators of s. These can be combined to form a single estimator of s, sPooled.

  41. The test statistic: If m1 = m2 this statistic has a t distribution with n + m –2 degrees of freedom

  42. are critical points under the t distribution with degrees of freedom n + m –2.

  43. Example • A study was interested in determining if administration of a drug reduces cancerous tumor size. • For this purpose n +m = 9 test animals are implanted with a cancerous tumor. • n = 3 are selected at random and administered the drug. • The remaining m = 6 are left untreated. • Final tumour sizes are measured at the end of the test period

  44. We want to test: The treated group did not have a lower average final tumour size. vs The exercise group did have a lower average final tumour size.

  45. The test statistic:

  46. Suppose the data has been collected and:

  47. The test statistic:

  48. We reject H0 if: with d.f. = n + m – 2 = 7 Hence we accept H0. Conclusion: The drug treatment does not result in a significant (a= 0.05) smaller final tumour size,

  49. Summary of Tests

More Related