1 / 57

S TATISTICS

S TATISTICS. Part IIIA. Hypothesis Testing. Hypothesis Testing . Similar techniques to last block of study on estimation procedures. There, we made interval estimates of parameters without preconceptions.

farren
Download Presentation

S TATISTICS

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. STATISTICS Part IIIA. Hypothesis Testing

  2. Hypothesis Testing • Similar techniques to last block of study on estimation procedures. There, we made interval estimates of parameters without preconceptions. • In Hypothesis testing someone is making a claim or challenging a proposition about a parameter’s value. • Examples: • What is the mean weight of a medium-sized box of Cheerios? Build a C.I.! • General Mills claims a medium-sized box of Cheerios has a mean weight of 16 ounces. You wish to challenge this claim. Do hypothesis testing!

  3. Two Opposing Hypotheses • The null hypothesis, symbolized byHO, is a description of the status quo, or what people have long believed to be true. If the null hypothesis is not rejected, no action needs to be taken. • The alternative hypothesis, symbolized by HA, is a vehicle for challenging or contradicting the conventional wisdom. Accepting the alternative hypothesis calls for action.

  4. Examples • HO :Autonetics meets our quality specifications HA: Autonetics doesn’t meet our quality specifications • HO :Cancer rates for people who live close to high voltage power lines are not greater than for people who don’t. HA: Cancer rates for people who live close to high voltage power lines are greater than for people who don’t.

  5. The key to setting up hypotheses: • That which is contrary to what is commonly believed or accepted goes into the Alternative. • That which calls for action goes into the Alternative. • That which you are trying to demonstrate or that which you, the researcher, believe goes into the Alternative.* • “Everything else” goes into the Null. • The “equal” sign always goes with the Null.

  6. One-sided versus two-sided • HO : μ= 63.25 HA: μ 63.25 (A two-sided or two-tailed test) • HO : μ≥30.0 HA: μ<30.0 (A one-sided or one-tailed test, also called a left-tailed test) • HO : μ≤4.0 HA: μ>4.0 (A one-sided or one-tailed test, also called a right-tailed test)

  7. Examples • One of your suppliers claims that he fills and ships orders, on average, within six days of receipt. You believe his average time is longer than six days. HO : μ ≤6.0 HA: μ>6.0 • A manufacturer claims that a bottling machine fills each bottle precisely with 2000 cubic centimeters of cola. You want to test his claim. HO : μ=2000 HA: μ2000 • Research shows that Web surfers will lose interest in a Web page if downloading takes more than 6 seconds at 56K. You wish to test the effectiveness of a newly designed Web page in regard to its download time. HO : μ≤ 6 HA: μ> 6

  8. The Nature of Hypothesis Testing • We formulate two opposing hypotheses • We develop sample information • We assess whether sample information is sufficiently strong to allow us to reject the null. • This process is very similar to the American system of jurisprudence . . .

  9. Citizens are presumed innocent • HO :The defendant is innocent HA: The defendant is guilty If, based upon evidence during the trial, the jury cannot find the defendant guilty beyond a reasonable doubt, what do they declare? “We find the defendant not guilty.” Does the jury accept the null hypothesis?

  10. In hypothesis testing, we never accept the null hypothesis! We may fail to reject it but we never accept it. • We simply state, “Based upon sample information I cannot reject the null hypothesis beyond a reasonable doubt.” Note: When we reject the Null,we do accept the alternative.

  11. ‘The Reasonable Doubt’ • Let HO : μ= 400.0 • Let’s assume that this null really is true. • Let’s take a sample of size 36 from this population • What’s the expected value of X? • What if X turns out to be 396.9? 401.4? 407.0? 382.1? • The crux of hypothesis testing: How far must a sample statistic be from the null hypothesis before we go beyond ‘reasonable doubt’ and reject the null? • We define beyond ‘reasonable doubt’ by setting up “tails of rejection.”

  12. “Alpha” (  ) defines tails of rejection Let  = .05 .025 .025 μ= 400 382.1 407 X If X = 407 we would not reject the null If X = 382.1 we would reject the null

  13. “Alpha” (  ) defines tails of rejection Let  = .05 .025 .025 μ= 400 382.1 407 X What is the probability of rejecting a true null?

  14. “Alpha” (  ) defines tails of rejection Let  = .05 .025 .025 μ= 400 382.1 407 X Larger values of αmake it easier to reject the null

  15. TYPE I and TYPE II Errors

  16. Review of the four possible outcomes of hypothesis testing: • The Null Hypothesis is true but we reject it. What is the type of error and what is the probability of this occurrence? • Type I error. Probability is  and is called the level of significance. • The Null Hypothesis is true and we fail to reject it. What is the type of error and what is the probability of this occurrence? • No error! The probability of failing to reject a true null is 1 -  and is called the level of confidence. • The Null Hypothesis is false but we fail to reject it. What is the type of error and what is the probability of this occurrence? • Type II error. Probability is. • The Null Hypothesis is false and we reject it. What is the type of error and what is the probability of this occurrence? • No error! The probability of rejecting a false null is 1 -  and is called “the power of the test.”

  17. The Trade-off Between Type I & Type II Error Fail to reject Ho

  18. Relationship between αandβ • The “teeter totter”  β α This relationship holds for a given sample size.

  19. Relationship between αandβ • Why not set α at zero? • We would never reject the null whether it be true or false! • In this case only, αandβadd up to 1.0 • Setting αinfluences β but knowing αdoes not reveal β (except when αis 0.0). • We can reduce the risks of both Type I and Type II errors at the same time by increasing sample size.

  20. Where should we set α? • If the risk of a Type I error is greater than the risk of a Type II error, set αsmall, .01 for example. • If the risk of a Type II error is greater, make αrelatively large, .10 for example. • If the risks of the two types of errors are equal, set αat .05 (most common). S’pose we are testing the average tensile strength of a batch of bolts to see if it meets or exceeds minimum specified. It is very expensive to adjust the machine if the manufactured strength of these bolts does not meet standard. What level of αwould we use if these bolts are used to secure lids on trash cans? If these bolts are used on the space shuttle?

  21. The null would be that the bolts meet standard • The alternative would be that they don’t meet standard. • A Type I error would occur if they meet standard and we conclude they don’t. Then we’d go through an expensive machine adjustment process that really wasn’t necessary. • If these bolts are used on trash cans, we’d want a small alpha—reduce the risk of a Type I error! • If these bolts are used on the Shuttle, we’d want a small Beta—which we would get with a larger alpha!

  22. Hypothesis Testing • Hypothesis testing involves five major steps: • Formulating two opposing hypotheses • Settingα(and thereby defining the regions of rejection) • Collecting sample information • Using sample data to compute a test statistic • Make a decision based upon whether the test statistic is in a tail of rejection (defined by α)

  23. Computing a Test Statistic • Using z or t values • Remember that we can convert any normally distributed random variable to z or t values.

  24. Two-tailed vs. One-tailed Hypothesis Test

  25. Example: Large sample • Ever-Glo advertises that their 100 Watt bulbs last an average of 400 hours. You wish to challenge that claim. You test the lifetimes of 100 bulbs and get a sample mean of 411 hours and a sample standard deviation of 42.5 hours. What conclusion would you reach using a level of significance of .05? • Use the five step process to work this problem.

  26. Five step process: 1. HO : μ = 400.0 HA: μ 400.0 (Note: A two-tailed test) • α= .05 (which defines the rejection region as ± 1.96) 3. n = 100 X = 411.0 s = 42.5 4. 5. 2.59 > 1.96. Therefore we are in the right tail. We reject the null!

  27. “Alpha” (  ) defines tails of rejection .025 .025 μo= 400 X -1.96 +1.96 2.59

  28. “Alpha” (  ) defines tails of rejection .025 .025 μo= 400 X -1.96 +1.96 2.59 How do we interpret the 2.59?

  29. “Alpha” (  ) defines tails of rejection .025 .025 μo= 400 X -1.96 +1.96 2.59 For a normal distribution with a mean of 400 and a standard deviation of 4.25, 411 is 2.59 standard deviations to the right.

  30. “Alpha” (  ) defines tails of rejection .025 .025 μo= 400 X -1.96 +1.96 2.59 For a normal distribution with a mean of 400 and a standard deviation of 4.25, 411 is 2.59 standard deviations to the right. Our decision rule: Reject the Null for any test statistic beyond ± 1.96 standard deviations.

  31. “Alpha” (  ) defines tails of rejection .025 .025 μo= 400 X -1.96 +1.96 2.59 We are at risk for committing what kind of error? What is that risk?

  32. Another Large Sample Problem: • Honda claims its Pilot SUV obtains an average of at least 22 mpg for highway driving. I’m skeptical. I had an independent research firm drive 50 identical models of the Pilot under identical and controlled highway conditions. The sample mean was 20.9 mpg. The sample standard deviation was 5.1 mpg. What conclusion can be drawn at the .05 level of significance?

  33. Is Honda likely telling the truth? HO : μ≥22.0 HA: μ<22.0 (Note: A left-tailed test) The tail of rejection (the Z value that puts .05 in the left tail) is defined by –1.645. The test statistic is

  34. A left-tailed test: .05 -1.645 μo= 22 X -1.525 We fail to reject the null. These results are notstatisticallysignificant. We may, however, be committing a ________error.

  35. A left-tailed test: .05 -1.645 μo= 22 X -1.525 We fail to reject the null. These results are notstatisticallysignificant. We may, however, be committing a TYPE II error.

  36. Try Problem Set 3

  37. Another Large Sample Problem: • A venture capitalist believes that funding the development of a new remote control lawn mower will be a profitable investment if the average price paid for a standard, self-propelled lawn mower is more than $450.00. A random sample of 46 people who recently purchased high-end mowers was obtained. The mean was $465.70. The standard deviation was $76.00. • Based upon a significance level of .05 should the venture capitalist invest in developing the new lawn mower? (Use MINITAB)

  38. Small sample problems • Remember for small samples, the parent population must be normally distributed. • Use the t statistic instead of Z

  39. Small sample problems • Remember for small samples, the parent population must be normally distributed. • Use the t statistic instead of Z Go into the t table at n-l degrees of freedom to get the tail of rejection. (Split the alpha for a two-tailed test.)

  40. Small sample problems • HO : μ≥30.0 HA: μ<30.0 Suppose n= 14 and the “t” computed from sample data is -1.84. What is the tail of rejection at .05 according to the t table and what is your decision? -1.771 defines the tail of rejection. We are in the tail. We would reject the null.

  41. Small sample problems • HO : μ=32 inches HA: μ 32 inches Suppose n = 26 and the “t” computed from sample data is 1.44. What is the tail of rejection at .05 according to the t table and what is your decision? +2.060 defines the tail of rejection. We are not in the tail. We would fail to reject the null.

  42. The p-Value concept • S’pose, in a two-tailed test of hypotheses, I obtained a test statistic (Z value) of 1.85. • Would I reject the Null at a .10 level of significance? The tail of rejection would be defined by 1.645. The Z value of 1.85 would be in a tail of rejection. • Would I reject the Null at a .05 level of significance? The tail of rejection would now be defined by 1.96. I would not be in a tail of rejection with a Z=1.85. • Would I reject the Null at a .0644 level of significance? The tail of rejection would now be defined by 1.85—the Z value!

  43. The p-Value concept • S’pose, in a two-tailed test of hypothesis, I obtained a test statistic (Z value) of 1.85. • Would I reject the Null at a .10 level of significance? The tail of rejection would be defined by 1.645. The Z value of 1.85 would be in a tail of rejection. • Would I reject the Null at a .05 level of significance? The tail of rejection would now be defined by 1.96. I would not be in a tail of rejection with a Z=1.85. • Would I reject the Null at a .0644 level of significance? The tail of rejection would now be defined by 1.85—the Z value!

  44. The p-value is the level of alpha at which the decision changes If Z = 1.85, we would reject the null at .10. We would not reject the null at .05. The decision changes at an alpha of .0644 .025 .025 X -1.96 1.645 +1.96 1.85 As alpha gets smaller and smaller, it becomes harder and harder to reject the null. What is the lowest level of alpha at which we can still reject the null in this example?

  45. The p-value is the level of alpha at which the decision changes If Z = 1.85, we would reject the null at .10. We would not reject the null at .05. The decision changes at an alpha of .0644 .025 .025 X -1.96 1.645 +1.96 1.85 As alpha gets smaller and smaller, it becomes harder and harder to reject the null. What is the lowest level of alpha at which we can still reject the null in this example? .0644 and that is the p-value!

  46. p-Value • The p-value is the value of  at which the hypothesis test procedure changes conclusions based on a given set of data. • P-value conceptualization I: It is the lowest value of  for which you will be able to reject Ho. • P-value conceptualization II: It is the probability of getting the sample result that you obtained or something more extreme, given that the null is true.

  47. Procedure for Finding the p-Value Obtain the area in the tail(s) at Z • For a two-tailed test (Ha:   o) p-value = 2 * (area outside Z) • For a right-tailed test (Ha:  > o) p-value = area to the right of Z • For a left tailed test (Ha:  < o) p-value = area to the left of Z

  48. Examples of computing p-value: 1. For HO : μ≥22.0 HA: μ<22.0 and Z = -1.5 what is the p-value? .0668 2. For HO : μ≤6.0 HA: μ>6.0 And Z = 4.9 what is the p-value? 0.00

  49. Examples of computing p-value: 3. For HO : μ=2000 HA: μ 2000 and Z = 1.85 what is the p-value? .0644 4. For HO : μ≤34,000 HA: μ>34,000 And Z = .84 what is the p-value? 0.20 What does the .20 mean in plain English?

  50. Work this problem: • Consumer Reports magazine claims that the good negotiator can get at least 7% off the sticker price of a new car. I send out skilled negotiators to 36 randomly selected new car dealers to test this claim. I find they average a negotiated discount of 6.32% with a sample standard deviation of 1.7%. • What are the hypotheses? • What is the lowest level of alpha at which I can reject the null?

More Related