690 likes | 932 Views
2. 2008 Brooks/Cole, a division of Thomson Learning, Inc.. BASICS. In statistics, a hypothesis is a statement about a population characteristic.. 3. 2008 Brooks/Cole, a division of Thomson Learning, Inc.. FORMAL STRUCTURE. Hypothesis Tests are based on an reductio ad absurdum form of argument.S
E N D
1. 1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 10 Hypothesis Tests
Using a Single Sample
2. 2 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. BASICS In statistics, a hypothesis is a statement about a population characteristic.
3. 3 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. FORMAL STRUCTURE Hypothesis Tests are based on an reductio ad absurdum form of argument.
Specifically, we make an assumption and then attempt to show that assumption leads to an absurdity or contradiction, hence the assumption is wrong.
4. 4 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. FORMAL STRUCTURE The null hypothesis, denoted H0 is a statement or claim about a population characteristic that is initially assumed to be true.
The null hypothesis is so named because it is the “starting point” for the investigation. The phrase “there is no difference” is often used in its interpretation.
5. 5 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. FORMAL STRUCTURE The alternate hypothesis, denoted by Ha is the competing claim.
The alternate hypothesis is a statement about the same population characteristic that is used in the null hypothesis.
Generally, the alternate hypothesis is a statement that specifies that the population has a value different, in some way, from the value given in the null hypothesis.
6. 6 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. FORMAL STRUCTURE Rejection of the null hypothesis will imply the acceptance of this alternative hypothesis.
Assume H0 is true and attempt to show this leads to an absurdity, hence H0 is false and Ha is true.
7. 7 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. FORMAL STRUCTURE Typically one assumes the null hypothesis to be true and then one of the following conclusions are drawn.
Reject H0
Equivalent to saying that Ha is correct or true
Fail to reject H0
Equivalent to saying that we have failed to show a statistically significant deviation from the claim of the null hypothesis
This is not the same as saying that the null hypothesis is true.
8. 8 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. AN ANALOGY The Statistical Hypothesis Testing process can be compared very closely with a judicial trial.
Assume a defendant is innocent (H0)
Present evidence to show guilt
Try to prove guilt beyond a reasonable doubt (Ha)
9. 9 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. AN ANALOGY Two Hypotheses are then created.
H0: Innocent
Ha: Not Innocent (Guilt)
10. 10 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Examples of Hypotheses You would like to determine if the diameters of the ball bearings you produce have a mean of 6.5 cm. H0: µ?=?6.5
Ha: µ???6.5
(Two-sided alternative)
11. 11 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. The students entering into the math program used to have a mean SAT quantitative score of 525. Are the current students poorer (as measured by the SAT quantitative score)? H0: µ = 525
(Really: µ ? 525)
Ha: µ < 525
(One-sided alternative) Examples of Hypotheses
12. 12 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Do the “16 ounce” cans of peaches canned and sold by DelMonte meet the claim on the label (on the average)? H0: µ = 16
(Really: ? =16)
Ha: µ?< 16 Examples of Hypotheses
13. 13 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Is the proportion of defective parts produced by a manufacturing process more than 5%? H0: ? = 0.05
(Really, ? ? 0.05)
Ha: ? > 0.05
Examples of Hypotheses
14. 14 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Do two brands of light bulb have the same mean lifetime? H0: µBrand A = µBrand B
Ha: µBrand A ?? µBrand B
Examples of Hypotheses
15. 15 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Do parts produced by two different milling machines have the same variability in diameters? Examples of Hypotheses
16. 16 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Comments on Hypothesis Form The null hypothesis must contain the equal sign.
This is absolutely necessary because the test requires the null hypothesis to be assumed to be true and the value attached to the equal sign is then the value assumed to be true and used in subsequent calculations.
The alternate hypothesis should be what you are really attempting to show to be true.
This is not always possible.
17. 17 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Form The form of the null hypothesis is
H0: population characteristic = hypothesized value
where the hypothesized value is a specific number determined by the problem context.
The alternative (or alternate) hypothesis will have one of the following three forms:
Ha: population characteristic > hypothesized value
Ha: population characteristic < hypothesized value
Ha: population characteristic ? hypothesized value
18. 18 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Caution When you set up a hypothesis test, the result is either
Strong support for the alternate hypothesis (if the null hypothesis is rejected)
There is not sufficient evidence to refute the claim of the null hypothesis (you are stuck with it, because there is a lack of strong evidence against the null hypothesis.
19. 19 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Error
20. 20 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Error Analogy Consider a medical test where the hypotheses are equivalent to
H0: the patient has a specific disease
Ha: the patient doesn’t have the disease
Then,
Type I error is equivalent to a false negative
(i.e., Saying the patient does not have the disease when in fact, he does.)
Type II error is equivalent to a false positive
(i.e., Saying the patient has the disease when, in fact, he does not.)
21. 21 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. More on Error The probability of a type I error is denoted by ??and is called the level of significance of the test.
Thus, a test with ? = 0.01 is said to have a level of significance of 0.01 or to be a level 0.01 test.
The probability of a type II error is denoted by ?.
22. 22 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between ??and ? Generally, with everything else held constant, decreasing one type of error causes the other to increase.
The only way to decrease both types of error simultaneously is to increase the sample size.
No matter what decision is reached, there is always the risk of one of these errors.
23. 23 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Comment of Process Look at the consequences of type I and type II errors and then identify the largest ? that is tolerable for the problem.
Employ a test procedure that uses this maximum acceptable value of ? (rather than anything smaller) as the level of significance (because using a smaller ? increases ?).
24. 24 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Test Statistic A test statistic is the function of sample data on which a conclusion to reject or fail to reject H0 is based.
25. 25 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. P-value The P-value (also called the observed significance level) is a measure of inconsistency between the hypothesized value for a population characteristic and the observed sample.
The P-value is the probability, assuming that H0 is true, of obtaining a test statistic value at least as inconsistent with H0 as what actually resulted.
26. 26 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Decision Criteria A decision as to whether H0 should be rejected results from comparing the P-value to the chosen ?:
H0 should be rejected if P-value ? ?.
H0 should not be rejected if P-value > ?.
27. 27 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Large Sample Hypothesis Test for a Single Proportion
28. 28 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Large Sample Test of Population Proportion
29. 29 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Large Sample Test of Population Proportion
30. 30 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Large Sample Test of Population Proportion
31. 31 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. An insurance company states that the proportion of its claims that are settled within 30 days is 0.9. A consumer group thinks that the company drags its feet and takes longer to settle claims. To check these hypotheses, a simple random sample of 200 of the company’s claims was obtained and it was found that 160 of the claims were settled within 30 days. Hypothesis Test Example Large-Sample Test for a Population Proportion
32. 32 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Example 2Single Proportion continued
33. 33 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Example 2Single Proportion continued
34. 34 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Example 2Single Proportion continued
35. 35 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. A county judge has agreed that he will give up his county judgeship and run for a state judgeship unless there is evidence at the 0.10 level that more then 25% of his party is in opposition. A SRS of 800 party members included 217 who opposed him. Please advise this judge. Hypothesis Test Example Single Proportion
36. 36 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test ExampleSingle Proportion continued
37. 37 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Example Single Proportion continued At a level of significance of 0.10, there is sufficient evidence to support the claim that the true percentage of the party members that oppose him is more than 25%.
Under these circumstances, I would advise him not to run.
38. 38 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Describe (determine) the population characteristic about which hypotheses are to be tested.
State the null hypothesis H0.
State the alternate hypothesis Ha.
Select the significance level ? for the test.
Display the test statistic to be used, with substitution of the hypothesized value identified in step 2 but without any computation at this point. Steps in a Hypothesis-Testing Analysis
39. 39 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Steps in a Hypothesis-Testing Analysis Check to make sure that any assumptions required for the test are reasonable.
Compute all quantities appearing in the test statistic and then the value of the test statistic itself.
Determine the P-value associated with the observed value of the test statistic
State the conclusion in the context of the problem, including the level of significance.
40. 40 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test (Large samples)Single Sample Test of Population Mean
41. 41 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Single Sample Test of Population Mean
42. 42 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Single Sample Test of Population Mean
43. 43 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Single Sample Test of Population Mean
44. 44 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Reality Check
45. 45 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test (? unknown) Single Sample Test of Population Mean
46. 46 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Single Sample Test of Population Mean
47. 47 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Single Sample Test of Population Mean
48. 48 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test Single Sample Test of Population Mean
49. 49 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Hypothesis Test (? unknown) Single Sample Test of Population Mean
50. 50 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Tail areas for t curves
51. 51 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Tail areas for t curves
52. 52 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. An manufacturer of a special bolt requires that this type of bolt have a mean shearing strength in excess of 110 lb. To determine if the manufacturer’s bolts meet the required standards a sample of 25 bolts was obtained and tested. The sample mean was 112.7 lb and the sample standard deviation was 9.62 lb. Use this information to perform an appropriate hypothesis test with a significance level of 0.05. Example of Hypothesis TestSingle Sample Test of Population Mean - continued
53. 53 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Example of Hypothesis TestSingle Sample Test of Population Mean - continued
54. 54 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Example of Hypothesis TestSingle Sample Test of Population Mean - continued
55. 55 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Example of Hypothesis TestSingle Sample Test of Population Mean - conclusion
56. 56 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Using the t table
57. 57 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Revisit the problem with a=0.10 What would happen if the significance level of the test was 0.10 instead of 0.05?
58. 58 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Comments continued Many people are bothered by the fact that different choices of ? lead to different conclusions.
This is nature of a process where you control the probability of being wrong when you select the level of significance. This reflects your willingness to accept a certain level of type I error.
59. 59 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Another Example A jeweler is planning on manufacturing gold charms. His design calls for a particular piece to contain 0.08 ounces of gold. The jeweler would like to know if the pieces that he makes contain (on the average) 0.08 ounces of gold. To test to see if the pieces contain 0.08 ounces of gold, he made a sample of 16 of these particular pieces and obtained the following data.
0.0773 0.0779 0.0756 0.0792 0.0777
0.0713 0.0818 0.0802 0.0802 0.0785 0.0764 0.0806 0.0786 0.0776 0.0793 0.0755
Use a level of significance of 0.01 to perform an appropriate hypothesis test.
60. 60 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Another Example
61. 61 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Another Example
62. 62 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Another Example
63. 63 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Another Example
64. 64 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Another Example Conclusion:
Since P-value = 0.006 ? 0.01 = ?, we reject H0 at the 0.01 level of significance.
At the 0.01 level of significance there is convincing evidence that the true mean gold content of this type of charm is not 0.08 ounces.
Actually when rejecting a null hypothesis for the ? alternative, a one tailed claim is supported. In this case, at the 0.01 level of significance, there is convincing evidence that the true mean gold content of this type of charm is less than 0.08 ounces.
65. 65 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Power and Probability of Type II Error The power of a test is the probability of rejecting the null hypothesis.
When H0 is false, the power is the probability that the null hypothesis is rejected. Specifically, power = 1 – ?.
66. 66 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Effects of Various Factors on Power The larger the size of the discrepancy between the hypothesized value and the true value of the population characteristic, the higher the power.
The larger the significance level, ?, the higher the power of the test.
The larger the sample size, the higher the power of the test.
67. 67 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Some Comments Calculating ? (hence power) depends on knowing the true value of the population characteristic being tested. Since the true value is not known, generally, one calculates ? for a number of possible “true” values of the characteristic under study and then sketches a power curve.
68. 68 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Example (based on z-curve) Consider the earlier example where we tested H0: µ = 110 vs. Ha: µ > 110 and furthermore, suppose the true standard deviation of the bolts was actually 10 lbs.
69. 69 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Example (based on z-curve)
70. 70 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Example (based on z-curve)