Introduction to Biostatistics (BIO/EPI 540) Lecture 11: Hypothesis Testing. Acknowledgement: Thanks to Professor Pagano (Harvard School of Public Health) for lecture material.
Testing "No human investigation can be called true science without passing through mathematical tests." Leonardo da Vinci (1452-1519), in Treatise on Painting
Sampling Paradigm Population (parameters μ, σ) → Sample (statistics x̄, S). Inference runs from the sample back to the population.
Inference • The sample mean x̄ is an estimate of μ • The sample variance S² is an estimate of σ² • Confidence intervals and hypothesis tests are equivalent techniques to quantify uncertainty in sample-derived inferences regarding population parameters
Confidence Interval - Illustration We know that cholesterol levels in US men 20-24 yrs are normally distributed with σ = 46 mg/100ml. We obtain a sample of n = 25 and want to infer μ.
Use of C.I. to infer the value of μ
If true • Alternatively, if μ = 211 and σ = 46 and we take a sample of size n = 25 from this population, then the Central Limit Theorem says that the sample mean is approximately normal with mean 211 and std. dev. 46/√25 = 46/5 = 9.2; i.e. x̄ ~ N(211, 9.2²).
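The claim about the sampling distribution can be checked by simulation. The sketch below is an illustration and not part of the lecture: it draws repeated samples of size 25 from a normal population with μ = 211 and σ = 46 and compares the spread of the sample means with 46/5 = 9.2. The number of repetitions and the random seed are arbitrary assumptions.

```python
import random
import statistics

MU, SIGMA, N = 211.0, 46.0, 25   # values from the slide
REPS = 20_000                    # number of simulated samples (arbitrary choice)

rng = random.Random(0)           # fixed seed so the run is repeatable

# Draw REPS samples of size N and record the mean of each one.
sample_means = []
for _ in range(REPS):
    sample = [rng.gauss(MU, SIGMA) for _ in range(N)]
    sample_means.append(statistics.fmean(sample))

theoretical_se = SIGMA / N ** 0.5              # 46 / 5 = 9.2
simulated_mean = statistics.fmean(sample_means)
simulated_se = statistics.stdev(sample_means)

print(f"theoretical std. dev. of the sample mean: {theoretical_se:.2f}")
print(f"simulated mean of the sample means:       {simulated_mean:.2f}")
print(f"simulated std. dev. of the sample means:  {simulated_se:.2f}")
```

The simulated values should land close to 211 and 9.2, with small differences due to sampling noise.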
Hypothesis Testing Hypothesis Testing & Trial by jury
Hypothesis Testing & Trial by jury Individual on trial. Is he/she innocent? Evidence Trial
Hypothesis Testing Test of hypothesis that μ = μ0? The sample plays the role of the evidence, and the analysis plays the role of the trial.
Possible errors in analysis results Probability of Type I error is α, i.e. the probability of rejecting the null hypothesis when it is true. Probability of Type II error is β, i.e. the probability of not rejecting the null hypothesis when it is false. 1 − β is the power of the test.
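To make α, β and power concrete, here is a minimal sketch for a two-sided z-test with known σ. It reuses μ0 = 211, σ = 46 and n = 25 from the cholesterol illustration. The true mean of 230 is a hypothetical value chosen only for illustration, and the function name `two_sided_z_power` is likewise an assumption, not something from the lecture.

```python
from statistics import NormalDist


def two_sided_z_power(mu0, mu1, sigma, n, alpha=0.05):
    """Power of a two-sided z-test of H0: mu = mu0 when the true mean is mu1."""
    std_normal = NormalDist()
    se = sigma / n ** 0.5                        # std. dev. of the sample mean
    z_crit = std_normal.inv_cdf(1 - alpha / 2)   # about 1.96 for alpha = 0.05
    shift = (mu1 - mu0) / se                     # mean of Z when mu1 is true
    # H0 is NOT rejected when -z_crit <= Z <= z_crit; that probability is beta.
    beta = std_normal.cdf(z_crit - shift) - std_normal.cdf(-z_crit - shift)
    return 1 - beta


alpha = 0.05
# mu1 = 230 is a hypothetical alternative, not a value from the lecture.
power = two_sided_z_power(mu0=211, mu1=230, sigma=46, n=25, alpha=alpha)
beta = 1 - power

print(f"alpha (Type I error)  = {alpha:.3f}")
print(f"beta  (Type II error) = {beta:.3f}")
print(f"power (1 - beta)      = {power:.3f}")
```

When the true mean equals μ0, the function returns α, which is the Type I error rate.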
2 sided hypothesis test - Illustration We know that cholesterol levels in US men 20-74 yrs are normally distributed with σ = 46 mg/100ml and μ = 211 mg/100ml. We obtain a random sample of 12 hypertensive smokers and obtain a sample mean of 217 mg/100ml. We want to test whether their population mean is the same as that of the general population.
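2 sided hypothesis test - Illustration H0: μ = 211 vs HA: μ ≠ 211, with σ = 46 mg/100ml. 12 hypertensive smokers have x̄ = 217 mg/100ml, so z = (x̄ − μ0)/(σ/√n) = (217 − 211)/(46/√12) ≈ 0.45.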
P-value Some prefer to quote the p-value. The p-value answers the question, “What is the probability of getting as large, or larger, a discrepancy, given that the null hypothesis is true?” Question: Do hypertensive smokers have the same mean as the general population?
Rejecting the null hypothesis • Assume a specific threshold of Type I error, α • Typically α = 0.05 • If p-value < α, reject the null hypothesis
P-value Question: Do hypertensive smokers have the same mean as the general population? With x̄ = 217, μ0 = 211, σ = 46 and n = 12, z ≈ 0.45 and the two-sided p-value is about 0.65, which is greater than α = 0.05. Answer: Do not reject the null hypothesis. There is no evidence that hypertensive smokers have a different mean than the general population.
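The two-sided test for the hypertensive smokers can be reproduced in a few lines. This is a sketch using only the numbers given in the illustration (μ0 = 211, σ = 46, n = 12, x̄ = 217) and Python's standard-library normal distribution; the variable names are illustrative.

```python
from statistics import NormalDist

mu0, sigma = 211.0, 46.0   # null mean and known population std. dev.
n, xbar = 12, 217.0        # sample size and observed sample mean
alpha = 0.05               # Type I error threshold

se = sigma / n ** 0.5      # std. dev. of the sample mean, 46 / sqrt(12)
z = (xbar - mu0) / se      # standardized discrepancy

# Two-sided p-value: probability of a discrepancy at least this large
# in either direction when the null hypothesis is true.
p_value = 2 * (1 - NormalDist().cdf(abs(z)))
reject = p_value < alpha

print(f"z = {z:.2f}, p-value = {p_value:.2f}, reject H0: {reject}")
```

Because the p-value is well above 0.05, the null hypothesis is not rejected.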
Summary Decide on a statistic: x̄. Determine which values of x̄ are consonant with the hypothesis that μ = μ0 and which ones are not. Look at x̄ and decide.
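One way to read "values consonant with the hypothesis" is as the range of sample means that a two-sided test at level α would not reject. The sketch below computes that range for the cholesterol example (μ0 = 211, σ = 46, n = 12), assuming α = 0.05, and checks where the observed mean of 217 falls.

```python
from statistics import NormalDist

mu0, sigma, n = 211.0, 46.0, 12
alpha = 0.05               # assumed Type I error threshold
xbar = 217.0               # observed sample mean from the example

se = sigma / n ** 0.5
z_crit = NormalDist().inv_cdf(1 - alpha / 2)   # about 1.96

# Sample means inside [lower, upper] do not lead to rejection of H0.
lower = mu0 - z_crit * se
upper = mu0 + z_crit * se
consonant = lower <= xbar <= upper

print(f"values of the sample mean consonant with mu = {mu0}: "
      f"[{lower:.1f}, {upper:.1f}]")
print(f"observed mean {xbar} is consonant with H0: {consonant}")
```

The observed mean lies inside the interval, which agrees with the decision not to reject the null hypothesis.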
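Alternative hypothesis Need to set up 2 hypotheses to cover all possibilities for μ. Choice of 3 possibilities: • H0: μ = μ0 vs HA: μ ≠ μ0 (two-sided) • H0: μ ≥ μ0 vs HA: μ < μ0 (one-sided) • H0: μ ≤ μ0 vs HA: μ > μ0 (one-sided)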
Example - One-sided alternative Blood glucose level of healthy persons has μ = 9.7 mmol/L and σ = 2.0 mmol/L. A sample of 64 diabetics yields a sample mean x̄ [value not shown]. Do diabetics have blood glucose levels that are higher on average when compared to the general population?
Example - One-sided alternative Blood glucose level of healthy persons has μ = 9.7 mmol/L and σ = 2.0 mmol/L. With n = 64, the std. dev. of the sample mean is 2.0/√64 = 0.25 mmol/L. p-value << 0.001. Answer: Reject the null hypothesis. There is significant evidence that diabetics have a higher mean level of glucose when compared to the general population.
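The mechanics of the one-sided test can be sketched as follows. The sample mean used here (11.0 mmol/L) is a hypothetical placeholder, since the observed value is not available in this text; only μ0 = 9.7, σ = 2.0 and n = 64 come from the example.

```python
from statistics import NormalDist

mu0, sigma, n = 9.7, 2.0, 64   # values from the example
xbar_hypothetical = 11.0       # ASSUMPTION: placeholder, not the lecture's value

se = sigma / n ** 0.5          # 2.0 / 8 = 0.25
z = (xbar_hypothetical - mu0) / se

# One-sided alternative HA: mu > mu0, so only large positive z counts
# as evidence against the null hypothesis.
p_value = 1 - NormalDist().cdf(z)

print(f"standard error = {se:.2f}, z = {z:.2f}, one-sided p-value = {p_value:.2e}")
```

With a standard error of only 0.25 mmol/L, even a sample mean about one unit above 9.7 produces a very small one-sided p-value.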
Summary • Hypothesis testing: • Type I and II errors • Power • Two sided hypothesis test • One sided hypothesis test