480 likes | 505 Views
Chapter 8. Estimation. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania. Estimating µ When σ is Known. Point Estimate. An estimate of a population parameter given by a single number. Margin of Error.
E N D
Chapter 8 Estimation Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania
Point Estimate An estimate of a population parameter given by a single number.
Margin of Error Even if we take a very large sample size, will differ from µ.
Confidence Levels A confidence level, c, is any value between 0 and 1 that corresponds to the area under the standard normal curve between –zc and +zc.
Recall From Sampling Distributions If we take samples of size n from our population, then the distribution of the sample mean has the following characteristics:
A Probability Statement In words, c is the probability that the sample mean will differ from the population mean by at most
Maximal Margin of Error Since µ is unknown, the margin of error | - µ| is unknown. Using confidence level c, we can say that differs from µ by at most:
Critical Thinking Since is a random variable, so are the endpoints After the confidence interval is numerically fixed for a specific sample, it either does or does not contain µ.
Critical Thinking If we repeated the confidence interval process by taking multiple random samples of equal size, some intervals would capture µ and some would not! Equation states that the proportion of all intervals containing µ will be c.
Estimating µ When σ is Unknown In most cases, researchers will have to estimate σ with s (the standard deviation of the sample). The sampling distribution for will follow a new distribution, the Student’s t distribution.
The t Distribution Use Table 6 of Appendix II to find the critical values tcfor a confidence level c. The figure to the right is a comparison of two t distributions and the standard normal distribution.
Using Table 6 to Find Critical Values Degrees of freedom, df, are the row headings. Confidence levels, c, are the column headings.
Maximal Margin of Error If we are using the t distribution:
Estimating p in the Binomial Distribution We will use large-sample methods in which the sample size, n, is fixed. We assume the normal curve is a good approximation to the binomial distribution if both np > 5 and nq = n(1-p) > 5.
Margin of Error The magnitude of the difference between the actual value of p and its estimate is the margin of error.
The Distribution of The distribution is well approximated by a normal distribution.
A Probability Statement With confidence level c, as before.
Choosing Sample Sizes When designing statistical studies, it is good practice to decide in advance: The confidence level The maximal margin of error Then, we can calculate the required minimum sample size to meet these goals.
Sample Size for Estimating μ If σ is unknown, use σ from a previous study or conduct a pilot study to obtain s. Always round n up to the next integer!!
Sample Size for Estimating If we have no preliminary estimate for p, use the following modification:
Independent Samples Two samples are independent if sample data drawn from one population is completely unrelated to the selection of a sample from the other population. Occurs when we draw two random samples
Dependent Samples Two samples are dependent if each data value in one sample can be paired with a corresponding value in the other sample. Occur naturally when taking the same measurement twice on one observation Example: your weight before and after the holiday season.
Confidence Intervals forμ1 – μ2 when σ1, σ2 unknown If σ1, σ2 are unknown, we use the t distribution (just like the one-sample problem).
What if σ1 = σ2 ? If the sample standard deviations s1 and s2 are sufficiently close, then it may be safe to assume that σ1 = σ2. Use a pooled standard deviation. See Section 8.4, problem 27.
Estimating the Difference in Proportions We consider two independent binomial distributions. For distribution 1 and distribution 2, respectively, we have: n1p1q1r1 n2p2q2r2 We assume that all the following are greater than 5: