1 / 41

Understanding Confidence Intervals in Statistical Estimation

Learn about point estimates, confidence intervals, standard error, constructing and interpreting confidence intervals in statistics. Gain insights on determining population mean with examples.

luisaf
Download Presentation

Understanding Confidence Intervals in Statistical Estimation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Estimation and Confidence Intervals Chapter 9

  2. Terminology Point Estimate …is a single value (statistic) used to estimate a population value (parameter) …states the range within which a population parameter probably lies Interval Estimate Confidence Interval …is a range of values within which the population parameter is expected to occur

  3. Desirable properties of a point estimator • efficient … possible values are concentrated close to the value of the parameter • consistent …values are distributed evenly on both sides of the value of the parameter • unbiased …unbiasedwhen the expected value equals the value of the population parameter being estimated. Otherwise, it is biased!

  4. Terminology s = n s s x x Standard error of the sample mean …isthe standard deviation of the sampling distribution of the sample means It is computed by where …is the symbol for the standard error of the sample mean s …is the standard deviation of the population n …is the size of the sample

  5. s s x = n Standard Error of the Means If is not known and n > 30, the standard deviation of the sample(s)is used toapproximate the population standard deviation Computed by…

  6. 1. The sample size, n 2. The variability in the population, usually estimated by s 3. The desired level of confidence Factors …that determine the width of a confidence interval are:

  7. s n z ± x α/2 Constructing Confidence Intervals IN GENERAL, A confidence interval for a mean is computed by: Interpreting…

  8. Alanya Interpreting Confidence Intervals Suppose that you read that “…the average selling price of a home in Alanya is $200 000+/- $15000 at 95% confidence!” This means…what?

  9. Alanya “…the average selling price of a home in Alanya is $200 000+/- $15 000 at 95% confidence!” Interpreting Confidence Intervals In statistical terms, this means: …that we are 95% surethat the interval estimateobtained contains the value of the population mean. Lower confidence limit is $185 000 ($200 000 - $15 000) Upperconfidence limit is $215 000 ($200 000 + $15 000) Also…

  10. Alanya Interpreting Confidence Intervals Your newspaper also reports that… You select a random sample of 36homes sold during the past year, and determine a 90% confidence intervalestimate for the population mean to be (31-39) days. “…the mean time to sell a home in Alanya is 40 days. Do your sample results support the paper’s claim?

  11. Interpreting Confidence Intervals Lower confidence limit is 31 days You select a random sample of 36homes sold during the past year, and determine a 90% confidence intervalestimate for the population mean to be (31-39) days. Upperconfidence limit is 39 days Our evidence does not support the statement made by the newspaper, i.e., the population mean is not 40 days, when using a 90% interval estimate There is a 10% chance (100%-90%) that the interval estimate does not contain the value of the population mean!

  12. Interpreting Confidence Intervals 90% Confidence Interval … 10% chance of falling outside this interval …or, focus on tail areas … i.e.  = 0.10 .05 .05 90% 39 31  is the probability of a value falling outsidethe confidence interval

  13. Try it! LocateArea on the normal curve 1 2 Look up a=0.46 in Table to get the corresponding z-score This is a 92% confidence interval Find the appropriate value of z: 0.92 0 -1.75 1.75 Search in the centre of the table for the areaof 0.46 Z = +/- 1.75

  14. Constructing Confidence Intervals s n z ± x α/2 Common Confidence Intervals Also, 95% of the sample means for a specifiedsample size will lie within 1.96standard deviations of the hypothesized population mean. About 95% of the constructed intervals will contain the parameter being estimated. 95% C.I. for the mean: 99% C.I. for the mean:

  15. Interval Estimates s s n n Use the t-table… z t ± ± x x α/2 α/2 Use the z table… If the population standard deviation is knownor n > 30 If the population standard deviation is unknown and n<30 More on this later…

  16. Q uestion A nswer The Dean of the Tourism Faculty wants to estimate the mean number of hours worked per week by students. A sample of 49 students showed a meanof 24 hours with a standard deviation of 4 hours. What is the population mean? Our best estimate is 24 hours. This is a point estimate.

  17. 95 percent confidence Commonly denoted as 1-α s z ± α/2 x n 4 S 49 olution Find the 95 percent confidenceinterval for the population mean. Q Mean = 24 SD = 4 N = 49 Z = +/- 1.96 95% Confidence Substitute values: +1.96 24 = 24 +/- 1.12 The Confidence Limitsrangefrom22.88 to 25.12

  18. Interval Estimates 90% confidence level 1-α = 0.9 orα = 0.10 99% confidence level 1-α = 0.99 orα= 0.010

  19. Student’s t-distribution ….used for small sample sizes Characteristics …like z, the t-distribution is continuous …takes values between –4and +4 …it is bell-shaped and symmetric about zero …it is more spread out and flatter at the centre than the z-distribution …for larger and larger values of degrees of freedom, the t-distribution becomes closer and closerto the standard normal distribution

  20. tdistribution Zdistribution Student’s t-distribution Chart 9-1 Comparison of The Standard Normal Distribution and the Student’stDistribution The t distribution should be flatter and more spread out than thezdistribution

  21. 0.10 t T -table Student’s t-distribution Example …with df = 9 and 0.10 area in the upper tail… t = 1.383

  22. 1.383 Student’s t-distribution Confidence Intervals 80% 90% 95% 98% 99% Level of Significance for One-Tailed Test 0.100 0.050 0.025 0.010 0.005 0.10 Level of Significance for Two-Tailed Test 0.20 0.10 0.05 0.02 0.01 9

  23. When? …to use the zDistribution or the tDistribution ? Population Normal? NO YES Population standard deviation known? n30 ormore? NO YES NO YES Use a nonparametric test (see Ch16) Use thetdistribution Use thez distribution Use thez distribution

  24. Q The Dean of the Tourism Faculty wants to estimate the mean number of hours worked per week by students. A sample of only 12 students showed a meanof 24 hours with a standard deviation of 4 hours. Student’s t-distribution Find the 95 percent confidenceintervalfor the population mean. n is small so use the t- Distribution

  25. Q Data Formula = 24 n = 12 s = 4 df = 12-1 = 11  = 1 – 95% =.05 X S olution s t ± α/2 x n …sample of only 12 students …a meanof 24 hours …a standard deviation of 4 hours Looking up 5% level of significance for a two-tailed test with 11df, we find…

  26. Student’s t-distribution Confidence Intervals 80% 90% 95% 98% 99% Level of Significance for One-Tailed Test 0.100 0.050 0.025 0.010 0.005 Level of Significance for Two-Tailed Test 0.05 0.20 0.10 0.05 0.02 0.01 11 2.201

  27. Q Data Formula = 24 n = 12 s = 4 df = 12-1 = 11  = 1 – 95% =.05 X S olution s t 4 ± α/2 x n 24 ± 2.201 12 …sample of only 12 students …a meanof 24 hours …a standard deviation of 4 hours Looking up 5% level of significance for a two-tailed test with 11df, we find… t0.025 = 2.201 = 24 +/- 2.54 The confidence limits range from 21.46 to 26.54 Compare these with earlier limits of 22.88 to 25.12

  28. Student’s t-distribution Q The manager of the college cafeteria wants to estimate the mean amount spent per customer per purchase. A sample of 10 customers revealed the following amounts spent: $4.45 $4.05 $4.95 $3.25 $4.68 $5.75 $6.01 $3.99 $5.25 $2.95 Determine the 99% confidence interval for the mean amount spent.

  29. = $4.53 s = $1.00 X 1.00 10 S olution s t ± α/2 x n Student’s t-distribution $4.45 $4.05 $4.95 $3.25 $4.68 $5.75 $6.01 $3.99 $5.25 $2.95 Step 1 Determine the sample mean and standard deviation. Step 2 Enter the key datainto the appropriate formula. 10 df = 10 – 1 = 9  = 1-99% = .01 n = ± = 4.53 3.25 Formula = $4.53 +/- $1.03 We are 99%confident that the mean amount spent per customer is between $3.50 and $5.56

  30. Constructing Confidence Intervals for Population Proportions Formula A confidence interval for a population proportion is estimated by: where p …is the symbolfor the sample proportion Q ...

  31. Q Constructing Confidence Intervals for Population Proportions uestion A sample of 500 executives who own their own home revealed 175 planned to sell their homes and retire to Alanya. Develop a 98%confidence interval for the proportion of executives that plan to sell and move to Alanya.

  32. Constructing Confidence Intervals for Population Proportions Formula - . 35 ( 1 . 35 ) ± . 35 2 . 33 500 S olution ± . 35 . 0497 A sample of 500 executives who own their own home revealed 175 planned to sell their homes and retire to Alanya. Develop a 98%confidence interval for the proportion of executives… n = p = z = 500 2.33 175/500 = .35 98% CL =

  33. Q uestion Correction Factor s N - n Formula s = x N - 1 n Finite-Population Correction Factor Used when n/Nis 0.05 or more The attendance at the college hockey game last night was 2700. A random sample of 250 of those in attendance revealed that the average number of drinks consumed per person was 1.8 with a standard deviation of 0.40. Developa 90%confidence interval estimate for the mean number of drinks consumed per person.

  34. - . 4 2700 250 ± 1 . 8 1 . 645 ( )( ) s N - n ± Formula Zα/2 - X 2700 1 250 n N - 1 N = n = x = s = /2 = S olution ± 1 . 8 0 . 04 Finite-Population Correction Factor The attendance at the college hockey game last night was 2700. A sample of 250 of those in attendance revealed that the average number of drinks consumed per person was 1.8 with a standard deviation of 0.40. Developa 90%confidence intervalestimate.… 2700 250 1.8 0.05 0.40 Since 250/2700>.05,use the correction factor 90% CL =

  35. Selecting the Sample Size

  36. Factors …that determine the sample size are: 1. The degree of confidence selected 2. The maximum allowable error 3. The variation in the population

  37. Selecting the Sample Size 2 zα/2 s æ ö ç ÷ E è ø Formula n = where E… is the allowable error Z …is the z-score for the chosen level of confidence S …is the sample deviation of the pilot survey

  38. Selecting the Sample Size Q uestion A consumer group would like to estimate the mean monthly electricity charge for a single family house in July (within $5)using a 99 percent level of confidence. Based on similar studies the standard deviation is estimated to be $20.00. How large a sample is required?

  39. Selecting the Sample Size 2 zα/2 s Formula æ ö ç ÷ E è ø 2 æ æ 2.58 · 20 = ç ç 5.00 è è S olution A minimum of 107 homes must be sampled. 90% CL = A consumer group would like to estimate the mean monthly electricity charge for a single family house in July (within $5)using a 99 percent level of confidence. Based on similar studies the standard deviation is estimated to be $20.00. = (10.32)2 = 106.5

  40. Selecting the Sample Size Q uestion Alanya’s Mayor wants to estimate theproportion of tourist family that have a children. Assume a 95% level ofconfidence and that the Mayor estimates that 30% of the family have children. If the Mayor wants the estimate to bewithin 3% of the population proportion, how many family would they need to contact?

  41. Selecting the Sample Size Q uestion New Formula 2 æ ö Z = - ç ÷ n p ( 1 p ) è ø E 2 æ ö 1 . 96 = - ç ÷ . 3 ( 1 . 3 ) è . 03 ø 2 = ) ( (. 21 ) 65 . 33 The Mayor of Alanya wants to estimate theproportion of tourist family that have children. Assume a 95% level ofconfidence and that the Mayorestimates that 30% of the family have children. n= 896.4 A minimum of 897 family must be sampled.

More Related