1 / 21

Agenda

Agenda. Informationer Reserver tid til projekt 2 Introduktion til projekt 2 Opsamling fra sidst Sampling distribution. Den empiriske regel: 68 – 95 – 99,7% Rule. 68% of the observations fall within 1 standard deviation of the mean

lloyd
Download Presentation

Agenda

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Agenda • Informationer • Reserver tid til projekt 2 • Introduktion til projekt 2 • Opsamling fra sidst • Sampling distribution

  2. Den empiriske regel: 68 – 95 – 99,7% Rule • 68% of the observations fall within 1 standard deviation of the mean • 95% of the observations fall within 2 standard deviations of the mean • 99.7% of the observations fall within 3 standard deviations of the mean

  3. Eksempel på brug af normalfordelingen De besøgende på en hjemmeside bruger i gns. 300 sekunder på forsiden, før de klikker videre til en underside. Besøgstiden er normalfordelt med en standardafvigelse på 50 sekunder. Hvad er sandsynligheden for at tilfældig besøgende højest bruger 265 sekunder på forsiden? X = 265, μ = 300, σ = 50. Hvad er P(x<265)? Svaret er 0,24196....

  4. z-score X angiver tiden før der klikkes videre fra forsiden til en underside. X er normalfordelt med μ = 300 og σ = 50. X’s z-score angiver hvor mange (antal) standardafvigelser, X ligger fra μ Hvad er sandsynligheden for at en tilfældig besøgende bruger mindre end 240 sekunder på forsiden? z = (X – μ) / σ = (240 – 300) / 50 = -60 / 50 = -1,2 P(z<-1,2) = 0,1151.

  5. Learning Objectives • Statistic vs. Parameter • Sampling Distributions • Mean and Standard Deviation of the Sampling Distribution of a Proportion • Standard Error • Sampling Distribution Example • Population, Data, and Sampling Distributions

  6. Population Sample Learning Objective 1:Statistic and Parameter • A statistic is a numerical summary of sample data such as a sample proportion or sample mean. • A parameter is a numerical summary of a population such as a population proportion or population mean. • In practice, we seldom know the values of parameters. • We use sample data to estimate parameters. • Our estimates are called statistics. • Hvad er følgende? • μ • s • σ

  7. Learning Objective 2:Sampling Distributions Example: • Prior to counting the votes, the proportion in favor of Mr. Barack Obama is an unknown parameter. • An exit poll of 3.160 voters reported that the sample proportion in favor of a recall was 0.54. • If a different random sample of about 3.000 voters were selected, a different sample proportion would occur. Hvad betyder følgende (definition)? The sampling distribution of the sample proportion shows all possible values and the probabilities for those values.

  8. Learning Objective 2:Sampling Distributions • The sampling distribution of a statistic is the probability distribution that specifies probabilities for the possible values the statistic can take. • Sampling distributions describe the variability that occurs from study to study using statistics to estimate population parameters

  9. Learning Objective 3:Mean and SD of the Sampling Distribution of a Proportion • For a random sample of size n from a population with proportion p of outcomes in a particular category, the sampling distribution of the proportion of the sample in that category has

  10. Learning Objective 4:The Standard Error • To distinguish the standard deviation of a sampling distribution from the standard deviation of an ordinary probability distribution, we refer to it as a standard error. • Example: If the population proportion supporting the reelection of Mr. Obama was 0.50, would it have been unlikely to observe the exit-poll sample proportion of 0.565?

  11. Learning Objective 5:Example: 2012 Election • An exit poll had 2.705 people • Assume 50% support the reelection of Mr. Obama • Find the estimate of the population proportion and the standard error.

  12. Learning Objective 5:Example: 2012 Election

  13. Learning Objective 5:Example: 2006 California Election • The sample proportion of 0.565 is more than six standard errors from the expected value of 0.50. • The sample proportion of 0.565 voting for reelection of Mr. Obama would be very unlikely if the population proportion were p = 0.50

  14. Learning Objectives • The Sampling Distribution of the Sample Mean • Effect of n on the Standard Error • Central Limit Theorem (CLT)

  15. Learning Objective 1:The Sampling Distribution of the Sample Mean • The sample mean, x, is a random variable. • The sample mean varies from sample to sample. • By contrast, the population mean, µ, is a single fixed number.

  16. Learning Objective 1:The Sampling Distribution of the Sample Mean • For a random sample of size n from a population having mean µ and standard deviation σ, the sampling distribution of the sample mean has: • Center described by the mean µ (the same as the mean of the population). • Spread described by the standard error, which equals the population standard deviation divided by the square root of the sample size: • standard error of

  17. Learning Objective 2:Effect of n on the Standard Error • Knowing how to find a standard error gives us a mechanism for understanding how much variability to expect in sample statistics “just by chance.” • The standard error of the sample mean = • As the sample size n increases, the standard error decreases. • With larger samples, the sample mean is more likely to fall closer to the population mean.

  18. Learning Objective 3:Central Limit Theorem (CLT) • For random sampling with a large sample size n, the sampling distribution of the sample mean is approximately a normal distribution. • This result applies no matter what the shape of the probability distribution from which the samples are taken.

  19. Learning Objective 3:CLT: Impact of increasing n

  20. Learning Objective 3:CLT: How Large a Sample? • The sampling distribution of the sample mean takes more of a bell shape as the random sample size n increases. • The more skewed the population distribution, the larger n must be before the shape of the sampling distribution is close to normal. • In practice, the sampling distribution is usually close to normal when the sample size n is at least about 30. • If the population distribution is approximately normal, then the sampling distribution is approximately normal for all sample sizes.

  21. Learning Objective 3:CLT: Making Inferences • For large n, the sampling distribution is approximately normal even if the population distribution is not. • This enables us to make inferences about population means regardless of the shape of the population distribution.

More Related