1 / 15

Why Draw A Sample?

Why Draw A Sample? . Why not just the get the whole enchilada? Pragmatic reasons The true population is typically “unknowable” When done right, a small proportion of the population works just fine…. Types of Sampling. Probability Sampling Based on the principles of probability theory

manton
Download Presentation

Why Draw A Sample?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Why Draw A Sample? • Why not just the get the whole enchilada? • Pragmatic reasons • The true population is typically “unknowable” • When done right, a small proportion of the population works just fine…

  2. Types of Sampling • Probability Sampling • Based on the principles of probability theory • Elements of the population have some known probability (typically equal odds) of selection • Non-probability sampling • Elements in the population have unknown odds of selection • Make it very difficult to generalize findings back to the population of interest

  3. Probability Sampling • Terminology • Element • Population • Sample • Sampling Frame • Parameter vs. Statistic

  4. Probability Sampling • Advantages • Avoids both conscious and unconscious bias • By using probability theory, we can judge the accuracy of our findings • There is ALWAYS ERROR in any sample • No sample perfectly reflects the entire population • Key issue = How much error is likely in our specific sample?

  5. Probability Theory • A branch of mathematics that allows us to gauge how well our samplestatistics reflect the true populationparameters. • Based on HYPOTHETICAL distributions • What would happen if we took an infinite number of unbiased (random) samples from a population and plotted the results? • Some “weird” findings just by chance (large errors) • Findings closer to the true parameter more likely (small errors) • Population parameter is most likely outcome (top of curve)

  6. Probability Theory II • Hypothetical distributions are called: • Sampling distributions • Because they are based on drawing an infinite # of samples • Probability distributions • Because they tell us the probability of obtaining particular sample outcomes • Sampling/probability distributions exist for any kind of sample outcome you can imagine • ALL OF THEM PRODUCE “KNOWN” ESTIMATES OF ERROR

  7. Probability Theory III • Standard error • The standard deviation of a sampling/probability distribution • KEY POINT: standard deviations for normal (bell shaped) sampling distribution always contain the same percent of sample outcomes • +/-1 Standard Error contains 68% of outcomes • +/- 1.96 Standard Errors contains 95% of outcomes • +/- 2.58 Standard Errors contains 99% of outcomes

  8. Sampling Distribution .95  95% of all sample outcomes (from infinite number of samples) will be within this window 0.95 True PopulationParameter -1.96 1.96  Standard Errors 5% of the time, you would get a weird finding this different from the true parameter

  9. Probability Theory IV • From sampling distribution to our single sample • IF: 95% of sample statistics (assuming an infinite number of samples) fall within +/- 1.96 standard errors of the true population parameter • Then, there is a 95% chance that our single statistic falls within 1.96 standard errors of the population parameter • SO- we “go out” 1.96 (or 2.58 for 99% confidence) standard errors to create a confidence interval

  10. Calculating what one standard error is “worth” for your sample outcome • The calculations always include • Sample size (N) • Some estimate of dispersion • There are formulas for every situation • Babbie The “Binomial” • Used for agree/disagree questions (% agree) in polling data

  11. Example: Feb 12, 2009 Gallup Poll On Darwin’s Birthday, Only 4 in 10 Believe in Evolution: Belief drops to 24% among frequent church attendees • “PRINCETON, NJ -- On the eve of the 200th anniversary of Charles Darwin's birth, a new Gallup Poll shows that only 39% of Americans say they "believe in the theory of evolution," while a quarter say they do not believe in the theory, and another 36% don't have an opinion either way. These attitudes are strongly related to education and, to an even greater degree, religiosity.” Survey Methods • “Results are based on telephone interviews with 1,018 national adults, aged 18 and older, conducted Feb. 6-7, 2009, as part of Gallup Poll Daily tracking. For results based on the total sample of national adults, one can say with 95% confidence that the maximum margin of sampling error is X percentage points.” • CAN YOU CALCULATE THEIR MARGIN OF ERROR?

  12. Confidence Intervals for Proportions • Sample point estimate (convert % to a proportion): • “39% of Americans say they ‘believe in the theory of evolution,’ while…”  estimate is .39 • Formula in Babbie (p.217): P (1-P) --------- N 95% confidence level  1.96 standard errors

  13. What happens when… • Still using 39%... • Change the sample size to 100 • Use original sample size, but change confidence level to 68% • Use the original sample size, but assume the percentage is 96% rather than 39%

  14. Types of Probability Sampling • EPSEM • SYSTEMATIC (every Kth element) • Stratified • Cluster • Disproportionate (oversample) + weighting

  15. Composite Measures • Why use composite (adding indicators up) as opposed to a single indicator of a concept? • Index • Typically adding yes/no (coded as 0/1) or ordinal scales (codes as 1-4) together • “Simple Summated Index” • SPSS review/demo– or, just do the assignment?

More Related