1 / 30

Prof. Dr. Md. Rezaul Karim Urban and Rural Planning Discipline Khulna University

Higher Education Quality Enhancement Project Urban and Rural Planning Discipline Khulna University T raining on Research Methods and Techniques. Basic Sampling Theory for Simple and Cluster Samples. Prof. Dr. Md. Rezaul Karim Urban and Rural Planning Discipline Khulna University.

uriah-hood
Download Presentation

Prof. Dr. Md. Rezaul Karim Urban and Rural Planning Discipline Khulna University

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Higher Education Quality Enhancement Project Urban and Rural Planning Discipline Khulna University Training on Research Methods and Techniques Basic Sampling Theory for Simple and Cluster Samples Prof.Dr. Md. Rezaul Karim Urban and Rural Planning Discipline Khulna University

  2. Sample design • The focus of the design for a sample must be on the magnitude of the standard errors of sampling not than on an arbitrary percentage of the target population. • The standard errors are used to calculate confidence intervals around the sample data.

  3. Standard errors The next sequence aims to explain standard errors, and how they relate to the underlying target population and a sample drawn from this population.

  4. Graph: Target population • Population: mean = , standard deviation = 

  5. Graph: Sample • Sample: mean = x, standard deviation = s

  6. Graph: Means from many samples • However we could get many different samples with different sample means from the population.

  7. Graph: Distribution of sample means • This gives us a sampling distribution of sample means:

  8. Sampling distribution of sample means • normal distribution • mean =  = mean of underlying population distribution • standard deviation =  / n

  9. Standard error of a population mean The standard deviation of the sampling distribution of sample means is termed the standard error of a mean. standard error of population mean =  / n

  10. Central limit theorem The central limit theorem states that the link between the mean of one sample and the population mean is given by: x =  z. se(popn mean) If z = 1.96 we produce a confidence interval where we find 95% of the sample means, relative to 

  11. Estimated population mean However, we are not interested in finding the sample means given the population mean. Our aim is to locate the population mean given what we know about the sample.

  12. Estimated population mean We start with a simple random sample and assume: • s is a good estimate of  • se(sample mean) is a good estimate of se(popn mean)

  13. Estimated population mean Then instead of x =  z. se(popn mean) we can write  = x  z. se(sample mean) where se(sample mean) = s / n

  14. Standard error of a proportion (srs) The standard error of a percentage (proportion) is: se(prop) =  [p(1-p)/n]

  15. Standard error of a proportion = 0.50 (srs) For p=0.50 se(p50) =  [0.50(1-0.50)/n] The standard error may be multiplied by a finite population correction (FPC) of (N-n)/N

  16. Confidence intervals • Confidence intervals are usually expressed at the 95 per cent level (1.96 standard errors of sampling for a proportion)

  17. Table: Effect of sample size on standard error

  18. Two stage samples • The most efficient method is usually sampling at the first stage with probability proportional to size (pps). • This produces a self-weighting sample. • Easier logistics for administration.

  19. Two stage samples Stage 1 Primary sampling units (psu) are selected with a probability proportional to the size of the target population in the psu. Example of psu: schools

  20. Two stage samples Stage 2 A random cluster of secondary sampling units (ssu) is selected at random from each of the psu. Example of ssu: students in schools

  21. Deff • Two-stage sampling is less efficient than a simple random sample (srs) of the same size. deff = (standard error of sampling for complex sample)2 / (standard error of sampling for srs)2

  22. Deft • The square root of deff is deft, which gives the ratio of the standard errors of sampling. deft = (standard error of sampling for complex sample) / (standard error of sampling for srs)

  23. Simple equivalent sample • The simple equivalent sample (ses) is the size of a simple random sample which has the same standard error as the complex sample. • We sometime use the term effective sample (neff)

  24. Simple equivalent sample The size of simple equivalent sample = size of complex sample / deff deff = 1 + (rho)(b-1) = 1 + (0.10)(20-1) = 2.9 where rho = intraclass correlation b = cluster size

  25. Compare srs and ses For a simple random sample of n=1000, the 95 per cent confidence interval is given by: =  1.96 se(p50) =  1.96 [0.50(1-0.50)/1000] =  0.031 =  3.1%

  26. Compare srs and ses For a simple equivalent sample of n=345 (corresponding to a complex sample of n=1000), the 95 per cent confidence interval is given by: =  1.96 se(p50) =  1.96 [0.50(1-0.50)/345] =  0.053 =  5.3%

  27. Table: Values for deff and simple equivalent sample

  28. Random clusters • PPS sampling assumes a random cluster which is the same size for each ssu. • In practice, we often draw an intact group at random. • This usually increase the intraclass correlation for the sample.

  29. Weighting • Achieved samples are unlikely to properly represent the proportions of persons in the target populations for the strata. • Weights are applied so that the achieved sample for each stratum represents its proportion in the total target population.

  30. Weighting wh = Nh/nh where nh = the size of the achieved sample for the stratum Nh = the size of the target population for the stratum

More Related