230 likes | 502 Views
Module 12: Populations, Samples and Sampling Distributions. This module provides basic information about the statistical concepts of populations and samples, selecting samples from population and the critical issue of sampling distributions. Reviewed 05 May 05 / MODULE 12.
E N D
Module 12: Populations, Samples and Sampling Distributions This module provides basic information about the statistical concepts of populations and samples, selecting samples from population and the critical issue of sampling distributions. Reviewed 05 May 05 / MODULE 12
Populations and Samples • Population: The entire group about which information is desired. • Sample: A proportion or part of the population - usually the proportion from which information is gathered.
Target Population • The participants to whom the answer to the question pertains. • The target population definition has two aspects: • Conceptual • Operational
Population Definition A population definition gives a clear statement of those included. The following are some examples: • Adults and children 10-59 years of age residing in four census tracts in Richfield, a suburb of Minneapolis • Adults 25-59 years of age residing in Cedar County, Iowa and certain rural townships in neighboring counties on July 1, 1973 • Employees of Pacific Northwest Bell Telephone Company working in King County
Sampling • In its broadest sense, sampling is a procedure by which one or more members of a population are picked from the population. • The objective is to make certain observations upon the members of the sample and then, on the basis of these results, to draw conclusions about the characteristics of the entire population.
Selecting a Sample • Haphazard Sample: Haphazard samples are constructed by arbitrarily selecting individual sample members. • Random Sample: There are several methods for constructing random samples—we consider only simple random samples. This process operates so that each member of the population has an equal chance of being selected into the sample.
Selecting a Sample The selection process: • Assign to each member of the population the equivalent of sequential ID number; • Use a random number table or computer generated numbers; • For computer generated numbers, generate one for each ID number, sort the ID numbers in order according to the random number and take the first on the list up to the point when you have the sample size you need • For a table, haphazardly select a starting point and then • Ignore numbers that are too large •Ignore a number after it appears the first time
Fundamental and Important Concept We now begin the discussion of perhaps the most important concept in biostatistics. It is fundamental to understanding and thus interpreting correctly the use of the many statistical tools we will cover in this course. The concept is not complex, in fact, it is rather simple. It does require, however, thinking about issues in a manner that may initially appear somewhat different and unusual.
Looking at the Process When we randomly select a sample from a population, we can use the mean for the sample as an estimate or guess as to the value for the mean of the population. This should bring up the question as to how good is this sample mean or sample statistic as a guess for the value of the population mean or population parameter. The essence of this question has to do with how well this processworks—the process of using a sample to make guesses about the population.
Understanding the Process Two important aspects of this fundamental process: FIRST: It is critical to recognize that it is a process SECOND: It is important to understand how and how well the process works
How Good is a Sample Mean The essential question is “How good is a sample mean as an estimate of the population mean?” One way to examine this question is to understand that we used a process that involved randomly selecting a sample from the population and then calculating the mean for the values of the observations in the sample. We can repeat this process as many times as we wish and examine what it produces.
156 121 149 105 201 Sample with n = 5 46 198 217 189 149 172 162 42 121 198 201 309 111 220 100 201 261 156 … 133 Population of weights Sample of 5 weights
Standard Error of the Mean The population that includes all possible samples of size n is a long list of numbers and the variance for these numbers can, in theory, be calculated. The square root of this variance is called the standard error of the mean. It is simply the standard deviation for this population of means.
145 136 161 154 181 127 113 198 206 101 114 148 191 151 189 105 111 102 133 120 171 Sample with n = 20 Sample of 20 weights