460 likes | 588 Views
BHV 390: Research Methods Probability Sampling Techniques Kimberly Porter Martin, Ph.D. What is a Population?. DEFINITION: The group to which you want to generalize your findings. IN OTHER WORDS: The larger group you are representing with your sample. OR
E N D
BHV 390: Research Methods Probability Sampling Techniques Kimberly Porter Martin, Ph.D.
What is a Population? DEFINITION: The group to which you want to generalize your findings. IN OTHER WORDS: The larger group you are representing with your sample. OR The larger group to which your results will apply.
What is a Sample? DEFINITION A subset of the population being studied from which data is actually collected. A good sample accurately represents all kinds of elements/members in proportion to their presence in the population.
Sampling Techniques Sampling techniques are the processes by which the subset of the population from which you will collect data are chosen. There are TWO general types of sampling techniques: 1) PROBABILITY SAMPLING 2) NON-PROBABILITY SAMPLING
Probability Sampling • The process of selecting a sample from a population using (statistical) probability theory insuring that • each element/member of the population has an equal chance of being included in the sample, and • the researcher can estimate the error caused by not collecting data from all elements/members of the population (called “sampling error”).
Frames DEFINITION A frame is a list of EVERY element/member of a population. In order to do probability sampling, you MUST have access to a frame for the population you have chosen. You CANNOT do probability sampling without a frame.
Probability Sampling is ALWAYS superior to Non-probability Sampling. Probability Sampling is more difficult and time consuming and is not always possible.
Types of Probability Sampling • Simple Random Sample • (SRS) • Systematic Sample with a Random Start (SSRS) • Stratified Sample • (SS) • Multistage Cluster Sample • (MCS)
Steps in a Simple Random Sample (SRS) • Choose and nominally define a population • Locate a frame for the population • Choose a sample size • Assign a consecutive number to the elements/members of the frame • Obtain a table of random numbers (TRN) • Count the number of digits in the number assigned to the LAST member/element of the frame. • Choose a random starting point in the TRN • Begin at that point and mark off sets of numbers in the TRN that have the same number of digits as the last member/element of your frame. • Match the numbers you have taken from the TRN to those on your frame • The members/elements that match the numbers from the TRN are the members of your sample.
Example of a Simple Random Sample • My population is traditional-aged students at ULV. • I can get a list of all 1684 traditional-aged students at ULV from the Registrar’s office. • I want to collect data from 100 people from this population. • I begin at the top of the list (frame) of students and number the students, beginning with 1 and ending with 1684.
Example of a Numbered Frame • Josephine Morales • Zachary Morton • . • . • . • 1000. Bill Zimmerman • Agnes Zuckerman 1. Mary Aalpoel • John Abbinton • Oscar Ackerman . • Temecia Kennedy • Albert Kostas . • Jose Magana • Sarah Martin
Example of a Simple Random Sample • I get a statistics book that has a table of random numbers as an appendix. • The last person listed in my frame has the number 1684. That number has 4 digits. • I choose a random starting point in the TRN. • I count off 100 sets of four digits beginning at the random starting point (one set of digits for each person that I want to have in my sample.
Example of Selecting Numbersfrom a TRN So far the numbers selected for the sample (those that fall between 0 and 1684) are 0002, 0986, 0872, 1000 and 0028. The process continues until 100 numbers have been randomly selected between 0 and 1684.
Example of a Simple Random Sample 9. I list the 100 numbers from the TRN that fall between 1 and 1684 and match those numbers with those in the frame. • The student having the selected numbers will be asked to participate in my study. They will make up my sample of 100. Sample: 2. John Abbington 968. Josephine Morales 28. Temecia Kennedy 1000. Bill Zimmerman 872. Sarah Martin etc.
Steps in a Systematic Sample with a Random Start (SSRS) • Choose and nominally define a population • Locate a frame for the population • Choose a sample size • Examine the elements/members in frame for patterns in their status characteristics/attributes. If patterns are present, go to Stratified Sample. • Assign a consecutive number to each element in the frame. • Determine the sampling interval.
The Sampling Interval In an SSRS you will be systematically selecting members of your sample by counting off in your frame. The sampling interval tells you how far to count before selecting the next member of your sample. The sampling interval (k) is calculated: K = P/s where P = the population size (the number of elements in your frame), and s = your sample size
Steps in a Systematic Sample with a Random Start (SSRS) 7. Choose a random starting point in your frame. 8. Beginning with the random starting point, count off k element/members of the frame and select the kth element as a member of your sample. Continue, selecting each kth element/member of the frame to be included in your sample.
Example of a Systematic Sample with a Random Start • Again, my population is traditional-aged students at ULV. • I can get a list of all 1684 traditional-aged students at ULV from the Registrar’s office. • I want to collect data from at least 100 people in this population. • I begin at the top of the list (frame) of students and number the students, beginning with 1 and ending with 1684 (there are 1684 traditional-aged students registered at this time.
Example of a Numbered Frame • Josephine Morales • Zachary Morton • . • . • . • 1000. Bill Zimmerman • Agnes Zuckerman 1. Mary Aalpoel • John Abbinton • Oscar Ackerman . • Temecia Kennedy • Albert Kostas . • Jose Magana • Sarah Martin
Example of a Systematic Sample with a Random Start • I get a statistics book that has a random numbers table as an appendix. • The last person listed in my frame has the number 1684. That number has 4 digits. • I choose a random starting point in the TRN. • Beginning at the random starting point, I mark off the numbers in the TRN by fours until I reach the first set of four digits that falls between 1 and 1684. That is my random starting point for beginning the count off in my frame.
Example of Selecting a Starting Point for SRSS from a TRN Here I have started at a random point in a TRN and have marked off sets of four numbers until I reached the first number that is between 1 and 1684. That number is 0138. I will therefore start counting off in my frame with student number 138.
Example of Selecting Elements/Members for an SRSS Sample Beginning with 138 as a starting point, I count off 16, ending with 153. I select member/element number 153 to be in my sample. I add 16 to 153 to arrive at member/element 169 in my frame, and select number 169 to be in my sample. I continue to select every 16th member/element in the frame to be in my sample until I have reached the end of my frame. When I reach the end of the list, I continue my count at the beginning of the list until I have reached 138 again. That will give me a sample size of 105 students.
Example of SRSS Sample Members Selected from a Numbered Frame with k = 16 and a random starting point of 138 153. Albert Kirby 169. Sally James 185. Temecia Jamison • Derek Jones 217. Susan Johnston 233. John Maloney 249. Sarah Martin . . . 937. Josephine Solana 953. Jesus Soledad . • 969. Henry Suzuki • . • . • . • 1615. Jesse Wirth • 1647. Martha Zalm • Todd Zimmerman • Agnes Zuckerman1. • 1. Sandra Aalpoel • 17. Joan Anderson • 33. Ian Atchison • Etc until 138 is reached again.
Steps in a Stratified Sample • Choose and nominally define a population • Locate a frame for the population • Choose a sample size • Examine the elements/members in frame for patterns in their status characteristics or attributes. • Reorganize the elements by grouping all those with the same status characteristics together on the list (stratify them). The rest of the procedure is the same as it is for an SRSS. 6. Assign a consecutive number to each of the elements/members of the frame as they appear in their new order. • Determine the sampling interval, k. • Choose a random starting point in your frame. • Beginning with the random starting point, count off k element/members of the frame and select the kth element as a member of your sample. Continue, selecting each kth element/member of the frame to be included in your sample.
Finding Status Patterns in a Frame Lets imagine that your population is the personnel in a particular military platoon that is made up of 10 squads of 10 soldiers, 7 of whom are privates, 2 of whom are sergeants and 1 of whom is a lieutenant. You are given a frame (a list of all personnel in the platoon) by the commanding officer. The frame lists the personnel by squad beginning with the lowest ranking soldier and ending with the highest ranking soldier. The following slide shows what the frame would look like.
Sample Frame for Fictional Platoon • 14. Private • Private • Private • Private • Sergeant • Sergeant • Lieutenant • Private • Private • Private • Private • Private • Private • Private • Sergeant • Sergeant • Lieutenant • Private • Private • Private • Private • Sergeant • Sergeant • Lieutenant • Private • Private • Private • Etc. To 100 soldiers • Private • Private • Private • Private • Private • Private • Private • Sergeant • Sergeant • Lieutenant • Private • Private • Private
Example of a Stratified Sample • My population is soldiers in a single platoon. • I can get a list of all 100 soldiers in this platoon from the commanding officer. • I want to do in-depth interviews with 10 soldiers in the platoon. • I check the frame and find there is a clear pattern in the way that soldiers are listed. They are listed by rank. Since I want to interview 10 soldiers and I have a population of 100, my sampling interval for this group would be 10. If I just sample as I would in a SSRS, I would take every 10th soldier. However, choosing every 10th soldier would give me 10 lieutenants, no sergeants and no privates. This is not representative of the group as a whole. I want to have opinions that reflect the whole platoon, and I want to do interviews with people of different ranks in the proportions in which they exist in the platoon.
Straight SRSS Sampling Results • 14. Private • Private • Private • Private • Sergeant • Sergeant • Lieutenant • Private • Private • Private • Private • Private • Private • Private • Sergeant • Sergeant • Lieutenant • Private • Private • Private • Private • Private • Private • Private • Sergeant • Sergeant • Lieutenant • Etc. To 100 soldiers • Private • Private • Private • Private • Private • Private • Private • Sergeant • Sergeant • Lieutenant • Private • Private • Private
Example of a Stratified Sample 5. I reorganize the elements in the frame by grouping all those with the same status characteristics together on the list (stratify them). This means all privates go together, all sergeants go together and all lieutenants go together in the list. 6. I then assign a consecutive number to each of the soldiers listed in the frame as they appear in their new order. • I determine my sampling interval by dividing my population size (100) by the my sample size (10). k = 100/10 = 10. • I locate a TRN and choose a random starting point using the same techniques that I used in the SRSS. In this case, my population size is 100 so I need to use three digits to be sure that every soldier has an equal chance of being included.
Example of Selecting a Starting Point for SRSS from a TRN Here I have started at a random point in a TRN and have marked off sets of three numbers until I reached the first number that is between 1 and 100. That number is 050. I will therefore start counting off in my frame with soldier number 50.
Example of a Stratified Sample • Beginning with the random starting point, I count off 10 soldiers and select the 10th as a member of my sample. I continue, selecting each 10th soldier as I go through the frame. When I get to 100, I continue uninterrupted in my count until I have reached. The resultant sample will have exactly the proportion of privates, sergeants and lieutenants as are in the platoon: 7 privates, 2 sergeants and 1 lieutenant
SRSS Sampling Resultswith Stratification • Private • Private • Private • Private • Private • Private • Private • Private • Lieutenant • Lieutenant • Lieutenant • Lieutenant • Lieutenant • Lieutenant • Lieutenant • Lieutenant • Lieutenant • Lieutenant • Sergeant • Sergeant • Sergeant • Sergeant • Sergeant • Sergeant • Sergeant • Sergeant • Sergeant • Etc until 10 selected • Private • Private • Private • Private • Private • Private • Private • Private • Private • Private • Private • Private • Private
Example of SS Sample Members Selected from a Numbered Frame with P = 100, k = 10 and a random starting point of 50 59. Private 9. Private 69. Private 19. Private 79. Lieutenant 29. Private 89. Sergeant 39. Private 99. Sergeant 49. Private The result is a sample of 10 with 10% Lieutenants, 20% Sergeants and 70% Privates
What is Multistage Cluster Sampling? Multistage cluster sampling is a technique for: when you do not have a frame that lists all elements of a population, OR when numbers of individual elements in your population are too numerous to sample easily. AND you can obtain frames for groups of population elements/members.
Steps in Multistage Cluster Sampling • 1. Choose and nominally define a population • 2. Choose a sample size • Identify groupings of the elements that make up • your population. • Obtain a frame for the groups of elements. • Randomly sample the groups using SRS or • SSRS. • Obtain a frame for the individual elements within • each group selected during step 5. • Randomly sample the individuals in the groups • selected in step 5 using SRS or SSRS.
Example of MultiStage Cluster Sampling • My population is all the people living in the city of La Verne. There is no frame for all people currently living in La Verne. • I want to survey 100 residents of La Verne about the City Council’s latest action. • I can get a list of all the 1750 streets in La Verne and also for all mailing addresses (representing households) in the city. These groupings of residents will help me use probability sampling to select 100 residents. I will select 25 streets in the city, and then four households from each of the 25 streets.
Example of MultiStage Cluster Sampling • I get a frame that includes all streets in La Verne. • I use Simple Random Sampling techniques to select 25 Streets in La Verne.
Example of MultiStage Cluster Sampling There are 1750 streets in La Verne. I will select 25 streets. I begin with a random starting point in a TRN and, because the total number of streets has four digits, I will mark of sets of four digits in the TRN. The following numbers identify streets that will be included in the sample of streets: 910, 850, 505, 50, 1102, 1209, 1092, 750, 40, 1712, and so forth until 25 streets have been selected.
Example of MultiStage Cluster Sampling • I then want to select 4 addresses for each of the 25 streets selected in Step 5. Street number 910 is Pine View, and has 26 addresses on it. I use SRS techniques to select 4 addresses on Pine View. One adult will be surveyed from each of those addresses.
Example of MultiStage Cluster Sampling There are 26 addresses on Pine View Street. I will select four addresses. I begin with a random starting point in a TRN and, because the total number of addresses has two digits, I will mark of sets of two digits in the TRN. The following numbers identify addresses that will be included in the sample: 2, 9, 17, 19.
Example of Multistage Cluster Sampling • I will repeat Step 6. for each of the 25 streets selected in Step 5. At the end of this process, I should have 100 addresses randomly selected, four from each of 25 randomly selected streets. • I will visit each of these addresses and interview the first adult I am able to talk with from each household.
Study Guide Population Sample Probability sampling Non-probability sampling Element Frame Table of random numbers Nominal definition Number of digits in population size Simple random sample Random starting point Systematic sample with a random start Patterns of status characteristics Stratified sample Multistage cluster sample Populations clusters