1 / 10

Efficient Sampling for Distance Analysis of Student Commutes

This report details the selection of a sample using systematic sampling to analyze student commuting distances. The method was chosen for efficiency and accuracy in representing different year groups. Statistical analysis and evaluation of the sample are provided along with conclusions on average distances traveled.

elizabethk
Download Presentation

Efficient Sampling for Distance Analysis of Student Commutes

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. L2 Sampling Exercise A possible solution

  2. Selecting a sample • I will select my sample using a systematic sampling technique. • For this I will systematically select every 4th student (136/30) starting at student number 71 which I have randomly selected using the random number function on my calculator.

  3. Sample Data

  4. Justify choice of method • I chose a systematic method because it was quite quick and easy to use. • I know that it will give me a good spread of year groups because the data has been sorted into year groups.

  5. Representative • Because I used a systematic method I have a close to, (see later) proportional representation of year levels. • I seem to have selected more females than males, 21 females and 9 males, so males are perhaps under represented.

  6. Statistics • For my sample I got a mean of 1.7967 and a sample standard deviation of 1.168. • Using these values I would estimate a population mean of 1.8 km and a standard deviation for the population of 1.2 km.

  7. Box Plot From my box plot I can see that the data is evenly spread. It is fairly symmetrical about the median and a bit spread out between the upper quartile and the largest value. The median distance is 1.65 km which is just lower than the mean indicating the data is skewed more toward the lower distances. 75% of students live within 2.5 km of school.

  8. Evaluation • The sampling process is an appropriate one given the way the original data was presented. Because it was ordered according to year levels a systematic sampling method gave me the same sort of result that a stratified method would have given me, with a lot less work. • I noticed in my sample more females than males however this would not affect the estimate as a student who lives any given distance from school is just as likely to be male or female. • In dividing 136 by 30 the answer comes out as 4.53. I chose to sample every 4th person rather than every 5th. This should not have biased my result in any way as there is still a representative number chosen from each level.

  9. Evaluation continued • The mean for the 9 yr12 students is 1.7, sd = 1.5 • The mean for the 8 yr 13 students is 1.925, sd = 1.35 • The mean for the 13 yr 11 students is 1.78, sd = 0.84 • From the statistics above it would appear that there is little difference between the distances traveled for the different year levels. • Due to the median being a bit lower than the mean it suggests that the mean has been affected by the unusually large values of 4.7 km and 3.9 km which are significantly larger than the rest.

  10. Conclusion • For this reason I would suggest that a better estimate for the average distance is closer to 1.7 km.

More Related