1 / 26

SESSION 17 & 18

Learn about measures of relative standing, such as median, quartiles, deciles, and percentiles, as well as measures of dispersion, such as range, variance, and standard deviation, for grouped data.

haggard
Download Presentation

SESSION 17 & 18

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. SESSION 17 & 18 Last Update 16th March 2011 Measures of Dispersion Measures of Variability

  2. Grouped Data – Investment B

  3. Learning Objectives • Measures of relative standing: Median, Quartiles, Deciles and Percentiles • Measures of dispersion: Range • Measures of variability: Variance and Standard Deviation

  4. Percentiles The Pth percentile is the value for which P percent are less than that value and (100 – p)% are greater than that value. Some special percentiles commonly used include the median and the quartiles. Percentiles are measures of relative standing.

  5. Terminology 50th Percentile 25th, 50th, 75th,100th Percentile 20th, 40th,…, 100th Percentile 10th, 20th,…, 100th Percentile ½  1 Median ¼  4 Quartiles 1/5  5 Quintiles 1/10  10 Deciles Q2 Q1, Q2, Q3,Q4, Lp

  6. Location of a Percentile The location L of a percentile is a function of the required percentile P and the sample size n: Lp = (n + 1) * (P / 100) As with the median, all observations must be placed in ascending or descending order first.

  7. Calculation of Percentile • Place all observations in order • Calculate the location of the percentile • Since the location will often be a fraction (e.g. n/2), the distance between the two observations in question must be multiplied with the fractional part of the location • The result of 3. is added to the preceding observation to yield the percentile

  8. Percentile: An example The following denotes the number of hours spent on the internet: 0 0 5 7 8 9 12 14 22 23 The values are already placed in order. The sample size is n = 10. We wish to determine L25, L50 and L75 (this is analogous to the quartiles Q1, Q2 and Q3)

  9. Solution – Step 1 Use the formula to calculate the location for each percentile / quartile

  10. Solution – Step 2 Determine the fractional part of the location

  11. Solution – Step 3 Determine the next lower and next higher observation associated with the location. For 2.75, the two observations are 2  0 and 3  5.

  12. Solution – Step 4 In order to determine the quartile associated with a given location, you need to calculate the following: Solution = Lower + (Upper – Lower) * Fraction

  13. Exercises You may use shortcuts if you want! • Determine the first, second and third quartiles: 5 8 2 9 5 3 7 4 2 7 4 10 4 3 5 • Determine the third and eighth deciles (30th and 80th percentile): 10.5 14.7 15.3 17.7 15.9 12.2 10.0 14.1 13.9 18.5 13.9 15.1 15.7

  14. Range The range is the difference between the minimum and maximum observation. It is a measure of dispersion. The interquartile range is the difference between the third and the first quartile: Interquartile Range = Q3 – Q1

  15. Variance The variance expresses the sum of the squared deviation of every single observation from the sample / population mean. All differences are squared so that positive and negative deviations from the mean are not cancelled out. The variance in a measure of variability.

  16. Population and Sample Variance We need to differentiate between population variance and sample variance. From the calculation of the mean, the sample variance has one less degrees of freedom (n-1) in calculating the variance. For the hypothetically infinite population of size N this is not the case.

  17. Formulas

  18. Calculation of Variance • Calculate the average: Sum of observations / number of observations • Subtract the average from every obervation • Square the difference • Sum the squared differences • Divide the result from 4. by either N (population) or n-1 (sample)

  19. Variance: An example The following denotes the number of hours spent on the internet for a sample of n = 10 adults: 0 7 12 5 33 14 8 0 9 22 Calculate the variance.

  20. Solution – Step 1 Use the mean to calculate the differences between the mean and every observation

  21. Solution – Step 2 Square all differences. Next, Sum the differences and divide the sum by n – 1 (sample only) In case of the sample, the sumsq is divided by n-1, in the case of the population it is divided byN

  22. Interpretation Variance The variance may be difficult to interpret. Remember that all differences are squared to avoid positive and negative differences from cancelling out. The statistic may be standardized by taking the square root of the variance. This statistic is called the standard deviation. However, the variances from two datasets may still be referred to when determining the more volatile dataset.

  23. Example – Standard Deviation The population standard deviation: Similarly, the sample standard deviation: Thus, for the internet usage example:

  24. Solution – Step 3 Interpretation: On average, observations of internet usage within the sample of ten people deviates by 6.766 h from the sample mean.

  25. Exercises • Calculate the variance and standard deviation for the following data: 2 8 9 4 1 7 5 4 • Calculate the variance and standard deviation for the following data: 7 -5 -3 8 4 -4 1 -5 9 3

More Related