1 / 52

Measures of Dispersion

Measures of Dispersion. 9/24/2013. Readings. Chapter 2 Measuring and Describing Variables (Pollock) (pp.37-44) Chapter 6. Foundations of Statistical Inference (128-133) (Pollock) Chapter 3 Transforming Variables (Pollock Workbook). Opportunities to discuss course content.

shea-oneill
Download Presentation

Measures of Dispersion

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Measures of Dispersion 9/24/2013

  2. Readings • Chapter 2 Measuring and Describing Variables (Pollock) (pp.37-44) • Chapter 6. Foundations of Statistical Inference (128-133) (Pollock) • Chapter 3 Transforming Variables (Pollock Workbook)

  3. Opportunities to discuss course content

  4. Office Hours For the Week • When • Wednesday 10-12 • Thursday 8-12 • And by appointment • You will get your exams back on Thursday • Homework, now due on October 3rd

  5. Course Learning Objectives • Students will learn the basics of research design and be able to critically analyze the advantages and disadvantages of different types of design.  • Students Will be able to interpret and explain empirical data.

  6. Descriptive statistics

  7. Descriptive Statistics • These simply describe the attributes of a single variable. • You cannot test here (you need two variables) • Why do them?

  8. Categories of Descriptive Statistics Measures of Central Tendency Measures of Dispersion How wide is our range of data, how close to the middle are the values distributed Range, Variance, Standard Deviation • The most common, the middle, the average • Mean, Median and Mode

  9. Frequency Distributions

  10. To Run A Frequency Distribution • Open GSS2008.sav • Analyze (95% of all our statistics will come from this menu) • Descriptive Statistics • Frequencies

  11. Step 2 Select Your Variable Here is the Output

  12. Interpreting the Results What is the Mode (#, cat)? • Percent- relative frequency for all cases • Valid Percent- relative frequency for valid cases (This excludes missing values). • Cumulative Percent- %of observations less than or equal to the category What is the median (#, cat?)

  13. Measures of Central Tendency

  14. First Run A Frequency Distribution Natenvir Variable- Government Spending on Improving and Protecting The Environment The Statistics Window Click on Statistics

  15. The Output

  16. For Ratio Variables Step 2 Step 4 Step 1 Step 3

  17. Case Summaries

  18. How To Do it (using world Dataset) Step 1 Step 2 Check off this box

  19. Measures of disperison

  20. What are They? • these measure the uniformity of the data • they measure how closely or widely cases are separated on a variable.

  21. The Range • The Simplest Measure of Dispersion • Max • Min • Range= max-min (only fun for ratio variables)

  22. Back To the Island • What is the • Maximum • Minimum • Range

  23. High Vs. Low Dispersion • Polarized • Clustered

  24. High Dispersion

  25. Clustering

  26. The Standard Deviation • A More accurate and precise measure than dispersion and clustering • Is the average distance of values in a distribution from the mean

  27. What it tells us • When the value of the standard deviation is small, values are clustered around the mean. • When the value of the standard deviation is high, values are spread far away from the mean.

  28. From 2008 Who was more divisive?

  29. About the Standard Deviation • its based on the mean • the larger the standard deviation, the more spread out the values are and the more different they are • if the standard deviation =0 it means there is no variability in the scores. They are all identical.

  30. Standard Deviation in SPSS • Open up the States.Sav dataset and use the union07 variable. • Analyze • Descriptive Statistics • Descriptives • Select your options

  31. The Standard Deviation and Outliers • Any case that is more than 2 standard deviations away from the mean • These cases often provide valuable insights about our distribution

  32. If you find this amusing or annoying, you get the concept

  33. 2011 Baseball Salaries

  34. How to determine the value of a standard deviation

  35. How to determine the value of a standard deviation • The value of +/- 1 s.d. = mean + value of s.d • e.g. if the mean is 8 and the s.d is 2, the value of -1 s.d's is 6, and + 1 s.d.'s is 10 • The value of +/- 2 s.d. = mean + (value of s.d. *2) • e.g. if the mean is 8 and the s.d is 2, the value of -2 s.d's is 4, and + 2 s.d.'s is 12 • Any value in the distribution lower than 4 and higher than 12 is an outlier

  36. ECU Pirates

  37. An Example from 2008 • States Database • What is the Value of +/- 1 S.D?. (mean+ 1.s.d) • What is the Value of +/-2 S.D? (mean +/- 2 s.d)

  38. Unwrapping The Results • Which are Outliers • How did that shape the 2012 campaign

  39. The Normal Curve

  40. Different Kinds of Curves

  41. Camel Humps Dromedary (one hump) Bactrian (bi-modal)

  42. The Normal/Bell Shaped curve • Symmetrical around the mean • It has 1 hump, it is located in the middle, so the mean, median, and mode are all the same!

  43. Why we use the normal curve • To determine skewness • The Normal Distribution curve is the basis for significance testing

  44. Testing • Causality • Statistical Significance • Practical Significance

  45. Significance Testing

  46. What this Tells us • Roughly 68% of the scores in a sample fall within one standard deviation of the mean • Roughly 95% of the scores fall 2 standard deviations from the mean (the exact # is 1.96 s.d) • Roughly 99% of the scores in the sample fall within three standard deviations of the mean

  47. A Practice Example • Assuming a normal curve compute the age (value) • For someone who is +1 s.d, from the mean • what number is -1 s.d. from the mean • With this is assumption of normality, what % of cases should roughly fall within this range (+/-1 S.D.) • What about 2 Standard Deviations, what percent should fall in this range?

  48. skewness

  49. What is skewness? • an asymmetrical distribution. • Skewnessis also a measure of symmetry, • Most often, the median is used as a measure of central tendency when data sets are skewed.

More Related