1 / 10

Outliers and Measures of Central Tendency / Dispersion

Outliers and Measures of Central Tendency / Dispersion. Wib Leonard Megan Marchini Nick Pajewski. Learning Objectives. Explain how outlying observations effect numeric summary measures of central tendency and dispersion

Download Presentation

Outliers and Measures of Central Tendency / Dispersion

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Outliers and Measures of Central Tendency / Dispersion Wib Leonard Megan Marchini Nick Pajewski

  2. Learning Objectives • Explain how outlying observations effect numeric summary measures of central tendency and dispersion • Explain why rank-based statistics are more robust to outlying observations • Recognize outlying observations graphically through box-plots

  3. Context • Calculate summary measures of central tendency and dispersion • Mean, median, & mode • Standard deviation, Inter-quartile range • Graphically represent data distributions in the form of a box-plot

  4. Basic Activity Description • Students are put in groups of 4-5 • Students collect their own ages and the ages of any siblings • They then add in an outlying observation, the age of the oldest grandparent in the group, creating a second dataset • NOTE: Activity could be adjusted using coins, die, etc.

  5. Activity cont’d. • The “hands-on” portion then involves computing for each of the datasets • Mean, median, mode, standard deviation, IQR • Constructing a boxplot • See attached worksheet • Group results are then collected (on the board, etc. ) to illustrate major objectives and to discuss how the effect of outliers diminishes with sample size

  6. Formal Computer Presentations • After the “hands-on” portion, major concepts could be formalized using an example like the Sharks dataset (Agresti page 45) • Data contains shark attacks worldwide • Florida represents an outlier • 289 attacks vs 64 for next highest • Construct an Excel spreadsheet that contains data and automatically computes summary measures

  7. Applet Presentation http://standards.nctm.org/document/eexamples/chap6/6.6/index.htm#inst1

  8. Summary of Objectives • When using the ages of your group and its siblings, the mean, median and mode should be similar. However, when adding the age of the grandparent, we would expect the mean to be greater than the median. • Summary measures like the mean & sample standard deviation are more sensitive to outliers than rank-based measures like the median and IQR. • The effect of outliers diminishes as the sample size increases.

  9. Follow-up Topics • Dealing with outliers • Exclusion • Data transformations • Hypothesis Testing • Parametric tests vs. rank-based statistics • Regression Models • Residual analysis, influential observations, etc

More Related