1 / 25

Descriptive Statistics: Numerical Measures Distribution

Descriptive Statistics: Numerical Measures Distribution. Chapter 3 BA 201. Distribution. Measures of Distribution Shape, Relative Location, and Detecting Outliers. Distribution Shape. z-Scores. Chebyshev’s Theorem. Empirical Rule. Detecting Outliers. Distribution Shape: Skewness.

hammer
Download Presentation

Descriptive Statistics: Numerical Measures Distribution

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Descriptive Statistics: Numerical MeasuresDistribution Chapter 3 BA 201

  2. Distribution

  3. Measures of Distribution Shape,Relative Location, and Detecting Outliers • Distribution Shape • z-Scores • Chebyshev’s Theorem • Empirical Rule • Detecting Outliers

  4. Distribution Shape: Skewness • An important measure of the shape of a distribution is called skewness. • The formula for the skewness of sample data is

  5. .35 .30 .25 .20 .15 .10 .05 0 Distribution Shape: Skewness • Symmetric (not skewed) • Skewness is zero. • Mean and median are equal. Skewness = 0 Relative Frequency

  6. .35 .30 .25 .20 Relative Frequency .15 .10 .05 0 Distribution Shape: Skewness • Moderately Skewed Left • Skewness is negative. • Mean will usually be less than the median. Skewness = - .31

  7. .35 .30 .25 .20 Relative Frequency .15 .10 .05 0 Distribution Shape: Skewness • Moderately Skewed Right • Skewness is positive. • Mean will usually be more than the median. Skewness = .31

  8. .35 .30 .25 .20 Relative Frequency .15 .10 .05 0 Distribution Shape: Skewness • Highly Skewed Right • Skewness is positive (often above 1.0). • Mean will usually be more than the median. Skewness = 1.25

  9. Distribution Shape: Skewness Apartment Rents

  10. .35 .30 .25 .20 .15 .10 .05 0 Distribution Shape: Skewness Apartment Rents Skewness = 0.92 Relative Frequency

  11. z-Scores The z-score is often called the standardized value. It denotes the number of standard deviations a data value xi is from the mean.

  12. z-Scores • An observation’s z-score is a measure of the relative location of the observation in a data set. z-score < 0 z-score > 0 z-score = 0 x

  13. z-Scores Apartment Rents • z-Score of Smallest Value (425) Standardized Values for Apartment Rents

  14. Practicez-scores

  15. Practice #6 – z-Scores x = 13 s = 7.4

  16. Chebyshev’s Theorem At least (1 - 1/k2) of the items in any data set will be within k standard deviations of the mean, where kis any value greater than 1.

  17. Let z = 1.5 with = 490.80 and s = 54.74 - k(s) = 490.80 - 1.5(54.74) = 409 + k(s) = 490.80 + 1.5(54.74) = 573 Chebyshev’s Theorem Apartment Rents At least (1 - 1/(1.5)2) = 1 - 0.44 = 0.56 or 56% of the rent values must be between and (Actually, 86% of the rent values are between 409 and 573.)

  18. Empirical Rule When data approximate a bell-shaped distribution, the empirical rule can be used to determine the percentage of data values that must be within a specified number of standard deviations of the mean.

  19. 99.72% 95.44% 68.26% Empirical Rule x m m + 3s m – 3s m – 1s m + 1s m – 2s m + 2s

  20. PracticeChebyshev’s Theorem and Empirical Rule

  21. Practice #7 - Chebyshev’s Theorem x = 1200 s = 110 How many items (%) are within k standard deviations? k = 1.25 k = 3.5

  22. Practice #7 – Empirical Rule x = 1200 s = 110 What is the lower bound for 2 standard deviations? The upper bound? How many items (%) are within this area?

  23. Detecting Outliers • An outlier is an unusually small or unusually large • value in a data set. • A data value with a z-score less than -3 or greater • than +3 might be considered an outlier. • It might be: • an incorrectly recorded data value • a data value that was incorrectly included in the • data set • a correctly recorded data value that belongs in • the data set

  24. Detecting Outliers Apartment Rents • The most extreme z-scores are -1.20 and 2.27 • Using |z| > 3 as the criterion for an outlier, there • are no outliers in this data set. Standardized Values for Apartment Rents

More Related