960 likes | 1.27k Views
Chapter 3. 3-1. Data Description. Outline. 3-2. 3-1 Introduction 3-2 Measures of Central Tendency 3-3 Measures of Variation 3-4 Measures of Position 3-5 Exploratory Data Analysis. Objectives. 3-3.
E N D
Chapter 3 3-1 Data Description
Outline 3-2 • 3-1 Introduction • 3-2 Measures of Central Tendency • 3-3 Measures of Variation • 3-4 Measures of Position • 3-5 Exploratory Data Analysis
Objectives 3-3 • Summarize data using the measures of central tendency, such as the mean, median, mode, and midrange. • Describe data using the measures of variation, such as the range, variance, and standard deviation.
Objectives 3-4 • Identify the position of a data value in a data set using various measures of position, such as percentiles, deciles and quartiles.
Objectives 3-5 • Use the techniques of exploratory data analysis, including stem and leaf plots, box plots, and five-number summaries to discover various aspects of data.
3-2 Measures of Central Tendency 3-6 • Astatisticis a characteristic or measure obtained by using the data values from a sample. • Aparameteris a characteristic or measure obtained by using the data values from a specific population.
3-2 The Mean (arithmetic average) 3-7 • Themeanis defined to be the sum of the data values divided by the total number of values. • We will compute two means: one for the sample and one for a finite population of values.
3-2 The Mean (arithmetic average) 3-8 • The mean, in most cases, is not an actual data value.
3-2 The Sample Mean 3-9 T h e s y m b o l X r e p r e s e n t s t h e s a m p l e m e a n . X i s r e a d a s " X - b a r " . T h e G r e e k s y m b o l i s r e a d a s " s i g m a " a n d i t m e a n s " t o s u m " . X X . . . + X + + X = 1 2 n n X . = n
3-2 The Sample Mean - Example 3-10 T h e a g e s i n w e e k s o f a r a n d o m s a m p l e o f s i x k i t t e n s a t a n a n i m a l s h e l t e r a r e 3 , 8 , 5 , 1 2 , 1 4 , a n d 1 2 . F i n d t h e a v e r a g e a g e o f t h i s s a m p l e . T h e s a m p l e m e a n i s X 3 + 8 + 5 + 1 2 + 1 4 + 1 2 X = = n 6 5 4 = 9 w e e k s . = 6
3-2 The Population Mean 3-11 T h e G r e e k s y m b o l r e p r e s e n t s t h e p o p u l a t i o n m m e a n . T h e s y m b o l i s r e a d a s " m u " . m N i s t h e s i z e o f t h e f i n i t e p o p u l a t i o n . X X . . . + X + + = m 1 2 N N X . = N
A s m a l l c o m p a n y c o n s i s t s o f t h e o w n e r , t h e m a n a g e r , t h e s a l e s p e r s o n , a n d t w o t e c h n i c i a n s . T h e s a l a r i e s a r e l i s t e d a s $ 5 0 , , , , , a n d r e s p e c t i v e l y A s s u m e t h i s i s t h e p o p u l a t i o n T h e n t h e p o p u l a t i o n m e a n w i l l b e X N 3-2 The Population Mean-Example 3-12 0 0 0 2 0 0 0 0 1 2 0 0 0 , 9 , 0 0 0 9 , 0 0 0 . ( . ) = m 5 0 , 0 0 0 + 2 0 , 0 0 0 + 1 2 , 0 0 0 + 9 , 0 0 0 + 9 , 0 0 0 = 5 = $ 2 0 , 0 0 0 .
3-2 The Sample Mean for an Ungrouped Frequency Distribution 3-13 T h e m e a n f o r a n u n g r o u p e d f r e q u e n c y d i s t r i b u t u i o n i s g i v e n b y ( f X ) · X = . n H e r e f i s t h e f r e q u e n c y f o r t h e c o r r e s p o n d i n g v a l u e o f X , a n d n = f .
3-2 The Sample Mean for an Ungrouped Frequency Distribution - Example 3-14 The scores for 25 students on a 4 – point quiz are given in the table. Find the mean score S c o r e , X F r e q u e n c y , f S c o r e , X F r e q u e n c y , f 0 2 0 2 1 4 1 4 2 1 2 2 1 2 3 4 3 4 4 3 4 3 5 5
? S c o r e , X F r e q u e n c y , f f X 0 2 0 1 4 4 2 1 2 2 4 3 4 1 2 3-2 The Sample Mean for an Ungrouped Frequency Distribution - Example 3-15 S c o r e , X F r e q u e n c y , f f X · 0 2 0 1 4 4 2 1 2 2 4 3 4 1 2 4 3 1 2 4 3 1 2 5 5 f X 5 2 X = = 2 . 0 8 . = n 2 5
The mean for a grouped frequency distribution is given by × å ( f X ) X = . m n Here X is the correspond ing m class midpoint. 3-2 The Sample Mean for a Grouped Frequency Distribution 3-16
C l a s s F r e q u e n c y , f 1 5 . 5 - 2 0 . 5 3 2 0 . 5 - 2 5 . 5 5 2 5 . 5 - 3 0 . 5 4 3 0 . 5 - 3 5 . 5 3 3 5 . 5 - 4 0 . 5 2 3-2 The Sample Mean for a Grouped Frequency Distribution -Example 3-17 C l a s s F r e q u e n c y , f 1 5 . 5 - 2 0 . 5 3 2 0 . 5 - 2 5 . 5 5 2 5 . 5 - 3 0 . 5 4 3 0 . 5 - 3 5 . 5 3 3 5 . 5 - 4 0 . 5 2 5 5
? C l a s s F r e q u e n c y , f X f X m m 1 5 . 5 - 2 0 . 5 3 1 8 5 4 2 0 . 5 - 2 5 . 5 5 2 3 1 1 5 2 5 . 5 - 3 0 . 5 4 2 8 3-2 The Sample Mean for a Grouped Frequency Distribution -Example 3-18 Table with class midpoints, Xm. C l a s s F r e q u e n c y , f X f X m m 1 5 . 5 - 2 0 . 5 3 1 8 5 4 2 0 . 5 - 2 5 . 5 5 2 3 1 1 5 2 5 . 5 - 3 0 . 5 4 2 8 1 1 2 1 1 2 3 0 . 5 - 3 5 . 5 3 3 3 9 9 3 0 . 5 - 3 5 . 5 3 3 3 9 9 3 5 . 5 - 4 0 . 5 2 3 8 7 6 3 5 . 5 - 4 0 . 5 2 3 8 7 6 5 5
3-2 The Sample Mean for a Grouped Frequency Distribution -Example 3-19 f X 5 4 1 1 5 1 1 2 9 9 7 6 = + + + + m = 4 5 6 a n d n = 1 7 . S o f X X = m n 4 5 6 = 2 6 . 8 2 . = 1 7
3-2 The Median 3-20 • When a data set is ordered, it is called adata array. • Themedianis defined to be the midpoint of the data array. • The symbol used to denote the median isMD.
3-2 The Median - Example 3-21 • The weights (in pounds) of seven army recruits are 180, 201, 220, 191, 219, 209, and 186. Find the median. • Arrange the data in order and select the middle point.
3-2 The Median - Example 3-22 • Data array:180, 186, 191,201, 209, 219, 220. • The median,MD = 201.
3-2 The Median 3-23 • In the previous example, there was an odd number of values in the data set. In this case it is easy to select the middle number in the data array.
3-2 The Median 3-24 • When there is aneven numberof values in the data set, the median is obtained by taking theaverage of the two middle numbers.
3-2 The Median -Example 3-25 • Six customers purchased the following number of magazines: 1, 7, 3, 2, 3, 4. Find the median. • Arrange the data in order and compute the middle point. • Data array: 1, 2, 3, 3, 4, 7. • The median, MD = (3 + 3)/2 = 3.
3-2 The Median -Example 3-26 • The ages of 10 college students are: 18, 24, 20, 35, 19, 23, 26, 23, 19, 20. Find the median. • Arrange the data in order and compute the middle point.
3-2 The Median -Example 3-27 • Data array:18, 19, 19, 20,20,23, 23, 24, 26, 35. • The median,MD = (20 + 23)/2 = 21.5.
3-2 The Median-Ungrouped Frequency Distribution 3-28 • For an ungrouped frequency distribution, find the median by examining the cumulative frequencies to locate the middle value.
3-2 The Median-Ungrouped Frequency Distribution 3-29 • If n is the sample size, compute n/2. Locate the data point where n/2 values fall below and n/2 values fall above.
3-2 The Median-Ungrouped Frequency Distribution -Example 3-30 • LRJ Appliance recorded the number of VCRs sold per week over a one-year period. The data is given below. N o . S e t s S o l d F r e q u e n c y N o . S e t s S o l d F r e q u e n c y 1 4 1 4 2 9 2 9 3 6 3 6 4 2 4 2 5 3 5 3
3-2 The Median-Ungrouped Frequency Distribution -Example 3-31 • To locate the middle point, divide n by 2; 24/2 = 12. • Locate the point where 12 values would fall below and 12 values will fall above. • Consider the cumulative distribution. • The 12th and 13th values fall in class 2. Hence MD = 2.
N o . S e t s S o l d F r e q u e n c y C u m u l a t i v e F r e q u e n c y 1 4 4 2 9 1 3 5 3 2 4 3-2 The Median-Ungrouped Frequency Distribution -Example 3-32 N o . S e t s S o l d F r e q u e n c y C u m u l a t i v e F r e q u e n c y 1 4 4 2 9 1 3 3 6 1 9 3 6 1 9 4 2 2 1 4 2 2 1 5 3 2 4 This class contains the 5th through the 13th values.
The median can be computed from: - ( n 2 ) cf = + MD ( w ) L m f Where n = sum of the frequencies cf = cumulative frequency of the class immediately preceding the median class f = frequency of the median class w = width of the median class = the L lower boundary of median class m 3-2 The Median for a Grouped Frequency Distribution 3-33
C l a s s F r e q u e n c y , f 1 5 . 5 - 2 0 . 5 3 2 0 . 5 - 2 5 . 5 5 2 5 . 5 - 3 0 . 5 4 3 0 . 5 - 3 5 . 5 3 3 5 . 5 - 4 0 . 5 2 3-2 The Median for a Grouped Frequency Distribution -Example 3-34 C l a s s F r e q u e n c y , f 1 5 . 5 - 2 0 . 5 3 2 0 . 5 - 2 5 . 5 5 2 5 . 5 - 3 0 . 5 4 3 0 . 5 - 3 5 . 5 3 3 5 . 5 - 4 0 . 5 2 5 5
3 2 0 . 5 - 2 5 . 5 5 8 2 5 . 5 - 3 0 . 5 4 1 2 3 0 . 5 - 3 5 . 5 3 1 5 3 5 . 5 - 4 0 . 5 2 1 7 3-2 The Median for a Grouped Frequency Distribution -Example 3-35 C l a s s F r e q u e n c y , f C u m u l a t i v e C l a s s F r e q u e n c y , f C u m u l a t i v e r e q u e n c y F F r e q u e n c y 1 5 . 5 - 2 0 . 5 3 1 5 . 5 - 2 0 . 5 3 3 2 0 . 5 - 2 5 . 5 5 8 2 5 . 5 - 3 0 . 5 4 1 2 3 0 . 5 - 3 5 . 5 3 1 5 3 5 . 5 - 4 0 . 5 2 1 7 5 5
3-2 The Median for a Grouped Frequency Distribution -Example 3-36 • To locate the halfway point, divide n by 2; 17/2 = 8.5 9. • Find the class that contains the 9th value. This will be the median class. • Consider the cumulative distribution. • The median class will then be 25.5 – 30.5.
3-2 The Median for a Grouped Frequency Distribution 3-37 n = 17 cf = 8 f = 4 w = 25.5 – 20.5 = 5 = L 25 . 5 m - ( n 2 ) cf (17 / 2) – 8 = + + MD ( w ) L = ( 5 ) 25 . 5 f 4 m = 26.125.
3-2 The Mode 3-38 • Themodeis defined to be the value that occurs most often in a data set. • A data set can have more than one mode. • A data set is said to have no mode if all values occur with equal frequency.
3-2 The Mode -Examples 3-39 • The following data represent the duration (in days) of U.S. space shuttle voyages for the years 1992-94. Find the mode. • Data set: 8, 9, 9, 14, 8, 8, 10, 7, 6, 9, 7, 8, 10, 14, 11, 8, 14, 11. • Ordered set: 6, 7, 7, 8, 8, 8, 8, 8, 9, 9, 9, 10, 10, 11, 11, 14, 14, 14.Mode = 8.
3-2 The Mode -Examples 3-40 • Six strains of bacteria were tested to see how long they could remain alive outside their normal environment. The time, in minutes, is given below. Find the mode. • Data set: 2, 3, 5, 7, 8, 10. • There isno modesince each data value occurs equally with a frequency of one.
3-2 The Mode -Examples 3-41 • Eleven different automobiles were tested at a speed of 15 mph for stopping distances. The distance, in feet, is given below. Find the mode. • Data set: 15, 18, 18, 18, 20, 22, 24, 24, 24, 26, 26. • There aretwo modes (bimodal). The values are 18 and 24. Why?
3-2 The Mode for an Ungrouped Frequency Distribution -Example 3-42 Given the table below, find the mode. V a l u e s F r e q u e n c y , f V a l u e s F r e q u e n c y , f 1 5 3 1 5 3 Mode 2 0 5 2 0 5 2 5 8 2 5 8 3 0 3 3 0 3 3 5 2 3 5 2 5 5
3-2 The Mode - Grouped Frequency Distribution 3-43 • Themode for grouped data is themodal class. • The modal class is the class with the largest frequency. • Sometimes the midpoint of the class is used rather than the boundaries.
C l a s s F r e q u e n c y , f C l a s s F r e q u e n c y , f 1 5 . 5 - 2 0 . 5 3 1 5 . 5 - 2 0 . 5 3 2 0 . 5 - 2 5 . 5 5 2 0 . 5 - 2 5 . 5 5 2 5 . 5 - 3 0 . 5 7 2 5 . 5 - 3 0 . 5 7 3 0 . 5 - 3 5 . 5 3 3 0 . 5 - 3 5 . 5 3 3 5 . 5 - 4 0 . 5 2 5 3-2 The Mode for a Grouped Frequency Distribution -Example 3-44 Given the table below, find the mode. Modal Class 3 5 . 5 - 4 0 . 5 2 5
3-2 The Midrange 3-45 • The midrangeis found by adding the lowest and highest values in the data set and dividing by 2. • The midrange is a rough estimate of the middle value of the data. • The symbol that is used to represent the midrange isMR.
3-2 The Midrange -Example 3-46 • Last winter, the city of Brownsville, Minnesota, reported the following number of water-line breaks per month. The data is as follows: 2, 3, 6, 8, 4, 1. Find the midrange.MR = (1 + 8)/2 = 4.5. • Note:Extreme values influence the midrange and thus may not be a typical description of the middle.
3-2 The Weighted Mean 3-47 • Theweighted meanis used when the values in a data set are not all equally represented. • Theweighted meanof a variable Xisfound by multiplying each value by its corresponding weight and dividing the sum of the products by the sum of the weights.
3-2 The Weighted Mean 3-48 T h e w e i g h t e d m e a n w X w X . . . w X w X + + + X = = 1 1 2 2 n n w w . . . w w + + + 1 2 n w h e r e w , w , . . . , w a r e t h e w e i g h t s 1 2 n f o r t h e v a l u e s X , X , . . . , X . 1 2 n
Distribution Shapes 3-49 • Frequency distributions can assume many shapes. • The three most important shapes arepositively skewed, symmetrical, andnegatively skewed.
Positively Skewed 3-50 Y P o s i t i v e l y S k e w e d X M o d e < M e d i a n < M e a n