150 likes | 294 Views
Outline: Numerical Methods. Measures of Central Tendency Representative value Mean Median, mode, midrange Measures of Dispersion (Variability) How are data points are deviated from the mean? Range variance, standard deviation. Central Tendency: Mean. Arithmetic average
E N D
Outline: Numerical Methods • Measures of Central Tendency • Representative value • Mean • Median, mode, midrange • Measures of Dispersion (Variability) • How are data points are deviated from the mean? • Range • variance, standard deviation (c) 2007 IUPUI SPEA K300 (4392)
Central Tendency: Mean • Arithmetic average • Sum divided by N (# of observations) • Key statistic in data analysis (c) 2007 IUPUI SPEA K300 (4392)
Central Tendency: Mean Question 12 on page 117 The mean is 100.12963 = 10814 / 108 (c) 2007 IUPUI SPEA K300 (4392)
Central Tendency: Median • Midpoint of data arranged in order • When even number of observations, mean of the two data points in the middle • Useful when data are skewed to the right or left substantially. (c) 2007 IUPUI SPEA K300 (4392)
Central Tendency: Mode • Value that occurs most often • Peak in the histogram • Bimodal with two peaks (modes) • Figure 3-1 on page 115 (c) 2007 IUPUI SPEA K300 (4392)
Central Tendency: Midrange • Mean of minimum and maximum values • (minimum + maximum)/2 (c) 2007 IUPUI SPEA K300 (4392)
Central Tendency: Others • Weighted mean when individual data points should be weighted differently (other than 1) • Trimmed mean in the presence of outliers (extremely large or small data points) (c) 2007 IUPUI SPEA K300 (4392)
Quantiles • Quantiles are points taken at equal intervals from CDF (cumulative density function) • 100 quantiles: percentiles • 10 quantiles: dociles • 5 quantiles: quintiles • 4 quantiles: quartiles • 2 quantiles: ? (c) 2007 IUPUI SPEA K300 (4392)
Percentiles • Percentiles divide data into 100 groups with an equal interval • 100 quantiles • Nth percentile is located at nth from the smallest in data w/ 100 observations • Median = 50th percentile • Table 3-3 on page 141 • Figure 3-5 on page 142 (c) 2007 IUPUI SPEA K300 (4392)
Quartiles • 4-quantiles • 1st quartile (25th percentile) • 2st (50th percentile or median) • 3st (75th percentile) • IQR (interquartile range) = 3Q-1Q • Box plot include 1Q, 2Q, 3Q, minimum, maximum (c) 2007 IUPUI SPEA K300 (4392)
Quartiles in a Box Plots (c) 2007 IUPUI SPEA K300 (4392)
Quartiles in Histograms (c) 2007 IUPUI SPEA K300 (4392)
Example 1: Example 3-38, p159 (c) 2007 IUPUI SPEA K300 (4392)
Example 2: SAS Output SAS includes mean in the box plot Histogram # Boxplot 2.75+* 1 | .** 4 | .******** 23 | .**************** 46 | .*********************** 68 +-----+ .*************************** 80 | | .*************************************** 116 *--+--* .********************** 64 +-----+ .******************* 56 | .********* 27 | .***** 13 | -2.75+* 2 | ----+----+----+----+----+----+----+---- * may represent up to 3 counts (c) 2007 IUPUI SPEA K300 (4392)
Example 3: Box Plots | 9 + | | | | | | | | | | | 8 + | *-----* | | | | | | | + | | | | | 7 + +-----+ +-----+ | | | | | | | | | | | | 6 + | | | | | | | | | | | | | | | | 5 + | | +-----+ 0 | | + | | | | | | | | | | | | | 4 + *-----* | | | | | | | | | | | + | | | | | | 3 + +-----+ *-----* | | | | | | | | | | | | 2 + | +-----+ | | | | | | | | | 1 + | | ------------+-----------+-----------+----------- • Site 102 134 137 (c) 2007 IUPUI SPEA K300 (4392)