190 likes | 340 Views
Measures of Central Location (Averages) and Percentiles. BUSA 2100, Section 3.1. Introduction. Values of a variable tend to cluster around a central point. A measure of central location indicates a center, average, or typical value.
E N D
Measures of Central Location(Averages) and Percentiles BUSA 2100, Section 3.1
Introduction • Values of a variable tend to cluster around a central point. • A measure of central location indicates a center, average, or typical value. • A measure of central location (average) helps summarize data concisely with one value.
1st Type of Average: Mode • Definition:The mode is the item that occurs most often. • Example 1 – Consider this data set: {96,91,91,87,84, 82,79,75,72,69,62}. • The mode is 91 since it occurs twice. • But 91 isn’t a “central” or “typical” value. • Another problem: some data sets have no mode, e.g. set above with one 91 removed.
Mode (Page 2) • Some data sets have more than one mode. Example: Height of adults is bimodal (two modes). Why? • So the mode is not very useful and not reliable except for categorical data. • Example 2: (categorical data) Ask students to state their favorite kind of pie.
2nd Type of Average: Median • Definition: The median is the middle item of a ranked set of data. • It is the (n+1)/2 th item in a ranked set of n items. • Example 1: Find the median of this set of 7 items. {75,64,82,96,72,47,59}
Median (Page 2) • Example 2: Add 89 to the previous set. {89,75,64,82,96,72,47,59}
3rd Type of Average: Mean • Definition: The sample mean, X-bar = Sum of the X’s divided by n, where n = number of items in the data set (sample). • Example 1: {89,75,64,82,96,72,47,59}
Mean (Page 2) • Mean is the most widely used and best measure of central location except in one situation (to be discussed later). • Advantages of the mean: (1) More comprehensive because it uses all of the data (not just the center item(s)). • (2) Combined or weighted means can be calculated.
Mean (Page 3) • Example 2 (combined mean): Class #1 had a mean test score of 80; Class #2 had a mean test score of 60. • What is the overall mean for both classes combined? • Is it 70, the average of 80 and 60?
Mean (Page 4) • Class #1 has 40 students; Class #2 has 10 students.
Mean (Page 5) • Example 3 (weighted mean): In a course, a professor gives 3 tests and a final exam, and requires a project. • The final exam counts 1 1/2 times as much as each test and the project counts twice as much as each test. • Charles Malone made 80, 74, 67, 86, and 90. What is his course average?
Mean (Page 6) • Note: The unweighted mean is 79.4.
Mean (Page 7) • The median is preferred to the mean if there are extreme values present. • Example 4: Incomes for 5 families are: {$30K, $40K, $50K, $60K, $820K} • Mean = $1,000,000 / 5 = $200,000, but this is not a “center” or “typical” value. • Median = $50,000 (more accurate)
Percentiles • Definition: The pth percentile is a value that is > p percent of the values in a data distribution. • Values for p: 0, 1, 2,...,98, 99, 99.5, 99.9 • Example: If you were in the 86th percentile on a test, what does that mean?
Percentiles (Page 2) • Three steps for calculating percentiles: • (1) Arrange data in ascending order • (2) Calculate index (rank): i = (p/100)* n. • (a) If i is not a whole number, round up to the next whole number. • (b) If i is a whole number, use i + .5 . • (3) Identify the answer.
Percentiles (Page 3) • Example: {2450, 2500, 2650, 2430, 2355, 2260, 2490, 2680, 2540, 2775, 2525, 2465, 2610, 2390} are monthly salaries for 14 business graduates. • Find the 67th percentile. • Step 1: Arrange in ascending order: {2260, 2355, 2390, 2430, 2450, 2465, 2490, 2500, 2525, 2540, 2610, 2650, 2680, 2775}
Percentiles (Page 4) • Find the 50th percentile.
Percentiles (Page 5) • The 50th percentile is the median. • The 25th percentile is the 1st quartile. • The 75th percentile is the 3rd quartile.