1.37k likes | 3.04k Views
AS-Level Maths: Statistics 1 for Edexcel. S1.2 Calculating means and standard deviations. This icon indicates the slide contains activities created in Flash. These activities are not editable. For more detailed instructions, see the Getting Started presentation. 1 of 26.
E N D
AS-Level Maths:Statistics 1for Edexcel S1.2 Calculating means and standard deviations This icon indicates the slide contains activities created in Flash. These activities are not editable. For more detailed instructions, see the Getting Started presentation. 1 of 26 © Boardworks Ltd 2005
Means • Calculating means • Calculating standard deviations • Coding Contents © Boardworks Ltd 2005 2 of 26
Mean The mean is the most widely used average in statistics. It is found by adding up all the values in the data and dividing by how many values there are. Notation: If the data values are , then the mean is This symbol means the total of all the x values This is the mean symbol Note: The mean takes into account every piece of data, so it is affected by outliers in the data. The median is preferred over the mean if the data contains outliers or is skewed.
Mean If data are presented in a frequency table: then the mean is
Mean Example: The table shows the results of a survey into household size. Find the mean size. TOTAL 114 343 To find the mean, we add a 3rd column to the table. Mean = 343 ÷ 114 = 3.01
Standard deviation • Calculating means • Calculating standard deviations • Coding Contents © Boardworks Ltd 2005 6 of 26
Standard deviation There are three commonly used measures of spread (or dispersion) – the range, the inter-quartile range and the standard deviation. The standard deviation is widely used in statistics to measure spread. It is based on all the values in the data, so it is sensitive to the presence of outliers in the data. The variance is related to the standard deviation: variance = (standard deviation)2 The following formulae can be used to find the variance and s.d.
Standard deviation Example: The mid-day temperatures (in °C) recorded for one week in June were: 21, 23, 24, 19, 19, 20, 21 First we find the mean: °C So variance = 22 ÷ 7 = 3.143 So, s.d. = 1.77°C(3 s.f.) Total: 22
Standard deviation There is an alternative formula which is usually a more convenient way to find the variance: Therefore, and
Standard deviation Example (continued): Looking again at the temperature data for June: 21, 23, 24, 19, 19, 20, 21 We know that °C Also, = 3109 So, °C Note: Essentiallythe standard deviation is a measure of how close the values are to the mean value.
Calculating standard deviation from a table When the data is presented in a frequency table, the formula for finding the standard deviation needs to be adjusted slightly: Example: A class of 20 students were asked how many times they exercise in a normal week. Find the mean and the standard deviation.
Calculating standard deviation from a table TOTAL: 20 38 116 The table can be extended to help find the mean and the s.d.
Calculating standard deviation from a table If data is presented in a grouped frequency table, it is only possible to estimate the mean and the standard deviation. This is because the exact data values are not known. An estimate is obtained by using the mid-point of an interval to represent each of the values in that interval. Example: The table shows the annual mileage for the employees of an insurance company. Estimate the mean and standard deviation.
Calculating standard deviation from a table TOTAL 45 480,000 6,587,500,000 miles miles
Most calculators have a built-in function which will find the standard deviation for you. Learn how to use this facility on your calculator. Notes about standard deviation Here are some notes to consider about standard deviation. • In most distributions, about 67% of the data will lie within 1 standard deviation of the mean, whilst nearly all the data values will lie within 2 standard deviations of the mean. • Values that lie more than 2 standard deviations from the mean are sometimes classed as outliers – any such values should be treated carefully. • Standard deviation is measured in the same units as the original data. Variance is measured in the same units squared.
The mean and the standard deviation of the ages of the people in the queue on Monday evening were 29 and 6.2 respectively. Compare the ages of the people queuing at the cinema in the afternoon with those in the evening. Examination-style question Examination-style question: The ages of the people in a cinema queue one Monday afternoon are shown in the stem-and-leaf diagram: Explain why the diagram suggests that the mean and standard deviation can be sensibly used as measures of location and spread respectively. Calculate the mean and the standard deviation of the ages.
The cinemagoers in the evening had a smaller mean age, meaning that they were, on average, younger than those in the afternoon. The standard deviation for the ages in the evening was also smaller, suggesting that the evening audience were closer together in age. Examination-style question a) The mean and the standard deviation are appropriate, as the distribution of ages is roughly symmetrical and there are no outliers. b)
Combining sets of data Sometimes in examination questions you are asked to pool two sets of data together. Example: Six male and five female students sit an A-level examination. The mean marks were 52% and 57% for the males and females respectively. The standard deviations were 14 and 18 respectively. Find the combined mean and the standard deviation for the marks of all 11 students.
Combining sets of data Let be the marks for the 6 male students. Let be the marks of the 5 female students. To find the overall mean, we first need to find the total marks for all 11 students. Therefore So the combined mean is:
Combining sets of data To find the overall standard deviation, we need to find the total of the marks squared for all 11 students. Notice that the formula rearranges to give Therefore, So the combined s.d. is: (to 3 s.f.)
Coding • Calculating means • Calculating standard deviations • Coding Contents © Boardworks Ltd 2005 21 of 26
Coding Coding is a technique that can simplify the numerical effort required in finding a mean or standard deviation. Enter some data below, and see how it changes when you add or multiply by different numbers.
Coding Adding So, if a number b is added to each piece of data, the mean value is also increased by b. The standard deviation is unchanged. Multiplying If each piece of data is multiplied by a, the mean value is multiplied by a. The standard deviation is also multiplied by a. More formally, if then:
Coding Example: Find the mean and the standard deviation of the values in the table. Use the transformation below to help you. Using the given transformation, add a y column to the table.
Coding Total 20 35 85 To find the mean: To find the s.d.:
Coding You have now found the mean and standard deviation of y. To find them for the x values, you must reverse the coding. We can rearrange: to get: Therefore the mean of x is: And the standard deviation of x is: 10 × 1.09 = 10.9 Note how the coding helped to simplify the calculations by making the numbers smaller.