1 / 25

Dot Plots

Dot Plots. A dot plot is a type of graphical display used to compare frequency counts within categories or groups. Dot Plots. Dot Plots - Symmetry. A symmetric distribution can be divided at the centre so that each half is a mirror image of the other. Dot Plots - Skewness.

mulan
Download Presentation

Dot Plots

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Dot Plots A dot plot is a type of graphical display used to compare frequency counts within categories or groups

  2. Dot Plots

  3. Dot Plots - Symmetry A symmetric distribution can be divided at the centre so that each half is a mirror image of the other.

  4. Dot Plots - Skewness

  5. Dot Plots - Outliers A data point that diverges greatly from the overall pattern of data is called an outlier.

  6. Dot Plots e.g. The dotplot below shows the number of televisions owned in each household in a city block

  7. Calculating Statistics • Mean (average) – The mean can be affected by extreme values • Median – middle value data. Not affected by extreme values • Mode – the most common value/s

  8. Calculating Statistics • Range – a measure of how spread out the data is. The difference between the highest and lowest values. • Lower Quartile (LQ) – halfway between the lowest value and the median • Upper Quartile (UQ) – halfway between the highest value and the median • Interquartile Range (IRQ) – the difference between the LQ and the UQ. This is a measure of the spread of the middle 50% of the data.

  9. e.g. 1 1 2 2 3 3 4 4 4 5 6 7 18 LQ UQ The following data represents the number of flying geese sighted on each day of a 13-day tour of England 5 1 2 6 3 3 18 4 4 1 7 2 4 Find: a.) the min and max number of geese sighted b.) the median c.) the mean d.) the upper and lower quartiles e.) the IQR f.) extreme values Min – 1 Max - 18 Order the data - 4 Add all the numbers and divide by 13 – 4.62 (2 dp) UQ – 2 + 2 = 2 UQ – 5 + 6 = 5.5 2 2 5.5 – 2 = 3.5 18

  10. UQ e.g. LQ UQ LQ A used car dealership owns 2 yards. They are forced to close one of them. The weekly sales for each dealership were recorded over a 10 week period * Leave out any extreme values 17 14 27 25 21 23 22.3 20.4 19.5 17.5 22.5 26

  11. Note 2: Calculating Averages • In statistics, there are 3 types of averages: • mean • median • mode Mode Median Mean - x The middle value when all values are placed in order The most common value(s) Affected by extreme values Not Affected by extreme values These are all measures of central tendency

  12. Note 3: Quartiles An indication of the spread of data. Lower Quartile – Q1 Upper Quartile – Q3 Median of bottom half Median of top half First identify the median to split the data into halves – do not include the median in either of these halves e.g. 40, 41, 42, 43, 44, 45, 49, 52, 52, 53 LQ median UQ Range – how spread out the data is. It is the difference between the maximum and minimum values Inter-quartile Range - the difference between the UQ & LQ – measures the spread of the middle 50% of data

  13. Note 3: Quartiles e.g. Calculate the median, and lower and upper quartiles for this set of numbers 35 95 29 95 49 82 78 48 14 92 1 82 43 89 Arrange the numbers in order 1 14 29 35 43 48 49 78 82 82 89 92 95 95 LQ UQ median Median – halfway between 49 and 78, i.e. = 63.5 LQ – bottom half has a median of 35 UQ – top half has a median of 89

  14. Note 4: Statistical Tables Tables are efficient in organising large amounts of data. If data is counted, you can enter directly into the table using tally marks e.g 33 students in 11JI were asked how many times they bought lunch at the canteen. Below is the tally of individual results. 0 4 0 3 5 0 5 5 0 2 1 0 5 2 3 0 0 5 5 1 2 5 5 3 0 0 1 5 0 5 1 3 0 The data can be summarised in a frequency table

  15. Note 4: Statistical Tables Calculate the mean = = = = 2.3 Why is this mean misleading? Most students either do not buy their lunch at the canteen or buy it there every day. Total 33

  16. 47.In a javelin competition two competitiors were vying to represent their province in a national competition. On the basis of these results who would you select and why? Results in metres Peter: 42.4, 39.5, 43.2, 47.2, 31.6, 40.2, 41.4, 38.5, 29.5, 34.4 Quade: 37.8, 41.2, 40.8, 42.4, 41.2, 36.7, 42.3, 41.9, 34.2, 35.7 Quade is more consistent – Lower inter-quartile range & stnd dev. 38.8 39.4 29.5 34.2 Peterhas a higher UQ and longer maximum throw Which is more important ? 34.4 36.7 39.85 41.0 42.4 41.9 47.2 42.4 Choose: Peter 5.2 2.9 8.0 5.2 NuLake Q47. pg 229

  17. Note 5: Data Display Box and Whisker Plot – comparing data Male Female x median minimum maximum Upper quartile extreme value Lower quartile IQR

  18. Note 5: Data Display Line Graphs – identify patterns & trends over time Interpolation - Reading in between tabulated values Extrapolation - Estimating values outside of the range Looking at patterns and trends 0 1 2 3 4 5 6 7 8 9 10 11

  19. Note 5: Data Display Pie Graph – show proportion Multiply each percentage of the pie by 360° 60% - 0.6 × 360° = 216° Scatter Graph – show relationship between 2 sets of data Plot a number of coordinates for the 2 variables Draw a line of best fit - trend Reveal possible outliers (extreme values)

  20. Note 5: Data Display Histogram– display grouped continuous data – area represents the frequency frequency Bar Graphs– display discrete data Distance (cm) – counted data – draw bars (lines) with the same width – height is important factor

  21. Note 5: Data Display Stem & Leaf – Similar to a bar graph but it has the individual numerical data values as part of the display – the data is ordered, this makes it easy to locate median, UQ, LQ 3 3 4 8 5 10 9 8 8 3 11 2 3 6 7 8 Back to Back Stem & Leaf – useful to compare spread & shape of two data sets 4 2 0 12 1 9 9 3 3 13 0 2 2 14 5 Key: 10 3 means 10.3

  22. Note 6: Cumulative Frequency When dealing with large data sets or grouped data, a cumulative frequency can be used to find medians and quartiles. Cumulative frequency is calculated by ‘accumulating’ the frequencies as we move down the table.

  23. Example: The table shows the number of customers in a small cinema The shaded figure shows that on 25 occasions there were fewer than 40 customers

  24. Lower Quartile = 40 Median = 48 Upper Quartile = 57 What percentage of the time was there 65 or more customers in the cinema? 100 – 86 = 14%

More Related