210 likes | 522 Views
Exploring Normal Distributions with the EMPIRICAL RULE. Normal Distribution. A normal distribution is a continuous, symmetrical, bell-shaped distribution of a variable. Characteristics of Normal Distribution. 1. A normal distribution curve is bell-shaped.
E N D
Exploring Normal Distributions with the EMPIRICAL RULE
Normal Distribution A normal distribution is a continuous, symmetrical, bell-shaped distribution of a variable.
Characteristics of Normal Distribution 1. A normal distribution curve is bell-shaped. 2. The mean, median, and mode are equal and are located at the center of the distribution. 3. The curve is unimodal (i.e., it has only one mode) 4. The curve is symmetric about the mean (its shape is the same on both sides of a vertical line passing through the center) 5. The curve is continuous. There are no gaps or holes. For each value of X, there is a corresponding value of Y.
Characteristics of Normal Distribution 6. The curve never touches the x axis. No matter how far in either direction the curve extends, it never meets the x axis – but it gets increasingly closer. 7. The total area under a normal distribution curve is equal to 1.00, or 100%.
The area under the part of a normal curve that lies within 1 standard deviation of the mean is approximately 0.68, or 68%; within 2 standard deviations, about 0.95, or 95%; and within 3 standard deviations, about 0.997, or 99.7%.
The Empirical Rule(a.k.a. the “68-95-99.7 Rule”) • In a normal distribution, almost all data will lie within 3 standard deviations of the mean. • About 68% of all data lies within 1 standard deviation of the mean. • About 95% of all data lies within 2 standard deviations of the mean. • About 99.7% of all data lies within 3 standard deviations of the mean.
Why Do We Need It? The Empirical Rule is most often used in statistics for forecasting or predicting final outcomes. After a standard deviation is calculated, and before exact data can be collected, the Empirical Rule can be used to estimate impending data. The probability based on the Empirical Rule can be used if gathering appropriate data may be time consuming, or even impossible to obtain.
Tips for Using the Empirical Rule Before applying the Empirical Rule it is a good idea to identify the data being described and the value of the mean and standard deviation. Sketch a graph summarizing the information provided by the empirical rule and identify the percentages for each region of the graph (+/- 1, +/- 2, +/- 3) Remember that data must be normally distributed for the Empirical Rule to apply.
Example: Estimating with the Empirical Rule You have purchased fluorescent light bulbs for your home. The average bulb life is 500 hours with a standard deviation of 24. The data is normally distributed. One of your bulbs burns out at 450 hours. Would you send the bulb back for a refund?
Problem: You have purchased fluorescent light bulbs for your home. The average bulb life is 500 hours with a standard deviation of 24. The data is normally distributed. One of your bulbs burns out at 450 hours. Would you send the bulb back for a refund? Solution: • According to the Empirical Rule: • 68% of the light bulbs should last between 500 ± 24 or between 476 to 524 hours. • 95% of the light bulbs should last between 500 ± 2(24) or between 452 to 548 hours. • 99.7% of the light bulbs should last between 500 ± 3(24) or between 428 to 572 hours. • If the light bulb only lasted 450 hours, I would consider it a defective bulb. Less than 2.5% should last less than 452 hours.
The mean is 0.9 and the standard deviation is 0.1. If one standard deviation is added to the mean, the result is 1.0 ounce. Therefore, dispensing 1 or more ounces falls into the category above one standard deviation to the right of the mean. Using the Empirical Rule, 16% of data falls at or above 1 standard deviation. 16% x 500 = 80 times to dispense one or more ounces of mustard.
The 99% implies a distribution within 3 standard deviations of the mean. The difference from 585 milliliters to 595 milliliters is 10 milliliters. Symmetrically divided, there are 5 milliliters used to create 3 standard deviations on one side of the mean. Dividing 5 by 3, we get the standard deviation to be 1.67 milliliters, to the nearest hundredth.
Subtracting, we see that 1 s.d. below is 439 days, an exact match to our question and an indication that the Empirical Rule can be used to find the answer. The question is asking what percent of a distribution is beyond one standard deviation to the left of the mean. Answer: 16%
a. 50% In a normal distribution, the mean divides the data into two equal areas. Since 11 is the mean, 50% of the data is above 11 and 50% is below 11. b. 14 is exactly two standard deviations above the mean. Using the Empirical Rule we see that 2.5% will fall above two standard deviations. Probability is 0.025.
Five hundred values are normally distributedwith a mean of 125 and a standard deviation of 10.a. What percent of the values lies in the interval 115 - 135, to the nearest percent? b. What interval about the mean includes 95% of the data? a. What percent of the values is in the interval 115 - 135?mean + one standard deviation = 135mean - one standard deviation = 115 Percent within one standard deviation of the mean = 68% b. 2 standard deviations about the mean for a total interval size of 40, with the mean in the center. mean + 2 standard deviations = 145mean - 2 standard deviations = 105Interval: [105,145]
Your Turn The scores for all high school seniors taking the verbal section of the Scholastic Aptitude Test (SAT) in a particular year had a mean of 490 and a standard deviation of 100. The distribution of SAT scores is normal. What percentage of seniors scored between 390 and 590 on this SAT test? One student scored 795 on this test. How did this student do compared to the rest of the scores? A rather exclusive university only admits students who were among the highest 16% of the scores on this test. What score would a student need on this test to be qualified for admittance to this university?
1. What percentage of seniors scored between 390 and 590 on this SAT test? • Calculate the difference between the mean and the test scores: • |390-490| = 100 1 s.d. • |590-490| = 100 1 s.d. So, about 68% of seniors would score between 390 and 590 on the SAT test. The mean = 490 and the standard deviation = 100. So, we can draw the Normal Curve to depict the data.
2. One student scored 795 on this test. How did this student do compared to the other scores? Since about 99.7% of seniors would score between 190 and 790 on the SAT test, this is an exceptionally high score. Only 0.15% of students would have scores above 790. This is an example of using the Empirical Rule to estimate an answer. Calculate the difference between the mean and the student’s test score:795-390 = 405 This student’s score is more than 3 std. deviations from the mean (3 standard deviations = 3 * 100)
3 . A rather exclusive university only admits students who were among the highest 16% of the scores on this test. What score would a student need on this test to be qualified for admittance to this university? • So, to qualify for admittance to this university, a student would need to score 590 or above on the SAT test. • About 68% of the scores are between 390 and 590, so this leaves 32% of the scores outside this interval. • Since a bell-shaped curve is symmetrical, one-half of the scores (16%) are on each end of the distribution. • 16% of the students scored above 590 on this SAT test.