1 / 24

Normal Approximation for Binomial Probabilities

Learn to use the normal distribution to approximate binomial probabilities efficiently, especially for large n and x values. Understand the criteria for using the normal approximation and the correction for continuity method.

ewa
Download Presentation

Normal Approximation for Binomial Probabilities

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Chapter 7Section 5 The Normal Approximation to the Binomial Probability Distribution

  2. 1 Chapter 7 – Section 5 • Learning objectives • Approximate binomial probabilities using the normal distribution

  3. Chapter 7 – Section 5 • Recall from section 6.2 that a binomial experiment is one where • An experiment is performed n independent times • Each time, or trial, has two possible outcomes – success or failure • The outcome of success – p – is the same for each trial

  4. Chapter 7 – Section 5 • The binomial probabilities are the probabilities of observing x successes in n trials • If X has a binomial distribution, then P(X = x) = nCx px (1 – p)n-x • The binomial probabilities are the probabilities of observing x successes in n trials • If X has a binomial distribution, then P(X = x) = nCx px (1 – p)n-x • We would like to calculate these probabilities, for example when both n and x are large numbers

  5. Chapter 7 – Section 5 • If both n and x are large numbers, then calculating the number of combinations nCx in P(X = x) = nCx px (1 – p)n-x is time-consuming since we need to compute terms such as n! and x! • If both n and x are large numbers, then calculating the number of combinations nCx in P(X = x) = nCx px (1 – p)n-x is time-consuming since we need to compute terms such as n! and x! • Calculations for probabilities such as P(X≥ x) are even more time-consuming

  6. Chapter 7 – Section 5 • For example, if n = 1000, then P(X≥ 700) = P(X = 700) + P(X = 701) + … + P(X = 1000) which is the sum of 301 terms • For example, if n = 1000, then P(X≥ 700) = P(X = 700) + P(X = 701) + … + P(X = 1000) which is the sum of 301 terms • Each P(X = x) by itself is an involved calculation, and we need to do this 301 times, so this becomes an extremely long and tedious sum to compute!

  7. Chapter 7 – Section 5 • However, we can simplify this calculation • However, we can simplify this calculation • As the number of trials, n, increases, then the distribution of binomial random variables becomes more and more bell shaped • However, we can simplify this calculation • As the number of trials, n, increases, then the distribution of binomial random variables becomes more and more bell shaped • As a rule of thumb, if np(1-p)≥ 10, then the distribution will be reasonably bell shaped • However, we can simplify this calculation • As the number of trials, n, increases, then the distribution of binomial random variables becomes more and more bell shaped • As a rule of thumb, if np(1-p)≥ 10, then the distribution will be reasonably bell shaped

  8. Chapter 7 – Section 5 • To show that the approximation is pretty good, the following chart shows the binomial withn = 40, p = 0.5, and its corresponding normal

  9. Chapter 7 – Section 5 • Even if the original binomial trial isn’t symmetric, such as for p = 0.3, the normal is still a good approximation (n = 60 in the chart below)

  10. Chapter 7 – Section 5 • Because the binomial distribution is approximately normal when np(1-p)≥ 10, the normal distribution can be used to approximate a binomial distribution • Because the binomial distribution is approximately normal when np(1-p)≥ 10, the normal distribution can be used to approximate a binomial distribution • Remember that a binomial distribution has • Mean = n p • Standard deviation = √ n p (1 – p)

  11. Chapter 7 – Section 5 • It would make sense to use an approximation that has the same mean and standard deviation • It would make sense to use an approximation that has the same mean and standard deviation • Thus to approximate a binomial random variable X (with parameters n and p), we use a normal random variable Y with parameters • μ = n p • σ = √ n p (1 – p)

  12. Chapter 7 – Section 5 • This matches the mean and the standard deviation between X and Y • We could then approximate the probabilities of X with probabilities of Y P(X ≤ a) = P(Y ≤ a) • This matches the mean and the standard deviation between X and Y • We could then approximate the probabilities of X with probabilities of Y P(X ≤ a) = P(Y ≤ a) • However, there is a slightly better method • This better method is called the correctionforcontinuity

  13. Chapter 7 – Section 5 • First we will look at an example • Consider a binomial variable X with • n = 60 trials and • p = 0.3 probability of success • First we will look at an example • Consider a binomial variable X with • n = 60 trials and • p = 0.3 probability of success • This variable has • Mean = n p = 18 • Standard deviation = √ n p (1 – p) = 3.55 • First we will look at an example • Consider a binomial variable X with • n = 60 trials and • p = 0.3 probability of success • This variable has • Mean = n p = 18 • Standard deviation = √ n p (1 – p) = 3.55 • Thus we use a normal random variable Y with mean μ = 18 and standard deviation σ = 3.55

  14. Chapter 7 – Section 5 • The graph shows the situation for P(X = 17) • The pink curve shows the binomial probabilities for X • The blue curve shows the normal approximation for Y

  15. Chapter 7 – Section 5 • A better approximation for P(X = 17)

  16. Chapter 7 – Section 5 • A better approximation for P(X = 17) would be P(16 ½ ≤ Y ≤ 17 ½)

  17. Chapter 7 – Section 5 • Thus a better approximation for P(X = a) is P(a – ½ ≤ Y ≤ a + ½) • Thus a better approximation for P(X = a) is P(a – ½ ≤ Y ≤ a + ½) and a better approximation for P(X≤ a) is P(Y ≤ a + ½) • This is the correction for continuity

  18. Chapter 7 – Section 5 • Now we look back at the example • We want to approximate P(X≤ 17) for a binomial variable X with n = 60 trials andp = 0.3 probability of success • Now we look back at the example • We want to approximate P(X≤ 17) for a binomial variable X with n = 60 trials andp = 0.3 probability of success • X has mean 18 and standard deviation 3.55 • We use a normal random variable Y with mean 18 and standard deviation 3.55

  19. Chapter 7 – Section 5 P(X≤ 17) ≈ P(Y ≤ 17.5) • Since Y has mean μ = 18 and standard deviation σ = 3.55, the Z-score of 17.5 is and P(Z ≤ – 0.14) = 0.4440 • Thus the normal approximation to the binomial is P(X ≤ 17) ≈ 0.4440

  20. Chapter 7 – Section 5 • From the normal, an approximation is P(X ≤ 17) ≈ 0.4440 • The actual value (computed from Excel) is P(X ≤ 17) = 0.4514 • These are quite close • However, technology can compute the Binomial probabilities directly, so the normal approximation is not needed as much these days

  21. Summary: Chapter 7 – Section 5 • The binomial distribution is approximately bell shaped for large enough values of np(1 – p) • The normal distribution, with the same mean and standard deviation, can be used to approximate this binomial distribution • With technology, however, this approximation is not as needed as it used to be

  22. Example: Chapter 7 – Section 5 • in the United States never use the Internet. (Source: gallup.com) Suppose 200 people are randomly selected. Use the normal approximation to the binomial to • a. approximate the probability that exactly 54 of them will never use the Internet. (0.0635) • b. approximate the probability that more than 45 of them will never use the Internet. (0.9121) • c. approximate the probability that more than 60 of them will never use the Internet. (0.1503) • d. approximate the probability that the number of them who will never use the Internet is between 46 and 60, inclusive. (0.7618)

  23. Example: Chapter 7 – Section 5 • Darren did not prepare for a multiple-choice test in his biochemistry class. He is forced to randomly guess on each question. Suppose the test has 60 questions with 4 possible answers for each question. Use the normal approximation to the binomial to • a. approximate the probability that he will get exactly 12 questions right. (0.0797) • b. approximate the probability that he will get more than 15 questions right. (0.4407) • c. approximate the probability that he will get more than 20 questions right. (0.0505) • d. approximate the probability that he will get more than 25 questions right. (0.0009) • e. approximate the probability that the number of questions he will get is between 12 and 17, inclusive. (0.6236)

  24. Example: Chapter 7 – Section 5 • Most nurseries guarantee their plants for one year. If the plant dies within the first year of the purchase date, it can be returned for credit. Experience has shown that about 5% of all plants purchased from Evergreen Nursery will die in their first year. Evergreen Nursery sold 250 walnut trees this season. Use the normal approximation to the binomial to • a. approximate the probability that exactly 12 walnut trees will be returned for credit. (0.1142) • b. approximate the probability that more than 15 walnut trees will be returned for credit. (0.1920) • c. approximate the probability that more than 20 walnut trees will be returned for credit. (0.0101) • d. approximate the probability that more than 25 walnut trees will be returned for credit. (0.0001)

More Related