
Lesson 8 - R

This lesson provides a comprehensive review of the binomial and geometric distributions. It covers the concepts, calculations, and applications of these distributions, as well as the use of technology for solving probability questions in these settings.

lindsayk

Presentation Transcript


  1. Lesson 8 - R Review of Chapter 8 Discrete PDFs Binomial and Geometric

  2. Objectives • Explain what is meant by a binomial setting and binomial distribution. • Use technology to solve probability questions in a binomial setting. • Calculate the mean and variance of a binomial random variable. • Solve a binomial probability problem using a Normal approximation. • Explain what is meant by a geometric setting. • Solve probability questions in a geometric setting. • Calculate the mean and variance of a geometric random variable.

  3. Vocabulary • None new

  4. Binomial PDF Objectives • Identify a random variable as binomial by verifying four conditions: two outcomes (success and failure); fixed number of trials; independent trials; and the same probability of success for each trial. • Use technology or the formula to determine binomial probabilities and to construct probability distribution tables and histograms. • Calculate cumulative distribution functions for binomial random variables, and construct cumulative distribution tables and histograms. • Calculate means (expected values) and standard deviations of binomial random variables. • Use a Normal approximation to the binomial distribution to compute probabilities.

  5. Geometric PDF Objectives • Identify a random variable as geometric by verifying four conditions: two outcomes (success and failure); independent trials; the same probability of success for each trial; and the count of interest is the number of trials required to get the first success. • Use formulas or technology to determine geometric probabilities and to construct probability distribution tables and histograms. • Calculate cumulative distribution functions for geometric random variables, and construct cumulative distribution tables and histograms. • Calculate expected values and standard deviations of geometric random variables.

  6. English Phrases • P(X = A) = pdf(A) • P(X ≤ A) = cdf(A), the cumulative probability • P(X > A) = 1 – P(X ≤ A) • ∑P(x) = 1 • [Figure: histogram of P(X) against the values of the discrete variable X, with X = A marked]

  7. Binomial Probability Criteria A random variable is said to be binomial provided: • For each trial there are two mutually exclusive (disjoint) outcomes: success or failure • The trials are independent • The probability of success is the same for each trial of the experiment • The experiment is performed a fixed number of times. Each repetition is called a trial. The most important skill for using binomial distributions is the ability to recognize situations to which they do and don’t apply.

  8. Binomial PDF The probability of obtaining x successes in n independent trials of a binomial experiment, where the probability of success is p, is given by: $P(x) = {}_nC_x \, p^x (1 - p)^{n-x}$, $x = 0, 1, 2, 3, \ldots, n$. ${}_nC_x$ is also called a binomial coefficient and is defined as the number of combinations of n items taken x at a time: ${}_nC_x = \binom{n}{x} = \dfrac{n!}{x!\,(n - x)!}$, where $n! = n \cdot (n-1) \cdot (n-2) \cdots 2 \cdot 1$.
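The formula on this slide can be sketched in Python as follows. This is only an illustration: the function name `binomial_pmf` and the example values n = 4 and p = 0.5 are assumptions chosen here, not part of the lesson. The binomial coefficient comes from the standard library's `math.comb`.

```python
from math import comb

def binomial_pmf(n: int, p: float, x: int) -> float:
    """P(X = x) for a binomial variable with n trials and success probability p."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)

# Probability distribution table for n = 4 trials with p = 0.5 (illustrative values).
n, p = 4, 0.5
table = {x: binomial_pmf(n, p, x) for x in range(n + 1)}
for x, prob in table.items():
    print(x, prob)
```

The probabilities in such a table should sum to 1, which is a quick sanity check on any hand calculation.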

  9. Geometric Probability Criteria A random variable is said to be geometric provided: • For each trial there are two mutually exclusive (disjoint) outcomes: success or failure • The trials are independent • The probability of success is the same for each trial of the experiment • We repeat the trials until we get a success

  10. Geometric PDF When we studied the binomial distribution, we counted the number of successes in a fixed number of trials. The geometric distribution instead addresses the number of trials needed to get the first success. If the trials are repeated k times until the first success, we will have had k – 1 failures. If p is the probability of a success and q = 1 – p the probability of a failure, the probability that the first success occurs on the kth trial (where x = k) is $P(x) = p(1 - p)^{x-1}$, $x = 1, 2, 3, \ldots$ The probability that more than n trials are needed to get the first success is $P(X > n) = q^n = (1 - p)^n$.
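The tail formula on this slide is stated without a derivation. One way to see it, assuming 0 < p ≤ 1 and using the geometric series, is sketched below; the index j is introduced here only for the summation.

```latex
\begin{align*}
P(X > n) &= \sum_{k=n+1}^{\infty} p\,q^{k-1}
          = p\,q^{n} \sum_{j=0}^{\infty} q^{j}
          = p\,q^{n} \cdot \frac{1}{1-q}
          = q^{n} = (1-p)^{n},
\end{align*}
```

since 1 – q = p. Equivalently, more than n trials are needed exactly when the first n trials are all failures.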

  11. Means and Normal Apx to Binomial • Means and Variances • Binomial • Mean: E(X) = μ = np • Variance: σ² = np(1 – p) • Geometric • Mean: E(X) = μ = 1/p • Variance: σ² = (1 – p) / p² • A Normal distribution N(μ, σ) can approximate a binomial distribution when these conditions are met • n < 0.10N, where N is the population size (sample small enough for independence) • np ≥ 10 and n(1 – p) ≥ 10 (for normality)
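A minimal Python sketch of these formulas and of the conditions check follows. The function names and the `population` parameter are assumptions made for illustration; the standard deviation is taken as the square root of the variance given on the slide.

```python
from math import sqrt

def binomial_mean_sd(n: int, p: float) -> tuple[float, float]:
    """Mean np and standard deviation sqrt(np(1 - p)) of a binomial variable."""
    return n * p, sqrt(n * p * (1 - p))

def geometric_mean_sd(p: float) -> tuple[float, float]:
    """Mean 1/p and standard deviation sqrt((1 - p) / p^2) of a geometric variable."""
    return 1 / p, sqrt(1 - p) / p

def normal_approx_ok(n: int, p: float, population: int) -> bool:
    """Slide conditions: n < 0.10N, np >= 10 and n(1 - p) >= 10."""
    return n < 0.10 * population and n * p >= 10 and n * (1 - p) >= 10

print(binomial_mean_sd(100, 0.5))
print(geometric_mean_sd(0.25))
print(normal_approx_ok(100, 0.5, 10_000))
```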

  12. TI-83 Reminders • Binomial • N: number of trials • P: probability of success • X: number of successes • Geometric • P: probability of success • X: number of trials until first success • Remember to use catalog help • PDF X = # • CDF  X ≤ # • Complement Rule for X ≥ #

  13. TI-83 Binomial Support • For P(X = k) using the calculator: 2nd VARS binompdf(n,p,k) • For P(X ≤ k) using the calculator: 2nd VARS binomcdf(n,p,k) • For P(X ≥ k) use 1 – P(X < k) = 1 – P(X ≤ k – 1), that is, 1 – binomcdf(n,p,k-1)
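For readers without a calculator at hand, the same three quantities can be sketched in Python. The functions below are stand-ins written for illustration that borrow the calculator's names and argument order; they are not the TI-83 routines themselves, and the example values n = 10, p = 0.5 are assumptions.

```python
from math import comb

def binompdf(n: int, p: float, k: int) -> float:
    """P(X = k): probability of exactly k successes in n trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

def binomcdf(n: int, p: float, k: int) -> float:
    """P(X <= k): sum of the pdf from 0 through k."""
    return sum(binompdf(n, p, i) for i in range(k + 1))

def binom_at_least(n: int, p: float, k: int) -> float:
    """P(X >= k) by the complement rule: 1 - P(X <= k - 1)."""
    return 1 - binomcdf(n, p, k - 1)

print(binompdf(10, 0.5, 3))
print(binomcdf(10, 0.5, 2))
print(binom_at_least(10, 0.5, 3))
```

Note the k – 1 in the complement: because X is discrete, P(X < k) and P(X ≤ k – 1) are the same event.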

  14. TI-83 Geometric Support • For P(X = k) using the calculator: 2nd VARS geometpdf(p,k) • For P(X ≤ k) using the calculator: 2nd VARS geometcdf(p,k) • For P(X > k) use 1 – P(X ≤ k) or (1 – p)^k
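The geometric commands can be mirrored in Python in the same spirit. Again, these are illustrative stand-ins that reuse the calculator's names, not the calculator's own code, and p = 0.5 with k = 3 is an assumed example.

```python
def geometpdf(p: float, k: int) -> float:
    """P(X = k): first success on trial k, after k - 1 failures."""
    return p * (1 - p) ** (k - 1)

def geometcdf(p: float, k: int) -> float:
    """P(X <= k): first success on or before trial k."""
    return 1 - (1 - p) ** k

def geomet_more_than(p: float, k: int) -> float:
    """P(X > k): the first k trials are all failures."""
    return (1 - p) ** k

print(geometpdf(0.5, 3))
print(geometcdf(0.5, 3))
print(geomet_more_than(0.5, 3))
```

The closed form used for `geometcdf` agrees with summing `geometpdf` from 1 through k, so either route gives the same cumulative probability.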

  15. Non AP Distributions - ID • Hypergeometric • Small population sampling without replacement • Example: drawing names out of a hat • Negative Binomial • Number of trials until the nth success • Example: number of foul shots until his 3rd successful one • Geometric is a special case of this (n = 1) • Poisson • Successes spread over a continuous interval (time or area) • Example: arrivals per minute at McD, potholes per mile on I-81

  16. Example 1a/b: Which PDF? Determine which probability distribution (Binomial, Negative Binomial, Geometric, Hyper-geometric, and Poisson) best fits the following. Use only once. a. A stats class using a bucket filled with 20 red and 20 green balls pulls a ball out of the bucket and records its color. They record the number of pulls required until they have pulled 5 green balls, and repeat the whole process 50 times. b. A stats class using a bucket filled with 20 red and 20 green balls drops the bucket and scatters the balls across the room. They record the number of balls per floor tile and repeat this 50 times.

  17. Which PDF? Determine which probability distribution (Binomial, Negative Binomial, Geometric, Hyper-geometric, and Poisson) best fits the following. Use only once. c. A stats class using a bucket filled with 20 red and 20 green balls pulls a ball out of the bucket, records its color, replaces it, and repeats until they pull a green ball. They record the number of pulls before the green ball is pulled out. d. A stats class using a bucket filled with 20 red and 20 green balls pulls a ball out of the bucket and records its color, and repeats this 50 times.

  18. Which PDF? Determine which probability distribution (Binomial, Negative Binomial, Geometric, Hyper-geometric, and Poisson) best fits the following. Use only once. e. A stats class using a bucket filled with 20 red and 20 green balls pulls 5 balls out of the bucket, records the number of red balls, and repeats this 50 times. Answers: a. Negative Binomial (pull until rth success) b. Poisson (successes over an area) c. Geometric (pulls till first success) d. Binomial (with n=1) e. Hyper-geometric (w/o replacement)

  19. Summary and Homework • Summary • Use pdf for an X = # • Use cdf for an X ≤ # • Use complement rule for X ≥ # • P(X ≥ #) = 1 – P(X < #) • P(X > #) = 1 – P(X ≤ #) • Binomial – Bernoulli with fixed # of trials • Mean: np Variance: np(1-p) • Geometric – Bernoulli until first success • Mean: 1/p Variance: (1-p)/p² • Homework: pg 556 – 59; 8.59 - 8.66

  20. Problem 1 (a) The binomial setting and the geometric setting are similar in that they both involve: 1) success or failure (mutually exclusive or binary outcomes); 2) a constant probability of success; 3) independent outcomes (trials). (b) How do the binomial and geometric settings differ? Binomial: fixed number of trials. Geometric: repeat trials until first success.

  21. Problem 2 According to the manufacturers, 13% of the M&M’s produced today are brown. (Did you know that at one time all M&M’s were brown?) Assume that all large bags of M&M’s contain 13% brown. Suppose you start taking individual candies out of a large bag, hoping for a brown one. Let X represent the number of the draw on which you get your first brown M&M. (a) On average, how many M&M’s would you expect to select in order to find a brown one? Answer: X ~ G(0.13), so E(X) = 1/p = 1/(0.13) ≈ 7.69. (b) Construct a table showing the probability distribution for X (up through X = 5). Show work for probabilities in the space below. Round probabilities to 3 decimal places. Answer, using P(X = k) = (0.13)(0.87)^(k – 1):

      X            1      2      3      4      5
      Probability  0.130  0.113  0.098  0.086  0.074

  22. Problem 2 cont Let X represent the number of the draw on which you get your first brown M&M, where 13% of M&M’s are brown, so X ~ G(0.13). (c) Construct a histogram that shows the cumulative probability distribution for X (up through X = 5). Label the height of each bar in addition to providing a scale on the vertical axis. Answer: [Figure: cumulative histogram; horizontal axis: number of trials until first brown M&M (1 to 5); vertical axis: probability, scaled from 0.1 to 0.5], with bar heights P(X ≤ x) = 1 – (0.87)^x:

      X            1     2     3     4     5
      P(X ≤ x)     0.13  0.24  0.34  0.43  0.50
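The Problem 2 quantities can be recomputed directly from the geometric formulas with p = 0.13. The short Python sketch below does this; the variable names are chosen here for illustration, and the cumulative values use the closed form 1 – (1 – p)^k rather than a sum of rounded table entries.

```python
p = 0.13  # proportion of brown M&M's stated in the problem

# (a) Expected number of draws until the first brown M&M.
expected_draws = 1 / p

# (b) Probability distribution P(X = k) for k = 1..5, rounded to 3 decimals.
pmf = {k: round(p * (1 - p) ** (k - 1), 3) for k in range(1, 6)}

# (c) Cumulative distribution P(X <= k) for k = 1..5, rounded to 3 decimals.
cdf = {k: round(1 - (1 - p) ** k, 3) for k in range(1, 6)}

print(round(expected_draws, 2))
print(pmf)
print(cdf)
```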

  23. Problem 3 When an oil company conducts exploratory oil drilling, each well is classified as a producer well or a dry well. Past experience shows that 15% of all wells drilled are producer wells. The company has plans to drill at 12 new locations. (a) What is the probability that exactly three wells will be producer wells? Be sure to provide support for your answer. Answer: X ~ B(12, 0.15); P(X = 3) = 0.1720, from binompdf(12,.15,3). (b) Calculate the probability that at least three wells will be producer wells. Be sure to provide support for your answer. Answer: X ~ B(12, 0.15); P(X ≥ 3) = 1 – P(X < 3) = 1 – P(X ≤ 2) = 0.2642, from 1 - binomcdf(12,.15,2).
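As a cross-check on the calculator answers for Problem 3, the same two probabilities can be computed from the binomial formula in Python. The helper name `well_pmf` is an assumption made for this sketch.

```python
from math import comb

n, p = 12, 0.15  # 12 wells, each a producer with probability 0.15

def well_pmf(x: int) -> float:
    """P(exactly x producer wells out of n)."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)

# (a) Exactly three producer wells.
p_exactly_3 = well_pmf(3)

# (b) At least three: complement of 0, 1, or 2 producer wells.
p_at_least_3 = 1 - sum(well_pmf(x) for x in range(3))

print(round(p_exactly_3, 4))
print(round(p_at_least_3, 4))
```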

  24. Problem 4 A seed producer claims that 95% of a certain type of seed will germinate under ideal conditions. A testing agency attempts to germinate 3000 of these seeds. (a) Give the mean and standard deviation for the number of seeds that would germinate if the producer’s claim is correct. Answer: X ~ B(3000, 0.95); mean = np = 3000(0.95) = 2850; standard deviation = √(np(1 – p)) = √(3000(0.95)(0.05)) ≈ 11.94. (b) 2830 of the testing agency’s seeds eventually germinate. Use a normal approximation to estimate the probability that 2830 or fewer seeds would germinate if the producer’s claim is correct. Show work. Answer: Check conditions: assume more than 30,000 seeds are produced (so n < 0.10N); np = 2850 ≥ 10 and n(1 – p) = 150 ≥ 10. Then X is approximately N(2850, 11.94), and P(X ≤ 2830) ≈ 0.047, from normalcdf(-E99,2830,2850,11.94).
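The Normal approximation in Problem 4 can also be sketched with Python's standard library, where `statistics.NormalDist` plays the role of the calculator's normalcdf. Variable names are illustrative, and no continuity correction is applied, matching the slide's calculation.

```python
from math import sqrt
from statistics import NormalDist

n, p = 3000, 0.95  # seeds tested and claimed germination rate

# (a) Mean and standard deviation of the binomial count.
mean = n * p
sd = sqrt(n * p * (1 - p))

# Normality conditions from the lesson: np >= 10 and n(1 - p) >= 10.
conditions_met = n * p >= 10 and n * (1 - p) >= 10

# (b) P(X <= 2830) under the approximating Normal distribution.
prob = NormalDist(mu=mean, sigma=sd).cdf(2830)

print(round(mean), round(sd, 2), conditions_met)
print(round(prob, 3))
```

A probability this small suggests that 2830 or fewer germinating seeds would be unusual if the 95% claim were true.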

