320 likes | 600 Views
STAT 110 - Section 5 Lecture 21. Professor Hao Wang University of South Carolina Spring 2012. TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A. Chapter 18 – Probability Models.
E N D
STAT 110 - Section 5 Lecture 21 Professor Hao Wang University of South Carolina Spring 2012 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA
Chapter 18 – Probability Models probability model – describes all possible outcomes and says how to assign probabilities to any collection of outcomes sample space – collection of all unique outcomes of a random circumstance event – a collection of outcomes
Coin Example Suppose you are asked to roll a die with 6 faces. What is the sample space? • Possible events are • Roll is an even number • Roll is an odd number • Roll is 5 or 6
What about a roulette Sample space ? Example of events ?
Probability Rules • Any probability is a number between 0 and 1. • So if we observe an event A then we know
Probability Rules • 2. All possible outcomes together must have probability 1. • An outcome must occur on every trial. • The sum of the probabilities for all possible outcomes must be exactly 1.
Marital Status of a Random Sample of Women • Consider the following assignment of probabilities • Marital Status of a Random Sample of Women Ages 25 to 29 Marital Status Probability Never married 0.386 Married 0.555 Widowed 0.004 Divorced 0.055
Marital Status of a Random Sample of Women • Each of the probabilities is a number between 0 and 1. The probabilities total to 1. 0.386 + 0.555 + 0.004 + 0.055 = 1 Any assignment of probabilities to all individual outcomes that satisfies Rules 1 and 2 is legitimate.
What does the probability of D need to be to make this a probability model? • P(A)=0.3 P(B)=0.2 P(C)=0.1 P(D)=? • 0.0 • 0.1 • 0.2 • 0.3 • 0.4
Which of the following is not a possible probability model: • P(A)=0.3 P(B)=0.4 P(C)=0.3 • P(A)=0.3 P(B)=0.7 • P(A)=1.0 • P(A)=0.3 P(B)=0.6 P(C)=0.2
Incoherent If a set of probabilities don’t satisfy rules 1 and 2 we say they are incoherent. This often occurs with someone’s personal probabilities in complicated situations
Probability Rules • The probability that an event does not occur is 1 minus the probability that the event does occur. • This is known as the complement rule. • Suppose that P(A) = .70 • Using this rule we can determine P(not A) • P(not A) = 1- P(A) = 1-.70 = .30 The event “not A” is known as the complement of A which can be written as
Suppose the probability of a horse winning a race is 0.85. What is the probability of the horse not winning? • A. 0.85 • B. 0.15 • C 0.7 • D 0.2
Probability Rules • If two events have no outcomes in common, the probability that one or the other occurs is the sum of their individual probabilities. • If this is true then the events are said to be disjoint. Suppose events A and B are disjoint and you know that P(A) = .40 and P(B) = .35. What is the P(A or B)?
If P(A)=0.5 and P(B)=0.4 and A and B are disjoint, then what is P(A or B)? • 0.1 • 0.2 • 0.4 • 0.5 • 0.9
The probability a student is in honors math is 0.25, the probability a student is in honors science is 0.3, and the probability a student is in both is 0.2. • What is the probability a student is in at least one honors class?
The probability it will rain Wednesday AM is 30%. The probability it will rain Wednesday PM is 30%. The probability it will rain both Wednesday AM and Wednesday PM is 10%. What is the probability it will rain on Wednesday? A) 20% D) 50% B) 30% E) 60% C) 40%
Rule 5. Multiplication Rule: If two events, A and B, are independent then P(A and B) = P(A)P(B) • Independence means that the occurrence of event A does not affect the occurrence of event B • What is the probability that the first roll of a die is even, and the second roll is odd? • A – 1/36 • B – 1/12 • C – 1/4 • D – 1/2
You have data on returns on common stocks for all years since 1945. To show clearly how returns have changed over time, your best choice of graph is a A line graph B bar graph C pie chart D histogram
In our class poll, information was collected on your height in inches. This is an example of a ____________ variable. A categorical B quantitative
9) This data set is best described as: a) Skewed Left b) Symmetric c) Skewed Right d) Bimodal
10) The mean of this data set is: a) Approximately equal to the median b) Greater than the median c) Less than the median d) Can’t tell from the picture
Questions 14-15 are based on the data set: 4 2 6 5 12 3 9 14) The median is: a) 5.0 b) 6.0 c) 7.0 d) 8.0 e) 9.0
Questions 14-15 are based on the data set: 4 2 6 5 12 3 9 15) The third quartile (Q3) is: a) 2.0 b) 3.0 c) 5.0 d) 7.5 e) 9.0
16) A list of 20 exam scores range from 64 to 98. If a typo was made and the 64 was entered as a 4, then a) The mean would become smaller and the median would stay the same b) The mean would stay the same and the median would become smaller c) Both the mean and the median would stay the same d) Both the mean and the median would become smaller
20) For a standard normal distribution, what can we say about the percentage of the data is within two standard deviations of the mean? a) 68% b) at least 75% c) at least 88.9% d) 95% e) Can’t tell from the given information
Questions 23-24 refer to the heights of a group of men that are approximately normally distributed with a mean of 70 inches and a standard deviation of 3 inches. 23) Approximately what percentage of men in this group are between 73 inches and 76 inches? a) 13.5% b) 27% c) 34% d) 47.5% e) 81.5%
Questions 23-24 refer to the heights of a group of men that are approximately normally distributed with a mean of 70 inches and a standard deviation of 3 inches. 24) What percentile would someone in this group who was 67 inches tall be? a) 0.15% b) 2.5% c) 16% d) 50% e) 84%
25) Physics classes are notorious for having very low grades on their exams (that get curved at the end of the semester). The first exam’s scores in one semester were approximately normally distributed with a mean of 40 and a standard deviation of 10. The second exam’s scores were approximately normally distributed with a mean of 30 and a standard deviation of 5. Use the z-scores = to determine which would be a better score, a 50 on exam 1 or a 40 on exam 2? a) 50 on exam 1 b) 40 on exam 2 c) The two scores are equivalent
Suppose an algebra professor found that the correlation between study time (in hours) and exam score (out of 100) is +.80, and the regression line was found to be y = 20 + 4x. He arrived at this equation through years of collecting data on his students, all of whom reported studying anywhere from 0 to 20 hours for his exams. For which values of study time does the professor’s regression equation make sense in terms of predicting exam scores? a. Between 0 and 20 hours. b. Between 0 and 100 hours. c. Anything greater than or equal to 0 hours. d. It is not possible to predict exam score with study time.