MGMT 276: Statistical Inference in Management Spring , 2013

MGMT 276: Statistical Inference in ManagementSpring, 2013 Welcome

Statistical Inference in Management Instructor:Suzanne Delaney, Ph.D. Office:405 “N” McClelland Hall Phone:621-2045 Email:delaney@u.arizona.edu Office hours:2:00 – 3:30Mondays and Fridays and by appointment

Please read before our next exam (March 19th) - Chapters 5 - 11 in Lind book - Chapters 10, 11, 12 & 14 in Plous book: Lind Chapter 5: Survey of Probability Concepts Chapter 6: Discrete Probability Distributions Chapter 7: Continuous Probability Distributions Chapter 8: Sampling Methods and CLT Chapter 9: Estimation and Confidence Interval Chapter 10: One sample Tests of Hypothesis Chapter 11: Two sample Tests of Hypothesis Plous Chapter 10: The Representativeness Heuristic Chapter 11: The Availability Heuristic Chapter 12: Probability and Risk Chapter 14: The Perception of Randomness

Use this as your study guide By the end of lecture today2/28/13 Confidence Intervals Logic of hypothesis testing Steps for hypothesis testing Levels of significance (Levels of alpha) what does p < 0.05 mean? what does p < 0.01 mean?

Homework due – Tuesday (March 5th) On class website: Please print and complete homework worksheet #11 Dan Gilbert Reading and the Law of Large Numbers Homework due – Thursday (March 7th) On class website: Please print and complete homework worksheet #12 Confidence intervals and Type I versus Type II Errors

Please click in My last name starts with a letter somewhere between A. A – D B. E – L C. M – R D. S – Z

Review of Homework Worksheetjust in case of questions

Homework review Based on data (Percent of stocks that meet reach or exceed target price on first day) Based on expert opinion - don’t have previous data for these two companies merging together Based on data (Percent of rockets that successfully launch) Based on apriori probability – not previous experience and not data-driven

Homework review Based on expert opinion (experience of experts), but not actual percent of space stations that have actually been critically damaged by debris. Based on actual data (percent of results that are fake pages)

. .8276 .1056 .2029 .1915 .3944 .4332 .3944 .3944 55 55 55 52 44 50 50 44 - 50 4 52 - 50 4 -1.5 +.5 = = 55 - 50 4 +1.25 = z of 1.5 = area of .4332 z of 1.5 = area of .1915 1.25 = area of .3944 55 - 50 4 55 - 50 4 +1.25 +1.25 = = .5000 - .3944 = .1056 z of 1.25 = area of .3944 z of 1.25 = area of .3944 .4332 +.3944 = .8276 .3944 -.1915 = .2029

.3264 Homework review .2152 .5143 .1255 .3888 .1736 .1736 .3888 3,000 3,500 2,500 3,500 3,000 2500 - 2708 650 3000 - 2708 650 3000 - 2708 650 -.32 = 0.45 0.45 = = z of -0.32 = area of .1255 z of 0.45 = area of .1736 z of 0.45 = area of .1736 3500 - 2708 650 3500 - 2708 650 1.22 = 1.22 = .5000 - .1736 = .3264 z of 1.22 = area of .3888 z of 1.22 = area of .3888 .3888 +.1255= .5143 .3888 - .1736 = .2152

.0764 Homework review .9236 .1185 .4236 .4236 .4236 .3051 10 12 20 20 10 - 15 3.5 -1.43 = 20 - 15 3.5 20 - 15 3.5 1.43 1.43 = = z of -1.43 = area of .4236 z of 1.43 = area of .4236 z of 1.43 = area of .4236 12 - 15 3.5 -0.86 = .5000 + .4236 = .9236 .5000 - .4236 = .0764 z of -.86 = area of .3051 .4236 – .3051 = .1185

Problem with point estimate Mean kids IQ of 100. Mean income of $35,000 a year. Mean weight 7 pounds. Are we right always? - no How close is our estimation? - what other information about these distributions would we want to know? Variability! Which of these distributions would allow our guess to be closest to what’s right?

Standard Error of the Mean (SEM) Remember confidence intervals? Revisit Confidence Intervals Confidence Intervals (based on z): We are using this to estimate a value such as a population mean, with a known degree of certainty with a range of values • The interval refers to possible values of the population mean. • We can be reasonably confident that the population mean • falls in this range (90%, 95%, or 99% confident) • In the long run, series of intervals, like the one we • figured out will describe the population mean about 95% • of the time. Greater confidence implies loss of precision.(95% confidence is most often used) Can actually generate CI for any confidence level you want – these are just the most common

Confidence Intervals (based on z): A range of values that, with a known degree of certainty, includes an unknown population characteristic, such as a population mean • How can we make our confidence interval smaller? • Increase sample size (This will decrease variability) • Decrease variability through more careful assessment • and measurement practices (minimize noise) . • Decrease level of confidence 95% 95%

? ? Mean = 50Standard deviation = 10 Find the scores for the middle 95% 95% x = mean ± (z)(standard deviation) 30.4 69.6 .9500 Please note: We will be using this same logic for “confidence intervals” .4750 .4750 ? 1) Go to z table - find z score for for area .4750 z = 1.96 2) x = mean + (z)(standard deviation) x = 50 + (-1.96)(10) x = 30.4 30.4 3) x = mean + (z)(standard deviation) x = 50 + (1.96)(10) x = 69.6 69.6 Scores 30.4 - 69.6 capture the middle 95% of the curve

? ? Mean = 50Standard deviation = 10 n = 100 s.e.m. = 1 Confidence intervals σ 95% standard error of the mean = Find the scores for the middle 95% n √ 48.04 51.96 For “confidence intervals” same logic – same z-score But - we’ll replace standard deviation with the standard error of the mean .9500 .4750 .4750 ? 10 = 100 √ x = mean ± (z)(s.e.m.) x = 50 + (1.96)(1) x = 51.96 x = 50 + (-1.96)(1) x = 48.04 95% Confidence Interval is captured by the scores 48.04 – 51.96

mean = 121 standard deviation= 15 n = 25 σ standard error of the mean = Find a 95% Confidence Interval for this distribution n √ 100 110 120 130 140 raw score = mean + (z score)(standard error) 15 = = 3 √ 25 raw score = mean ± (z score)(sem) Please notice: We know the standard deviation and can calculate the standard error of the mean from it X = 121 ± (1.96)(3) = 121 ± 5.88 (115.12, 126.88) x = x ± (z)(σx) confidence interval

Confidence intervals ? ? σ standard error of the mean 95% = n √ Mean = 50 Standard error mean = 10 Hint always draw a picture! Tell me the scores associated that border exactly the middle 95% of the curve We know this raw score = mean ± (z score)(standard deviation) Construct a 95 percent confidence interval around the mean Similar, but uses standard error the mean raw score = mean ± (z score)(standard error of the mean)

Confidence Interval of 95%Has and alpha of 5%α = .05 Confidence Interval of 99% Has and alpha of 1% α = .01 99% Area outside confidence interval is alpha 95% Area in the tails is called alpha 90% Confidence Interval of 90% Has and alpha of 10% α = . 10 Area associated with most extreme scores is called alpha

Measurements that occur within the middle part of the curve are ordinary (typical) and probably belong there Area outside confidence interval is alpha Area outside confidence interval is alpha Moving from descriptive stats into inferential stats…. 99% 95% Measurements that occur outside this middle ranges are suspicious, may be an error or belong elsewhere 90%

How do we know if something is going on?How rare/weird is rare/weird enough? Every day examples about when is weird, weird enough to think something is going on? • Handing in blue versus white test forms • Psychic friend – guesses 7 out of 10 coin tosses right • Cancer clusters – how many cases before investigation • Weight gain treatment – one group gained an average of 1 pound more than other group…what if 10?

Why do we care about the z scores that define the middle 95% of the curve?Inferential Statistics Hypothesis testing with z scores allows us to make inferences about whether the sample mean is consistent with the known population mean. • Is the mean of my observed sample consistent with the • known population mean or did it come from some other • distribution?

Why do we care about the z scores that define the middle 95% of the curve? If the z score falls outside the middle 95% of the curve, it must be from some other distribution Main assumption: We assume that weird, or unusual or rare things don’t happen If a score falls out into the 5% range we conclude that it “must be” actually a common score but from some other distribution That’s why we care about the z scores that define the middle 95% of the curve

. Main assumption: We assume that weird, or unusual or rare things don’t happen I’m not an outlier I just haven’t found my distribution yet If a score falls out into the tails (low probability) we conclude that it “must be” a common score from some other distribution

. .. Reject the null hypothesis Relative to this distribution I am unusual maybe even an outlier 95% X Relative to this distribution I am utterly typical 95% X Support for alternative hypothesis

Rejecting the null hypothesis . notnull null big z score x x • If the observed z falls beyond the critical z in the distribution (curve): • then it is so rare, we conclude it must be from some other distribution • then we reject the null hypothesis • then we have support for our alternative hypothesis Alternative Hypothesis • If the observed z falls within the critical z in the distribution (curve): • then we know it is a common score and is likely to be part of this distribution, • we conclude it must be from this distribution • then we do not reject the null hypothesis • then we do not have support for our alternative . null x x small z score

Rejecting the null hypothesis • If the observed z falls beyond the critical z in the distribution (curve): • then it is so rare, we conclude it must be from some other distribution • then we reject the null hypothesis • then we have support for our alternative hypothesis • If the observed z falls within the critical z in the distribution (curve): • then we know it is a common score and is likely to be part of this distribution, • we conclude it must be from this distribution • then we do not reject the null hypothesis • then we do not have support for our alternative hypothesis

How do we know how rare is rare enough? Area in the tails is alpha α = .01 α = .10 α = .05 99% Level of significance is called alpha (α) • The degree of rarity required for an observed outcome • to be “weird enough” to reject the null hypothesis • Which alpha level would be associated with most “weird” or rare scores? 95% Critical z: A z score that separates common from rare outcomes and hence dictates whether the null hypothesis should be retained (same logic will hold for “critical t”) 90% If the observed z falls beyond the critical z in the distribution (curve) then it is so rare, we conclude it must be from some other distribution

Rejecting the null hypothesis • The result is “statistically significant” if: • the observed statistic is larger than the critical statistic (which can be a ‘z” or “t” or “r” or “F” or x2) • observed stat > critical stat If we want to reject the null, we want our t (or z or r or F or x2) to be big!! • the p value is less than 0.05 (which is our alpha) • p < 0.05 If we want to reject the null, we want our “p” to be small!! • we reject the null hypothesis • then we have support for our alternative hypothesis

Confidence Interval of 95%Has and alpha of 5%α = .05 Critical z 2.58 Critical z -2.58 Confidence Interval of 99% Has and alpha of 1% α = .01 99% Area in the tails is called alpha Critical z 1.96 Critical z -1.96 95% Critical Z separates rare from common scores 90% Critical z 1.64 Critical z -1.64 Confidence Interval of 90% Has and alpha of 10% α = . 10

Measurements that occur within the middle part of the curve are ordinary (typical) and probably belong there For scores that fall into the middle range, we do not reject the null Moving from descriptive stats into inferential stats…. Critical z 1.64 Critical z -1.64 90% 5% 5% Measurements that occur outside this middle ranges are suspicious, may be an error or belong elsewhere For scores that fall into the regions of rejection, we reject the null What percent of the distribution will fall in region of rejection Critical Values http://today.msnbc.msn.com/id/33411196/ns/today-today_health/ http://www.youtube.com/watch?v=0r7NXEWpheg

Rejecting the null hypothesis • The result is “statistically significant” if: • the observed statistic is larger than the critical statistic • observed stat > critical stat If we want to reject the null, we want our t (or z or r or F or x2) to be big!! • the p value is less than 0.05 (which is our alpha) • p < 0.05 If we want to reject the null, we want our “p” to be small!! • we reject the null hypothesis • then we have support for our alternative hypothesis A note on decision making following procedure versus being right relative to the “TRUTH”

. Decision making: Procedures versus outcome Best guess versus “truth” What does it mean to be correct? • Why do we say: • “innocent until proven guilty” • “not guilty” rather than “innocent” • Is it possible we got a verdict wrong?

. The null hypothesis is typically that something is not present, that there is no effect, that there is no difference between population and sample or between treatment and control. Null Hypothesis A measure of sickness people taking drugpeople not taking drug (There are two distributions here, they are just on top of each other) (overlapping) people taking drug people not taking drug A measure of sickness A measure of sickness Null is FALSE Null is TRUE Drug does have effect Something going on Nothing going on No effect of drug There is no difference between the groups There is a difference between the groups

Remember: “procedure” vs “TRUTH” . (There are two distributions here, they are just on top of each other) (overlapping) A measure of sickness people taking drug people not taking drug people taking drugpeople not taking drug A measure of sickness A measure of sickness Null is FALSE Null is TRUE Score should fall in this region critical stat critical stat critical stat critical stat Score should fall in one of these regions Score should fall in one of these regions Null is TRUE Null is FALSE No effect of drug Nothing going on Drug does have effect Something going on

. Two ways to be right: Status of Null Hypothesis(actually, via magic truth-line) True Ho False Ho Do notReject Ho Decision madeby experimenter Reject Ho 1. “Reject a false null hypothesis” “there really is something going on” 2. “Do not reject a true null hypothesis” “there really is no difference between groups” You are right! Correct decision You are right! Correct decision

. Two ways to be wrong: Status of Null Hypothesis(actually, via magic truth-line) True Ho False Ho Do notReject Ho Decision madeby experimenter Reject Ho 1. “Reject a true null hypothesis” say there’s a difference when there’s not (Type I)The score fell in the tails but the null was actually “TRUE” 2. “Do not reject a false null hypothesis” say there really is no difference between groupswhen there really is (Type II) The score fell in the middle but the null was still “FALSE” You are wrong! Type II error(miss) You are wrong! Type I error(false alarm)

Possible outcomes of hypothesis test Status of Null Hypothesis(actually, via magic truth-line) True Ho False Ho Do notReject Ho Decision madeby experimenter Reject Ho You are wrong! Type II error(miss) You are right! Correct decision You are wrong! Type I error(false alarm) You are right! Correct decision • Probability of rejecting a true null hypothesis = alpha • The alpha you choose becomes the probability of • making a Type I error

. What’s worse – Type I or Type II error? Status of Null Hypothesis(actually, via magic truth-line) True Ho False Ho Do notReject Ho Decision madeby experimenter Reject Ho . You are wrong! Type II error(miss) You are right! Correct decision You are wrong! Type I error(false alarm) You are right! Correct decision

We make decisions at Security Check Points . .

. Type I or Type II error? . Does this airline passengerhave a snow globe? Null Hypothesis means she does not have a snow globe(that nothing unusual is happening) – Should we reject it???!! As detectives, do we accuse her of brandishing a snow globe?

. Does this airline passenger have a snow globe? Status of Null Hypothesis(actually, via magic truth-line) Are we correct or have we made a Type I or Type II error? False Ho Yes snow globe True Ho No snow globe You are wrong! Type II error(miss) Do not reject Ho“no snow globe move on” You are right! Correct decision Decision madeby experimenter You are wrong! Type I error(false alarm) Reject Ho “yes snow globe, stop!” You are right! Correct decision Note: Null Hypothesis means she does not have a snow globe (that nothing unusual is happening) – Should we reject it???!!

. Type I or type II error? True Ho False Ho You are right! Correct decision You are wrong! Type II error(miss) Do notReject Ho Decision madeby experimenter You are wrong! Type I error(false alarm) You are right! Correct decision Reject Ho Does this airline passenger have a snow globe? • Two ways to be correct: • Say she does have snow globe when she does have snow globe • Say she doesn’t have any when she doesn’t have any • Two ways to be incorrect: • Say she does when she doesn’t (false alarm) • Say she does not have any when she does (miss) Which is worse? What would null hypothesis be? This passenger does not have any snow globe Type I error: Rejecting a true null hypothesis Saying the she does have snow globe when in fact she does not (false alarm) Type II error: Not rejecting a false null hypothesis Saying she does not have snow globe when in fact she does (miss)

. Type I or type II error True Ho False Ho You are right! Correct decision You are wrong! Type II error(miss) Do notReject Ho Decision madeby experimenter You are wrong! Type I error(false alarm) You are right! Correct decision Reject Ho Does advertising affect sales? • Two ways to be correct: • Say it helps when it does • Say it does not help when it doesn’t help Which is worse? • Two ways to be incorrect: • Say it helps when it doesn’t • Say it does not help when it does What would null hypothesis be? This new advertising has no effect on sales Type I error: Rejecting a true null hypothesis Saying the advertising would help sales, when it really wouldn’t help people (false alarm) Type II error: Not rejecting a false null hypothesis Saying the advertising would not help when in fact it would (miss)

. What is worse a type I or type II error? True Ho False Ho You are right! Correct decision You are wrong! Type II error(miss) Do notReject Ho Decision madeby experimenter You are wrong! Type I error(false alarm) You are right! Correct decision Reject Ho What if we were lookingat a new HIV drug that had no unpleasant side affects • Two ways to be correct: • Say it helps when it does • Say it does not help when it doesn’t help • Two ways to be incorrect: • Say it helps when it doesn’t • Say it does not help when it does Which is worse? What would null hypothesis be? This new drug has no effect on HIV Type I error: Rejecting a true null hypothesis Saying the drug would help people, when it really wouldn’t help people (false alarm) Type II error: Not rejecting a false null hypothesis Saying the drug would not help when in fact it would (miss)

. Type I or type II error Which is worse? What if we were looking to see if there is a fire burning in an apartment building full of cute puppies • Two ways to be correct: • Say “fire” when it’s really there • Say “no fire” when there isn’t one • Two ways to be incorrect: • Say “fire” when there’s no fire (false alarm) • Say “no fire” when there is one (miss) What would null hypothesis be? No fire is occurring Type I error: Rejecting a true null hypothesis (false alarm) Type II error: Not rejecting a false null hypothesis (miss)

. Type I or type II error Which is worse? What if we were looking to see if an individualwere guilty of a crime? • Two ways to be correct: • Say they are guilty when they are guilty • Say they are not guilty when they are innocent • Two ways to be incorrect: • Say they are guilty when they are not • Say they are not guilty when they are What would null hypothesis be? This person is innocent - there is no crime here Type I error: Rejecting a true null hypothesis Saying the person is guilty when they are not (false alarm) Sending an innocent person to jail (& guilty person to stays free) Type II error: Not rejecting a false null hypothesis Saying the person in innocent when they are guilty (miss) Allowing a guilty person to stay free

Thank you! See you next time!!

MGMT 276: Statistical Inference in Management Spring , 2013