330 likes | 543 Views
Lecture 8 . Matched Pairs Review Summary The Flow approach to problem solving Example. Announcements. Extra office hours: Today, after class – 1:00. Sunday, 2-5 Monday 9-11:30. I also should be in my office most of the afternoon after 2:15. The Matched Pairs Experiment. Example 13.04
E N D
Lecture 8 • Matched Pairs • Review • Summary • The Flow approach to problem solving • Example
Announcements Extra office hours: • Today, after class – 1:00. • Sunday, 2-5 • Monday 9-11:30. I also should be in my office most of the afternoon after 2:15
The Matched Pairs Experiment • Example 13.04 • It was suspected that salary offers were affected by students’ GPA, (which caused S12 and S22 to increase). • To reduce this variability, the following procedure was used: • 25 ranges of GPAs were predetermined. • Students from each major were randomly selected, one from each GPA range. • The highest salary offer for each student was recorded. • From the data presented can we conclude that Finance majors are offered higher salaries?
Adminstrative Info for Midterm • Location: Steinberg Hall-Dietrich Hall 351 • Time: Monday (Feb. 10th), 6:00-8:00 p.m. • Closed book. • Allowed one double-sided 8.5 x 11 note sheet • Bring calculator.
Matched Pairs => One-Sample Test • After taking differences of observations within each pair, one can continue with a one-sample test with
Finance Marketing The matched pairs hypothesis test • Solution 13.04 (by hand) • The parameter tested is mD (=m1 – m2) • The hypotheses:H0: mD = 0H1: mD > 0 • The t statistic: The rejection region is t > t.05,25-1 = 1.711 Degrees of freedom = nD – 1
The matched pairs hypothesis test • Solution 13.04 • Calculate t
The matched pairs hypothesis test Conclusion: There is sufficient evidence to infer at 5% significance level that the Finance MBAs’ highest salary offer is, on the average, higher than that of the Marketing MBAs.
Matched Pairs vs. Independent Samples • Potential advantage of matched pairs: The variance of can be much less than the variance of if the variable(s) on which the pairs are matched account for much of the variation within each group. • However, if the variance of is close to the variance of , the independent samples design will produce a more powerful test because it has almost twice as many degrees of freedom.
Recognizing Matched Pairs Experiments • Does there exist some natural relationship between the first pairs of observations that makes it more appropriate to compare the first pair than the first observation in group 1 and the second observation in group 2? • Before and after designs • Example: A researcher for OSHA wants to see whether cutbacks in enforcement of safety regulations coincided with an increase in work related accidents. For 20 industrial plants, she has number of accidents in 1980 and 1995.
Examples • Example: A scientist claims that tomatoes will grow large if they are played soft, soothing music. To test the claim, 10 tomatoes grown without music and 10 tomatoes grown with music are randomly selected. The tomatoes are weighed in ounces. • Example: Many Americans use tax preparation companies to prepare their taxes. In order to investigate whether there are are any differences between companies, an experiment was conducted in which two of the largest companies were asked to prepare the tax returns of a sample of 55 taxpayers. The amounts of tax payable for each taxpayer at the two companies were recorded.
Caveat • Keep in mind the difference between observational and experimental data in terms of what can be inferred. • Does the study of finance/marketing majors in Example 13.4 show that students educated in finance are more attractive to prospective employers?
Additional Example-Problem 13.75 Tire example contd. (Problem 13.49) Suppose now we redo the experiment: On 20 randomly selected cars, one of each type of tire is installed on the rear wheels and, as before, the cars are driven until the tires wear out. The number of miles(in 1000s) is stored in Xr13- 75. Can we conclude that the new tire is superior?
A Review of Chapters 12 and 13 • Summary of techniques seen • Here (Chapter 14) we build a framework that helps decide which technique (or techniques) should be used in solving a problem. • Logical flow chart of techniques for Chapters 12 and 13 is presented next.
Summary of statistical inference:Chapters 12 and 13 • Problem objective: Describe a population. • Data type: Interval • Descriptive measurement: Central location • Parameter: • Test statistic: • Interval estimator: • Required condition: Normal population
Summary - continued • Problem objective: Describe a population. • Data type: Interval • Descriptive measurement: Variability. • Parameter: s2 • Test statistic: • Interval estimator: • Required condition: normal population.
Summary - continued • Problem objective: Describe a population. • Data type: Nominal • Parameter: p • Test statistic: • Interval estimator: • Required condition:
d.f. = n1 + n2 -2 Summary - continued • Problem objective: Compare two populations. • Data type: Interval • Descriptive measurement: Central location • Experimental design: Independent samples • population variances: • Parameter: m1 - m2 • Test statistic:Interval estimator: • Required condition: Normal populations
Summary - continued • Problem objective: Compare two populations. • Data type: Interval. • Descriptive measurement: Central location • Experimental design: Independent samples • population variances: • Parameter: m1 - m2 • Test statistic:Interval estimator: • Required condition: Normal populations
Summary - continued • Problem objective: Compare two populations. • Data type: Interval. • Descriptive measurement: Central location • Experimental design: Matched pairs • Parameter: mD • Test statistic:Interval estimator: • Required condition: Normal differences d.f. = nD - 1
Describe a population Compare two populations Nominal Interval Nominal Interval Z test & estimator of p Variability Central location Variability Central location t- test & estimator of m c2- test & estimator of s2 Continue Continue Problem objective? Data type? Data type? Z test & estimator of p1-p2 Type of descriptive measurement? Type of descriptive measurement? Experimental design? F- test & estimator of s12/s22
Experimental design? Independent samples Matched pairs Population variances? Equal Unequal t- test & estimator of m1-m2 (Unequal variances) t- test & estimator of m1-m2 (Equal variances) t- test & estimator of mD Continue Continue
Identifying the appropriate technique • Example 14.1 • Is the antilock braking system (ABS) really effective? • Two aspects of the effectiveness were examined: • The number of accidents. • Cost of repair when accidents do occur. • An experiment was conducted as follows: • 500 cars with ABS and 500 cars without ABS were randomly selected. • For each car it was recorded whether the car was involved in an accident. • If a car was involved with an accident, the cost of repair was recorded.
Identifying the appropriate technique • Example – continued • Data • 42 cars without ABS had an accident, • 38 cars equipped with ABS had an accident • The costs of repairs were recorded (see Xm14-01). • Can we conclude that ABS is effective?
Identifying the appropriate technique • Solution • Question 1: Is there sufficient evidence to infer that the cost of repairing accident damage in ABS equipped cars is less than that of cars without ABS? • Question 2: How much cheaper is it to repair ABS equipped cars than cars without ABS?
Question 2: Compare the mean repair costs per accident • Solution Problem objective? Describing a single population Compare two populations Data type? Cost of repair per accident Nominal Interval Type of descriptive measurements? Variability Central location
Population variances equal? Equal t- test & estimator of m1-m2 (Equal variances) Question 2: Compare the mean repair costs per accident Central location • Solution - continued Experimental design? Independent samples Matched pairs Run the F test for the ratio of two variances. Equal Unequal
Question 2: Compare the mean repair costs per accident • Solution – continued • m1 = mean cost of repairing cars without ABSm2 = mean cost of repairing cars with ABS • The hypotheses tested H0: m1 – m2 = 0 H1: m1 – m2 > 0 • For the equal variance case we use
Do not reject H0. There is insufficient evidence to conclude that the two variances are unequal. Question 2: Compare the mean repair costs per accident • Solution – continued • To determine whether the population variances differ we apply the F test • From JMP we have (Xm14-01)
Question 2: Compare the mean repair costs per accident • Solution – continued • Assuming the variances are really equal we run the equal-variances t-test of the difference between two means At 5% significance levelthere is sufficient evidenceto infer that the cost of repairsafter accidents for cars with ABS is smaller than the cost of repairs for cars without ABS.
Checking required conditions • The two populations should be normal (or at least not extremely nonnormal)
Question 3: Estimate the difference in repair costs • Solution • Use Estimators Workbook: t-Test_2 Means (Eq-Var) worksheet We estimate that the cost of repairing a car not equipped with ABS is between $71 and $651 more expensive than to repair an ABS equipped car.