430 likes | 568 Views
Planned Contrasts and Data Management. Class 19. QUIZ 3 ON THURSDAY, DEC. 5 Covers: Two-way ANOVA through Moderated Multiple Regression. Variance for All Factors. Subject Variance. Factor A X Factor B (Birth X Gender). df A X B = (a –1) (b – 1). df s/AB = ab(s – 1).
E N D
QUIZ 3 ON THURSDAY, DEC. 5 Covers: Two-way ANOVA through Moderated Multiple Regression
Variance for All Factors Subject Variance Factor A X Factor B (Birth X Gender) dfA X B = (a –1) (b – 1) dfs/AB = ab(s – 1) dfTotal = abs – 1 (2-1) x (2-1) = 1 dfs/AB = n - ab dfTotal = n – 1 10 – 1 = 9 10 – (2 x 2) = 6 Degrees of Freedom in 2-Way ANOVA Between Groups Factor A (Birth Order) df A = a - 1 2 – 1 = 1 Factor B (Gender) df B = b – 1 2 – 1 = 1 Interaction Effect Error Effect Total Effect
Conceptualizing Degrees of Freedom (df) in Factorial ANOVA Birth Order Gender Youngest Oldest Sum 9.00 Males 4.50 4.50 11.00 Females 5.50 5.50 Sum 10.00 10.00 20.00 NOTE: “Fictional sums” for demonstration.
Conceptualizing Degrees of Freedom (df) in Factorial ANOVA Factor A Factor B a1 a2 a3 Sum b1 # # X B1 b2 # # XB2 b3X X X X Sum A1 A2X T A, B, # = free to vary; T has been computed X = determined by A,B, #s Once A, B, # are established, Xs are known
Analysis of Variance Summary Table: Two Factor (Two Way) ANOVA A SSA a - 1 SSA dfA MSA MSS/AB B SSB b - 1 SSb dfb MSB MSS/AB A X B SSA X B (a - 1)(b - 1) SSAB dfA X B MSA X B MSS/AB Within (S/AB) SSS/A ab (s- 1) SSS/AB dfS/AB Total SST abs - 1 Source of Variation Sum of Squares df Mean Square F Ratio (SS) (MS)
Mean Men Mean Women Sum of Sqrs. Betw'n dt Betw'n MS Betw'n Sum of Sqrs. Within df Within MS Within F p One Way 4.78 3.58 3.42 1 3.42 22.45 8 2.81 1.22 .30 Two Way 4.78 3.58 3.42 1 3.42 5.09 6 .85 4.03 .09 Effect of Multi-Factorial Design on Significance Levels
Source Sum of Squares df Mean Square F Sig. Source Gender 3.42 Sum of Squares df 1 Mean Square 3.42 F 4.03 .09 Sig. Birth Order 16.02 1 16.02 18.87 .005 Gender 3.42 1 3.42 1.22 .34 Interaction 3.75 1 3.75 4.42 .08 Error 22.45 8 2.81 Error 5.09 6 0.85 Total 9 ONEWAY ANOVA AND GENDER MAIN EFFECT TWO-WAY ANOVA AND GENDER MAIN EFFECT Oneway F: 3.42 = 1.22 Twoway F: 3.42 = 4.42 2.81 .85
Topics Covered Today 1. Planned Contrasts 2. Analysis of Residual Variance 3. Post-hoc tests 4. Data Management a. Setting up data files b. Cleaning data
"Pop" Culture: Gun Support as a Function of Political Party and Gender
Support of Gun Control: Which Party? Which Gender? How much do you support handgun instruction in school? 1 2 3 4 5 We predict: GOP Men? GOP Women? Dem Men? Dem. Women? 5 5 5 2
Planned Contrast: Function 1. Factorial ANOVA tests for orthogonal (perpendicular) interactions. 2. Some studies predict non-orthogonal interactions. 3. Planned contrast provides more predictive power to confirm non-orthogonal contrasts of any particular shape (“wedge”, “arrow” [like above] or other).
Planned Contrast: Execution (Conceptual) 1. Must predict pattern of interaction before gathering data. Predict that Democratic women will be most opposed to gun instruction in school, compared to Democratic men, Republican men, and Republican women.
Convert Separate Factors into Single Factor 1. Two separate factors pol.party 1) GOP 2) Democrat gender 1) Male 2) Female • Convert the two separate factors into a single factor genparty 1) Male Republican 2) Male Democrat 3) Female Republican 4) Female Democrat
Convert Separate Factors into Single Factor SPSS Syntax (commands) genparty 1 = Male Republican 2 = Male Democrat 3 = Female Republican 4 = Female Democrat
Political Party Male Female Republican 5.00 4.75 Democrat 4.50 2.75 Converting Multi-factors into Single Factor for Planned Contrast Gender Converted into single factor with four levels GENPARTY 1 = Male/Republican 5.00 2 = Male/Democrat 4.50 3 = Female/Republican 4.75 4 = Female/Democrat 2.75
Planned Contrast: Execution (Conceptual) 3. Conduct one-way ANOVA, with new single variable as predictor. 4. Assign weights to the four levels, as follows: 1) Male Republican -1 2) Male Democrat -1 3) Female Republican -1 4) Female Democrat 3 * Weights indicate which sub-groups are to be compared. * Weights must add up to zero 5. Planned contrast then limits comparison to the indicated groups, but “counts” all subjects in terms of degrees of freedom and computation of error. This provides greater predictive power. This is even true if weight for some group(s) set at zero.
Graph of Gender X Political Party and Opposition to Gun Instruction in School
Univariate Analysis of Variance [DataSet1] Orthogonal Interaction
Planned Contrast, Page 1 Note: This is ANOVA p value, NOT contrast p value
Contrast Tests Std. Error Contrast value Sig. (2 –tailed) df t Contrast Assumes eq. var. -6.000 1.768 -3.394 .005 12 Doesn’t assume eq. var. -6.000 1.696 -3.539 5.501 .014 Planned Contrast, Page 2
Predicted this Obtained this “Quality Control” for Planned Contrast Issue: Planned contrast can be a very “liberal” test, confirming patterns that don’t closely fit with actual predictions. Result of this –1, -1, -1, + 3 planned contrast might still be significant How to assess the “quality” of a significant planned contrast?
Analysis of Residual Variance Logic of test: Did (Between groups effect – Contrast effect) leave significant amount of systematic (non-random) variance unexplained? If so, then the contrast did not do a good job. It did not explain the outcome fully. However, if “what’s left over” (i.e., between effect – contrast) is not significant, then the contrast accounts for most of the treatment. In this case, the contrast did do a good job.
Contrast Should “Absorb” Most of Between-Groups Effect ═ ─ Between Groups Variance Contrast Effect Remaining Variance
Steps in Analysis of Residual Variance Test • Get SPSS printout of planned contrast • Get t of contrast, square it to get contrast F (t = F ) • Compute SS contrast (SSc): Multiply contrast F by mean sq. w/n (MSw) of oneway ANOVA. This results in SS contrast (SSc). • Compute SS residuals (SSr): Get SS between (SSb) from oneway, and subtract SSc. (SSb – SSc) = SS residuals (SSr) • Compute MS contrast (MSc): Divide SSr by df, which is (oneway df – planned contrast df). This produces the MS contrast (MSc) • Compute F residuals: Divide MSc by MSw. MSc/MSw = F residuals • Compute df for F resid: numerator df = (df oneway – df contrast; see 5, above), denominator df = df within (from oneway). • Check this F in F table from any stats book. If significant, contrast is not a good fit. If not significant, the contrast is a good fit.
Residuals Analysis Test 1. Get SPSS printout of planned contrast 2. F of contrast(Fcont) = t2; t = -3.39 t2= 11.49 3.SScontrast (SScont):F cont XMSw = 11.49 X 1.04 = 11.95 4. SSresiduals (SSres): SSbetween (SSb) = 12.50 SSb – SScont = 12.50 – 11.95 = .55 5a. Contrast df = df oneway – df contrast = 15 -12 = 3 5b. MScontrast (MScont) = SSres / contrast df = .55/3 = 0.18 • F residuals (Fresid): Divide MScont by MSw = 0.18/1.04 = .17 • DF for Fresid= df contrast (see 5a, above), df within: (3, 12) 8. F table at (3, 12) df, for criterion p < .25; F = 1.56 9. Obtained Fresid = 0.17< 1.56, therefore residual is not significant, therefore contrast result is a good fit for data.
Post Hoc Tests Do female democrats differ from other groups? 1 = Male/Republican 5.00 2 = Male/Democrat 4.50 3 = Female/Republican 4.75 4 = Female/Democrat 2.75 Conduct three t tests? NO. Why not? Will capitalizes on chance. Solution: Post hoc tests of multiple comparisons. Post hoc tests consider the inflated likelihood of Type I error Kent's favorite—Tukey test of multiple comparisons, which is the most generous. NOTE: Post hoc tests can be done on any multiple set of means, not only on planned contrasts.
Conducting Post Hoc Tests 1. Recode data from multiple factors into single factor, as per planned contrast. 2. Run oneway ANOVA statistic 3. Select "posthoc tests" option. Note: Not necessary to conduct planned contrast to conduct post-hoc test ONEWAY gunctrl BY genparty /CONTRAST= -1 -1 -1 3 /STATISTICS DESCRIPTIVES /MISSING ANALYSIS /POSTHOC = TUKEY ALPHA(.05). Selected post-hoc test
Data Management Issues Setting up data file Checking accuracy of data Disposition of data Why obsess on these details? Murphy's Law If something can go wrong, it will go wrong, and at the worst possible time. Errars Happin!
Creating a Coding Master 1. Get survey copy 2. Assign variable names 3. Assign variable values 4. Assign missing values 5. Proof master for accuracy 6. Make spare copy, keep in file drawer
Coding Master variable values variable names Note: Var. values not needed for scales
Cleaning Data Set 1. Exercise in delay of gratification 2. Purpose: Reduce random error 3. Improve power of inferential stats.
Complete Data Set Note: Are any cases missing data?
Checking Descriptives Are any “Minimums” too low? Are any “Maximums” too high? Do Ns indicate missing data? Do SDs indicate extreme outliers?
Checking Correlations Between Variables Do variables correlate in the expected manner?
Number of Siblings Gender Oldest Oldest Youngest Youngest Only Child Only Child None Males 10 4 10 3 6 20 Females One 3 5 4 15 20 0 More than one TOTAL 3 15 25 4 40 2 TOTAL 10 10 8 Using Cross Tabs to Check for Missing or Erroneous Data Entry Case A: Expect equal cell sizes Case B: Impossible outcome
Storing Data Raw Data 1. Hold raw data in secure place 2. File raw data by ID # 3. Hold raw date for at least 5 years post publication, per APA Automated Data 1. One pristine source, one working file, one syntax file 2. Back up, Back up, Back up ` 3. Use external hard drive as back-up for PC
21-40 41-60 61-80 81-100 101-120 File Raw Data Records By ID Number 01-20
COMMENT SYNTAX FILE GUN CONTROL STUDY SPRING 2007 COMMENT DATA MANAGEMENT IF (gender = 1 & party = 1) genparty = 1 . EXECUTE . IF (gender = 1 & party = 2) genparty = 2 . EXECUTE . IF (gender = 2 & party = 1) genparty = 3 . EXECUTE . IF (gender = 2 & party = 2) genparty = 4 . EXECUTE . COMMENT ANALYSES UNIANOVA gunctrl BY gender party /METHOD = SSTYPE(3) /INTERCEPT = INCLUDE /PRINT = DESCRIPTIVE /CRITERIA = ALPHA(.05) /DESIGN = gender party gender*party . ONEWAY gunctrl BY genparty /CONTRAST= -1 -1 -1 3 /STATISTICS DESCRIPTIVES /MISSING ANALYSIS /POSTHOC = TUKEY ALPHA(.05). Save Syntax File!!!
Research Project Notebook Purpose: All-in-one handy summary of research project Content: 1. Administrative (timeline, list of staff, etc.) 2. Overview 3. Experiment Materials * Surveys * Consents, debriefings * Manipulations * Procedures summary/instructions 4. IRB materials * Application * Approval 5. Data * Coding forms * Syntax file * Primary outcomes