250 likes | 259 Views
Exploring covariation and group differences to understand variability in research. Focus is on identifying systematic variability in variables and their relationships. Common applications and calculations like Pearson's r are discussed.
E N D
Simple Covariation Focus is still on ‘Understanding the Variability” With Group Difference approaches, issue has been: Can group membership (based on ‘levels of the IV’) account for variability of the DV? Information used was differences in ‘typical’ outcomes across the levels of the IV.
Simple Covariation Group Difference - Did typical outcomes differ enough to suggest the presence of systematic variability? Was variability in IV associated with variability in DV to a degree unlikely to be due to ‘unsystematic’ variations? How much of the variability has been ‘explained’, and how much has not (residual)
Simple Covariation Now the focus is on the degree to which pairs of variables from a common source covary. (are systematic changes in one variable associated with systematic changes in the other) No longer looking at typical performance for the group, now variability of both variables is at the individual level. If the two variables covary systematically, then knowing one variable might ‘explain’ or ‘account for’ variability of the other
Simple Covariation Source of paired scores can be any type of entity: people, days, families, countries…. No longer categorize variables as IV and DV, just two variables from the same ‘source’ Seek to measure the strength of the relationship (covariation) between two variables.
Simple Covariation Correlation Coefficient is an index of the relationship All of these provide an index of ‘strength’ of the relationship on a 0 – 1 ordinal scale Some also provide ‘direction’ information, when appropriate (+/-)
Simple Covariation Correlation Coefficient is the index of the relationship Various forms, depending upon data Pearson’s r – two interval/ratio variables eta – one categorical, one interval/ratio variable phi or Cramer’s V – two categorical variables Spearman’s rho – two ordinal or one ordinal and one interval variable (scores converted to ranks) see examples next Not all provide meaningful direction information – but SPSS will still give sign
Simple Covariation Common applications Preliminary evidence, prior to controlled experiment - If Cause and Effect exists, covariation should Assess degree of association/similarity among variables – Is Cheerfulness the same as Agreeableness Is Optimism related to Risk Taking Develop prediction strategy – can SAT predict College Success?
Simple Covariation Pearson’s Product Moment Coefficient (r) Index of strength and direction of a linear relationship if two variables covary in a linear relationship, then an individual’s relative position (deviations from means) on each variable should be similar
Simple Covariation Pearson’s Product Moment Coefficient (r) r = covariance/‘variance’ – refresh on calculation of variance show connection to covariance r = sum (zx * zy)/df (n-1) where n = # of pairs r2 = shared variance (ratio scale) coefficient of determination Ho: r = 0, tested using a t-test with n-2 df n = # of pairs of measures
Simple Covariation Examine the relationship using scatter plot Perceived Stress in the Past, and Expected Stress in the Future No stress 0 to 56 Highest stress
Simple Covariation Assumptions for Pearson’s r interval/ratio data independent observations (pairs) each variable normally distributed (or not obviously not normal) linear relationship (no evidence of clear nonlinear pattern) bivariate normal distribution – (3 dimensional normal pile) homoscedasticity (similar variability of Y at values of X)
Simple Covariation Limiting conditions for Pearson’s r bivariate outliers – reduces r if truly outlier on both variables truncated range – effect depends upon actual relationship (linear or nonlinear)
Simple Covariation With all data With two pairs removed Limiting conditions for Pearson’s r bivariate outliers truncated range If try to ‘fit’ a straight line through the scatter-plot. How would the 2 outliers impact the line?
Simple Covariation Typical sequence in evaluating r check assumptions calculate r When reporting r, df are number of ‘pairs’ minus 2 assess statistical significance t-test for r=0 compute r2 Coefficient of determination interpret strength and direction discuss “effect size” – shared variance (r2) Note: in GPower use Correlation: Bivariate Normal for Pearson r
Simple Covariation If you wanted to interpret all of the r’s, you would have 15 tests on the same individuals – so Type 1 will be inflated. However, you may only care about r’s for GREs with GPA Total, so only 4 r’s are relevant. As always, balance Type 1 and Type 2. Listwise – must have score on every variable Note sample size here, and on next page, from SAME data set!
Simple Covariation N’s much lower in column for GRE Analytic – why? Pairwise – included whenever have both scores for a coefficient N’s range from 263 to 399 using Pairwise
Simple Covariation Pearson r vs. Spearman rho Difference based on whether you were willing to consider rating scale: Definitely no (1) to (9) Definitely yes to be interval or ordinal
Simple Covariation Covariation and causality Conditions needed to infer Cause-Effect 1 two variables covary (covariation) 2 cause precedes the effect 3 other potential causes controlled
Simple Covariation Covariation and causality Conditions needed to infer Cause-Effect 1 two variables covary (covariation) Correlation coefficients can provide a reasonable test of condition #1 Is there evidence for significant (systematic) covariation? 2 cause precedes the effect 3 other potential causes controlled
Simple Covariation Covariation and causality Conditions needed to infer Cause-Effect 1 two variables covary (covariation) 2 cause precedes the effect Correlation does not directly deal with this condition – creating the… Directionality problem X Y or Y X - which of these is more likely to be true Cross-lagged strategy – provides evidence to help decide
Simple Covariation Covariation and causality Cross-lagged strategy Time 1 Var X (TV violence) Var Y (Aggressive Behaviors) Time 2 Var X (TV violence Var Y (Aggressive Behavior) Y as Cause X as Cause Which direction of cause – effect receives stronger support
Simple Covariation Covariation and causality Conditions needed to infer Cause-Effect 1 two variables covary (covariation) 2 cause precedes the effect 3 other potential causes controlled Because you simply select for or measure your variables, have less potential to isolate the variables of interest from other extraneous variables – creating… “Third” Variable Problem The Solution – Partial Correlation
Simple Covariation Covariation and causality Partial correlation (pr) Examine correlation of X & Y after ‘removing’ variation in each that can be explained by variable Z (correlation of the residuals for X and Y after removing relationship with Z) – clearer after regression X Y Third variable problem exists when both X and Y are related to Z, the Third variable, so the covariation of X and Y is the result of Z influencing both X and Y Z
Simple Covariation Women in dating relationships where there had been physical abuse, were asked for rated Commitment to a partner, Time in relationship, and Perceived Investments in the relationship Commitment and How long in months you have been in the relationship are correlated at +.349 Note that for correlation table, N is reported, but for Partial Correlation, df are reported. When control for Investments made to relationship, correlation reduced to +.165