1 / 55

Heads Up! Sept 22 – Oct 4

Heads Up! Sept 22 – Oct 4. Probability Perceived by many as a difficult topic Get ready ahead of time. Last Time:. Least Squares Regression (Simple Linear Regression) Correlation. In Least-Squares Regression:. Computational Formula. Can we do this?. Totals:.

Download Presentation

Heads Up! Sept 22 – Oct 4

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Heads Up!Sept 22 – Oct 4 Probability Perceived by many as a difficult topic Get ready ahead of time

  2. Last Time: Least Squares Regression (Simple Linear Regression) Correlation

  3. In Least-Squares Regression: Computational Formula

  4. Can wedo this? Totals:

  5. Calculating the Least Squares Regression Line contd.

  6. Regression Equation TRIAL = 1.09 PRACTICE - 9 Slope is 1.09 10.9 10 Intercept is -9 You can’t see it in this graph

  7. A view from further away….

  8. Look at the residuals: We want a shot-gun blast shape, i.e., a random blob

  9. Look at Residuals & Line Fit Residual Plot Problem: Relationship is not linear Line Fit Plot

  10. Look at Residuals & Line Fit Residual Plot Problem: Predictions are very precise for small predicted values, but very unprecise for large predicted values. (Not good)

  11. Look at Residuals Residual Plot 1 2 3 4 5 6 7 8 9 10 11 12 Problem: Lurking (third) variables (?) Here: Seasonal Trend?

  12. Correlation Slope in regression of standardized variables How strong is the linear relationship between two variables X and Y? This slope tells me How much a given change (in standardized units) of X translates into a change (in standardized units) of Y

  13. Correlation How strong is the linear relationship between two variables X and Y? Correlation Coefficient Computational Formula:

  14. Properties of Correlation • Symmetric Measure (You can exchange X and Y and get the same value) • -1 ≤ r ≤ 1 • -1 is “perfect” negative correlation • 1 is “perfect” positive correlation • Not dependent on linear transformations of X and Y • Measures linear relationship only

  15. Let’s try it out on our X = PRACTICE, Y = TRIAL Data Set Check this calculation at home!

  16. Today Finish Theory on Regression Pathologies and Traps in Linear Regression and Correlation Relationships between Categorical Variables

  17. Regression on Standardized Variables

  18. ?

  19. What is the variance of ?

  20. Proportion of Variance explained Variance of predicted Y’s Proportion of Variance of observed Y’s that is accounted for by the regression Variance of observed Y’s

  21. Proportion of Variance explained Proportion of Variance of observed Y’s that is accounted for by the regression Note: If you exchange X and Y in the regression, you find the same r and r squared

  22. Correlation only checks magnitude of Linear Relationships! It can happen that r=0, even though X and Y are highly related to each other! Need to look at Scatter Plot and Residual Plot to make sure that you don’t miss an obvious relationship overlooked by linear regression!

  23. How does a Linear Regression Model approximate (for X=1,2,…,15) For these particular data the regression model finds a = -45 b = 16 The residuals have a systematic trend!! This Linear Regression is inappropriate!!

  24. How does a Linear Regression Model approximate (for X=-8,-7,…,7,8) For these particular data the regression model finds a = 24 b = 0 The residuals have a systematic trend!! This Linear Regression is inappropriate!!

  25. How does a Linear Regression Model approximate (for X=-8,-7,…,7,8) For these particular data the regression model finds a = 24 b = 0 r = 0 Correlation is Zero: No LINEAR Relationship Is there “no relationship” between X and Y? There is an extremely strong (nonlinear) relationship here!

  26. How does a Linear Regression Model approximate (for X=1,2,…,15) For these particular data the regression model finds a = .54 b = .16 The residuals have a systematic trend!! This Linear Regression is inappropriate!!

  27. Correlation is not Causation! Correlation between the size of your big toe and your performance on reading tasks is highly positive! ?? Lurking Third Variable: AGE

  28. Correlation is not Causation! ? Only experimentation allows us to attribute causation to the relationship between independent and dependent variables.

  29. Ecological Correlation:Correlations between averages are higher than correlations between individuals Y Group averages Y X X Group averages

  30. Problem of Restricted Range Success in Graduate School GRE scores Strong Linear Relationship No Linear Relationship

  31. Extrapolations are Dangerous Number of Passengers Year

  32. Regression toward the Mean The term “Regression” is associated with Sir Francis Galton (1822 – 1911) Galton (1885) “Regression towards Mediocrity In Hereditary Stature” Journal of the Anthropological Institute Picture taken from http://www.gene.ucl.ac.uk/

  33. Regression toward the Mean Suppose:

  34. Regression toward Mediocrity?? Predictions are closer to zero (the mean) then the observations!!

  35. r=.60 1.2 1.2 2.0 2.0

  36. r=.60 1.2 2.0 Among families where the father is approximately 2 standard deviations above the mean, the average son is only about 1.2 standard deviations above the mean.

  37. Regression toward Mediocrity?? Do the sons just become more similar to each other than their fathers were?

  38. Regression toward Mediocrity?? Variability of the Z scores is the same! No slide into mediocrity!!

  39. Regression toward the mean When you have a lucky and exceptionally good performance in an exam, you expect to do worse next time, because there is no reason to believe that you will be so exceptionally lucky again. When you have a mental block and exceptionally bad performance in an exam, you expect to do better next time, because there is no reason to believe that you will be so exceptionally unlucky again. This does not mean that you are becoming more and more average as time progresses. It means that your average performance, as a reasonable predictor for future performance, will lead to such a pattern of relationships between observed and predicted performance

  40. Regression toward the mean Your room mate makes a huge mess in your room. You complain. The next few days are cleaner. Your room mate has cleaned up the room. You praise your room mate. The next few days the room gets dirtier. Does this mean that punishment leads to better performance and reward leads to worse performance? No….

  41. Regression toward the mean Your room mate makes a huge mess in your room. You do nothing. The next few days are cleaner. Your room mate has cleaned up the room. You do nothing. The next few days the room gets dirtier. Your room mate simply makes messes, cleans them, makes messes, cleans them … Your best guess for the future is an “average” level of messiness

  42. Implications for Research It is very risky to study anything based on selection of extreme groups Test   Retest Extremes become less extreme May look like a treatment effect!

  43. Relationships between Categorical Variables 237 50 32 255 287 Marginal Distributions

  44. Theory “Mothers tend to hold their babies with the non-dominant hand, so that the dominant hand is available to do stuff.”

  45. Relationships between Categorical Variables .826 (82.6%) .174 (17.4%) .889 (88.9%) .111 (11.1%) Marginal Proportions (Percentages) Vast majority of babies held left Vast majority of mothers right-handed

  46. Relationships between Categorical Variables 1 (100%) 1 (100%) Absolute size not taken into account Conditional proportions, given side on which the baby is held

  47. Relationships between Categorical Variables Absolute size not taken into account 1 (100%) 1 (100%) Conditional proportions, given dexterity of mother

  48. Relationships between Categorical Variables Absolute size not taken into account 1 (100%) 1 (100%) For any given dexterity of the mother, there is an overwhelming tendency to hold the baby on the left hand side.

  49. Segmented Bargraphs

  50. Segmented Bargraphs

More Related