1 / 11

2.4: Cautions about Regression and Correlation

Learn about the cautions when using regression and correlation analysis. Understand the limitations of correlation, dangers of extrapolation, issues with lurking variables, influential data points, and more.

Download Presentation

2.4: Cautions about Regression and Correlation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. 2.4: Cautions about Regression and Correlation

  2. Cautions: Regression & Correlation • Correlation measures only linear association. • Extrapolation often produces unreliable predictions. • Correlation and least-squares regression are not resistant. • Lurking variables can make a correlation or regression misleading.

  3. Residual Plots • A residual plot is a scatterplot of the regression residuals (i.e., errors) against the explanatory variable. • Residual plots make patterns in the original scatterplot of data more apparent. • If the regression catches the overall pattern of the data, there should be no evident pattern to the residuals.

  4. Cautions: Regression & Correlation • Correlation measures only linear association. • Extrapolation often produces unreliable predictions. • Correlation and least-squares regression are not resistant. • Lurking variables can make a correlation or regression misleading.

  5. Cautions: Regression & Correlation • Correlation measures only linear association. • Extrapolation often produces unreliable predictions. • Correlation and least-squares regression are not resistant. • Lurking variables can make a correlation or regression misleading.

  6. Outliers & Influential Data Points • Remember, an outlier is an observation that lies outside the overall pattern of the other observations. • In a least-squares regression, does an outlier have to have a large residual?

  7. Outliers & Influential Data Points • Points that are outliers in the y direction have large regression residuals. • Other outliers need not have large residuals.

  8. Outliers & Influential Data Points • An observation is influential if removing it would markedly change the result of the regression. • Outliers in the x direction of a scatterplot are often influential in least-squares regression.

  9. Cautions: Regression & Correlation • Correlation measures only linear association. • Extrapolation often produces unreliable predictions. • Correlation and least-squares regression are not resistant. • Lurking variables can make a correlation or regression misleading.

  10. Lurking Variable • A lurking variable is a variable that is not among the explanatory and response variables, yet may influence the interpretation of the relationships among those variables. • Association does not imply causation! • A lurking variable may have a cause-and-effect relationship with the x and y variables, creating a strong association between x and y.

More Related