1 / 21

Correlation with a Non - Linear Emphasis

Understand the complexities of correlation with a nonlinear emphasis. Learn to interpret scatterplots, residual patterns, and various regression shapes. Gain insights into lurking variables and causation pitfalls. Enhance your statistical analysis skills today!

biancaj
Download Presentation

Correlation with a Non - Linear Emphasis

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Correlation with a Non - Linear Emphasis Day 2

  2. Correlation measures the strength of the linearassociation between 2 quantitative variables. • Before you use correlation, you must check several conditions:

  3. Quantitative Variables Condition: Are both variables quantitative? • Straight Enough Condition: Is the form of the scatterplot straight enough that a linear relationship makes sense? If the relationship is not linear, the correlation will be misleading. • Outlier Condition: Outliers can distort the correlation dramatically. If an outlier is present it is often good to report the correlation with and without that point.

  4. A hidden variable that stands behind a relationship and determines it by simultaneously affecting the other two variables is called a lurking (confounding) variable. • Scatterplots and correlation coefficients NEVER prove causation.

  5. Don’t ever assume the relationship is linear just because the correlation coefficient is high. • In order to determine whether a relationship is linear or not linear, we must always look at the residual plot.

  6. Residuals • A residualis the vertical distance between a data point and the graph of a regression equation.

  7. The Residual is • positive if the data point is above the graph. • negative if the data point is below the graph. • Is 0 only when the graph passes through the data point.

  8. What should you look for to tell if it is not linear?...... • Sometimes a high “r” value for linear regression is deceptive. You must look at the scatter plot AND you must look at the residual pattern it makes. • If the residuals have a curved pattern then it is NOT linear.

  9. To prove linearity • A scatterplot of the residuals vs. the x-values should be the most boring scatterplot you’ve ever seen. • It shouldn’t have any interesting features, like a direction or a shape. • It should stretch horizontally, with about the same amount of scatter throughout. • It should show no bends. • It should show no outliers.

  10. Some Non Linear Regression Shapes…… • Positive Quadratic Regression: • Negative Quadratic Regression:

  11. More Non Linear Regression Shapes…… • Positive Exponential Regression: • Negative Exponential Regression:

  12. Quadratic and Exponential on GDC…… • Quadratic: • Exponential:

  13. Example……The scatter plot could possibly be linear. You must check the residual pattern.

  14. Change y-list to resid after running a linear correlation regression – 2nd stat resid: • Notice the curved pattern in the residuals.

  15. NOTE!!!!!! • Just because the curved pattern on the residuals looks like a quadratic we cannot determine that until we check the “r” value of other curved functions and see how well the data fits. • You should also consider “real-life” implications when deciding.

  16. When you see that the residuals are curved you must check the correlation coefficient for the exponential and the quadratic to choose the stronger correlation. • A check on the exponential regression yield an r – value of -0.956. (Strong Negative but check out the quadratic….)

  17. This is a quadratic regression….. • Equation: y=.00946x² - 0.839x+18.5 r = 0.966 This value is even stronger than the exponential.

  18. Example 2……Is it linear?

  19. Look at the residuals…… • There is a curved pattern in the residuals. It is NOT linear – it is either quadratic or exponential. (Positive) • Use the “r” value to help you decide.

  20. And the Winner is….. • Here is the equation you should use for predictions: y = 1(2) x

  21. Homework • Follow the flowchart.

More Related