1 / 16

Two Quantitative Variables

Learn how to draw scatterplots, interpret association, calculate correlation, and analyze relationships between variables. Explore examples and understand the strength of linear relationships in data sets.

janwoods
Download Presentation

Two Quantitative Variables

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Two Quantitative Variables • Scatterplots • examples • how to draw them • Association • what to look for in a scatterplot • Correlation • strength of a linear relationship • how to calculate • good news and bad news

  2. 80 BOATS 50 20 40 20 30 CARS Scatterplot

  3. 80 BOATS 50 20 40 20 30 CARS Scatterplot

  4. Made-up Examples STATE AVE SCORE PERCENT TAKING SAT

  5. Made-up Examples IQ SHOE SIZE

  6. Made-up Examples JUDGE’S IMPRESSION 450 250 350 BAKING TEMP

  7. Made-up Examples LIFE EXPECTANCY GDP PER CAPITA

  8. Scatterplots: Which variable goes where? • RESPONSE VARIABLE goes on Y axis • (“Y”) (“dependent variable”) • EXPLANATORY VARIABLE goes on X axis • (“X”) (“independent variable”) • If neither is really a response variable, it doesn’t matter which variable goes where.

  9. Scatterplots: Drawing Considerations • Don’t show the axes without a good reason • Don’t show gridlines without a good reason • Scales should cover the ranges of the variables-- • —outliers? • —no need to include 0 • —what if same units?

  10. What to look for in a scatterplot… • Do the cases break up into separate clusters? • Are there outliers? • Is there an ASSOCIATION between the • variables? OR are they INDEPENDENT? • ALWAYS DRAW THE PICTURE !!!!

  11. Kinds of Association… • Positive vs. Negative • Strong vs. Weak • Linear vs. Non-linear

  12. CORRELATION • CORRELATION • (or, the CORRELATION COEFFICIENT) • measures the strength of a linear relationship. • If the relationship is non-linear, it measures the strength of the linear part of the relationship. But then it doesn’t tell the whole story. • Correlation can be positive or negative.

  13. Computing correlation… • Replace each variable with its standardized version. • Take an “average” of ( xi’ times yi’ ):

  14. Computing correlation sum of all the products r, or R, or greek  (rho) n-1, not n

  15. Good things about correlation • It’s symmetric ( correlation of x and y means same as correlation of y and x ) • It doesn’t depend on scale or units • — adding or multiplying either variable by • a constant doesn’t change r • — of course not; r depend only on the • standardized versions • r is always in the range from -1 to +1 • +1 means perfect positive correlation; dots on line • -1 means perfect negative correlation; dots on line • 0 means no relationship, OR no linear relationship

  16. Bad things about correlation • Sensitive to outliers • Misses non-linear relationships • Doesn’t imply causality

More Related