360 likes | 371 Views
Learn how graphing can help you better understand the relationship between variables and effectively communicate your data to your audience. Discover techniques to minimize non-data ink and avoid chart junk. Use graphs as diagnostics for regressions and explore the use of logs for trending variables.
E N D
Coin data • How can we see what’s going on better? • Long run vs. short run
Graphing • Graphing helps you see the relationship between variables • Time series plots vs. scatter plots
Elements of Graphical Style • Know your audience & know your goals • Show the data and appeal to the viewer • Minimize non-data ink • Avoid chart junk • Revise and edit, again and again
Non-data ink Note that after 1990, the pattern of velocity and opportunity cost changed significantly, with a couple of years of transition and a new pattern to the slope. The new intercept was much higher than the old one and thus the relationship changed so that it was much more difficult to use the model for forecasting.
Chartjunk • Don’t use 3 dimensions for a 2-dimensional object • Don’t add decorations, cartoons, etc. that do not tell your story Hi, I’m irrelevant!
Make graphs tell your story • The golden ratio of height to width is 0.618 • Use scale to show variations in a variable
Beware of Outliers • Measurement outliers • Data errors • Innovation outliers • A shock or innovation
Adding recession bars • Often help explain data well
Graphs as diagnostics for regressions • Plot actual and fitted values; residuals over time • Plot residuals squared or absolute values of residuals over time (solutions: interactive data analysis) • Do a scatter plot of residual vs. explanatory variable
Example: consumption & income • We can save residuals and do plots of residuals themselves, actual & predicted, residuals vs. explanatory variables • Later, using saved residuals, we can plot squares and absolute values • Note that non-random residuals suggests that a non-linear model may be better
Logs for trending variables • When variables trend upwards, the graph of the variable shows too much recent information, not enough past information