800 likes | 986 Views
SEEING IS BELIEVING: Telling stories with statistics – in pictures. We’re failing. Do you see the same thing here?. This is your brain on statistics. The total sample is (roughly) evenly divided by gender.
E N D
SEEING IS BELIEVING: Telling stories with statistics – in pictures
The total sample is (roughly) evenly divided by gender. Subtracting 72 from the 150 one would expect gives a value of about 80, which squared is 6,400. It is already obvious this is significant.
Seeing is a learned skill Statisticians may see things in a picture others don’t
My points (surprisingly, I do have some)
Data Visualization Graphics do not necessarily stand alone
Visual representation in one context is often misapplied to another.
Basic Assumptions • Our audience needs to be taught to read visual data just as we read numeric data, and we need to learn to have some discussion beyond the choices of line graphs vs. pie charts
You learned to read numbers You need to learn to WRITE pictures Or, to be more specific, you need to explain to others what you see in pictures
Bad visualization for one question can be good for another • Who will win the election? • Which regions support the Democrats? Poll dataset did not include Hawaii or Alaska
AN EXAMPLE OF PROGRAM EVALUATION DATA VISUALIZATION BY EXAMPLE
The government is smarter than you think (No, I’m serious)
GOPTIONS HBY = 2 ;PROCGPLOT DATA=wussexample UNIFORM; PLOT z_total_post * z_total_pre / VREF=0 ; BY group;
NOTE: Regression equation : z_total_post = 0.13379 + 0.776552*z_total_pre. NOTE: The above message was for the following BY group: group=CONTROL NOTE: Regression equation : z_total_post = 1.233616 + 0.578418*z_total_pre. NOTE: The above message was for the following BY group: group=EXPERIMENTAL Equations in the sas log for the statistician in you
Admittedly, we did not train people while flying on a trapeze Training was administered to four cohorts
Creating the interaction graph First, in the RESULTS window, type sgedit on
Creating the interaction graph First, in the RESULTS window, type sgedit on Ods listing sge = on ; Ods graphics on ; procglm data = plots ; class TestType cohort ; model z_total = TestType cohort TestType*cohort ; where group = "EXPERIMENTAL" ;
Of course, that is kind of like being the smaller midget Oddly, the most time-consuming part of this is making the lines thicker
Using SGEDIT to, well, edit • Double-click on the .sge file in the RESULTS window • Right-click in the plot area & select PLOT PROPERTIES • Select desired line thickness
Yes, the TestType*Cohort*Group interaction (F=5.84, p < .0001) AND the TestType*Group interaction (F=22.92, p < 0001) in the other repeated measures ANOVA were significant. Thanks for asking!
Graphs sometimes provide better information than numbers (Especially true for small samples)
or… How SAS ODS GRAPHICS can improve your life
Are these test related? R=.22
Another example • Years of Education as predictor of gain score • R-square = .46 • Correlation = .68) • P <.01.
Now looky here … Is it a real relationship?
What should we do? Throw the score out? Keep the score in? Something else?
Ignoring my partner … Compare your answers with the people next to you
Sometimes outliers are the most interesting part of your study
ODS GRAPHICS ON; <some procedure> ODS GRAPHICS OFF;
One last example on knowing your data Not just telling a story, having a conversation
Custom Map-making How to plot the largest category in a frequency distribution