1.14k likes | 1.16k Views
Learn the principles of creating excellent graphics and charts for computer systems performance analysis. Discover guidelines for good graphics, common mistakes to avoid, and how to effectively present data. Benefit from useful reference works and enhance your data visualization skills.
E N D
Graphical Presentation Andy Wang CIS 5930 Computer Systems Performance Analysis
The Art of Graphical Presentation • Reference works • Types of variables • Guidelines for good graphics charts • Common mistakes in graphics • Pictorial games • Special-purpose charts
Useful Reference Works • Tufte ER, The Visual Display of Quantitative Information,1983 • Tufte ER, Envisioning Information,1990 • Tufte ER, Visual Explanations,1997 • Huff D, How to Lie With Statistics, 1954
Types of Variables • Qualitative • Ordered (e.g., modem, Ethernet, satellite) • Unordered (e.g., CS, math, literature) • Quantitative • Discrete (e.g., number of terminals) • Continuous (e.g., time)
Charting Basedon Variable Types • Qualitative variables usually work best with bar charts or Kiviat graphs • If ordered, use bar charts to show order • Quantitative variables work well in X-Y graphs • Use points if discrete, lines if continuous • Bar charts sometimes work well for discrete
Guidelines for Good Graphics Charts • Principles of graphical excellence • Principles of good graphics • Specific hints for specific situations • Aesthetics • Friendliness
Principlesof Graphical Excellence • Graphical excellence is the well-designed presentation of interesting data: • Substance • Statistics • Design
Graphical Excellence (2) • Complex ideas get communicated with: • Clarity • Precision • Efficiency
Graphical Excellence (3) • Viewer gets: • Greatest number of ideas • In the shortest time • With the least ink • In the smallest space
Graphical Excellence (4) • Is nearly always multivariate • Requires telling truth about data
Principles of Good Graphics • Above all else show the data • Maximize the data-ink ratio • Erase non-data ink • Erase redundant data ink • Revise and edit
Erase Non-Data Ink North West East
Erase Redundant Data Ink North West East
Erase Redundant Data Ink North West East
Specific Things to Do • Give information the reader needs • Limit complexity and confusion • Have a point • Show statistics graphically • Don’t always use graphics • Discuss it in the text
Give Informationthe Reader Needs • Show informative axes • Use axes to indicate range • Label things fully and intelligently • Highlight important points on the graph
Limit Complexityand Confusion • Not too many curves • Single scale for all curves • No “extra” curves • No pointless decoration (“ducks”)
Have a Point • Graphs should add information not otherwise available to reader • Don’t plot data just because you collected it • Know what you’re trying to show, and make sure the graph shows it
Having a Point • Sales were up 15% this quarter:
Show Statistics Graphically • Put bars in a reasonable order • Geographical • Best to worst • Even alphabetic • Make bar widths reflect interval widths • Hard to do with most graphing software • Show confidence intervals on the graph • Examples will be shown later
Don’t AlwaysUse Graphics • Tables are best for small sets of numbers • Tufte says 20 or fewer • Also best for certain arrangements of data • E.g., 10 graphs of 3 points each • Sometimes a simple sentence will do • Always ask whether the chart is the best way to present the information • And whether it brings out your message
Discuss It in the Text • Figures should be self-explanatory • Many people scan papers, just look at graphs • Good graphs build interest, “hook” readers • But text should highlight and aid figures • Tell readers when to look at figures • Point out what figure is telling them • Expand on what figure has to say
Aesthetics • Not everyone is an artist • But figures should be visually pleasing • Elegance is found in • Simplicity of design • Complexity of data
Principles of Aesthetics • Use appropriate format and design • Use words, numbers, drawings together • Reflect balance, proportion, relevant scale • Keep detail and complexity accessible • Have story about the data (narrative quality) • Do professional job of drawing • Avoid decoration and chartjunk
Use AppropriateFormat and Design • Don’t automatically draw a graph • Mentioned before • Choose graphical format carefully • Sometimes “text graphic” works best • Use text placement to communicate numbers • Very close to being a table
CEA: +4.7 WEF: 6.8 DR: +4.5 CB: 6.7 NABE: +4.5 NABE: 6.7 WEF: +4.5 IBM: 6.6 CBO: +4.4 DR: 6.5 CB: +4.2 NABE: +6.2 CBO: 6.3 IBM: +4.1 IBM: +5.9 WEF: +21 CEA: 6.3 GNP: +3.8 IPG: +5.8 CPI: +7.7 Profits: +13.3 Unempl: 6.0 CE: +2.9 CB: +5.5 IBM: +6.6 DR: +10.5 DR: +5.2 NABE: +6.5 IBM: +10.4 WEF: +4.8 CB: +6.2 CE: +6.5 Using Text as a Graphic About a year ago, eight forecasters were asked for their predictions on some key economic indicators. Here’s how the forecasts stack up against the probable 1978 results (shown in the black panel). (New York Times, Jan. 2, 1979)
Family Structure… Hard to spot trends; can’t get meaningful results by applying data mining directly on categories
Family Structure • Family structure vs. ID theft percentage (3.6% average) vs. annual household income ($33K average) vs number of moves in the past 5 years (0.59 average) • Household annual income is negatively correlated with ID theft rate (- 28%) M = male ~M = female S = living with spouse ~S = without spouse C = with children ~C = without children R = with relatives ~R = without relatives F = with non-relatives ~F = without non-relatives
The Stem-and-Leaf Plot • Date set: 44, 46, 47, 49, 63, 64, 66, 68, 68, 72, 72, 75, 76, 81, 84, 88 • Stem-and-leaf plot 4 | 4 6 7 9 5 | 6 | 3 4 6 8 8 7 | 2 2 5 6 8 | 1 4 8