430 likes | 450 Views
Explore methods to reduce clutter in database visualization, improve user performance, and enhance visual appeal. Discover techniques such as semantic zoom, buffering intermediate results, and goal-directed zoom. Learn about data lineage, weak inversion, and software systems supporting fine-grained lineage. See how visualization tools like DataSplash and Tioga optimize data flow for efficient analysis.
E N D
Allison WoodruffUniversity of California, Berkeley Clutter Reduction in Database Visualization
Motivation • Clutter can have negative effects • Decreased user performance • Diminished visual appeal • ...
Related work • Semantic zoom • SDMS, Pad, Pad++ • Filters/Highlighting • IVEE, Magic Lenses, EDV • Focus + context • Fisheyes
Outline • Data lineage • Data Engineering ‘97 • Buffering of intermediate results • Semantic zoom in DataSplash • Density width bars • Non-uniform data • Semi-automated construction • Goal-directed zoom
Data lineageMotivation • Earth scientist • Processes input data • Views result • Detects anomaly • Queries system • System • Tracks at coarse level • Fails to track at fine-grained level ?
Data lineageInversion of a function • User wants to identify parts of the input that map onto a given subset of the output • Unfortunately, a function that inverts perfectly doesn’t always exist Input Output f Perfect inversion
Data lineageWeak inversion • Often a weak inversion function exists • Doesn’t find perfect inverse • Finds a weak inverse that has certain guaranteed properties with relationship to the perfect inverse • There are rules for combining weak inverses to preserve properties Input Output f Weakinversion
Data lineageSummary • Software system • Expert users register weak inversion functions • User specifies anomaly and desired properties • System designs an execution plan to generate and combine weak inverses • Method to support fine-grained data lineage • Relies on limited information about processing steps • Provides imperfect inversions along with qualitative guarantees about the accuracy of these inversions • Eliminates much of the irrelevant source data
Outline • Data lineage • Buffering of intermediate results • Visual Languages ‘95 • Semantic zoom in DataSplash • Density width bars • Non-uniform data • Semi-automated construction • Goal-directed zoom
BufferingTioga • Dataflow language for visualization • Nodes are database procedures • Inputs and final results are stored in database tables • Intermediate results not stored • All results may be viewed by users
BufferingMotivation • Users revisit intermediate results • Debugging • Tuning • Data lineage (manual) • System revisits intermediate results • Re-execute with new inputs • Animation • Data lineage (weak inversion)
BufferingApproach • Buffer intermediate results to minimize recomputation over a sequence of queries to intermediate nodes
computed query buffered from Step 1 guarded BufferingExample scenario Step 1 At each step, the buffering algorithm must choose what to keep Step 2 Buffered nodes eliminate the need to compute their ancestors
BufferingResults • New heuristics are the most effective • Traditional methods are the least effective Offline Graph structure and user movement Graph structure Relative execution time Random Traditional (FIFO) Typical Randommoves Bushygraphs Variablesizes Largegraphs Dataflow graph structure and user movement
Outline • Data lineage • Buffering of intermediate results • Semantic zoom in DataSplash • Visual Languages ‘94, Visual Databases ‘95,Data Engineering ‘96 • Density width bars • Non-uniform data • Semi-automated construction • Goal-directed zoom
Semantic zoom in DataSplashMotivation • Definition of semantic zoom • Generally, two-dimensional canvas • User can pan and zoom • Objects change as the user zooms • Behavior of objects during zooming usually programmed by experts • Goal: make semantic zoom end-user programmable
Semantic zoom in DataSplashLayer manager • DataSplash objects are organized into layers • Each layer appears as a vertical layer bar in the layer manager • Layer bar shows elevation range at which layer is visible • Users can directly manipulate the bars Highelevation Low elevation
Semantic zoom in DataSplashView from a high elevation At the user’s current elevation, only the state outline layer is visible
Semantic zoom in DataSplashView from an intermediate elevation The cities circles layer becomes visible when the user zooms
Semantic zoom in DataSplashView from a low elevation The graph layer replaces the circles layer
Outline • Data lineage • Buffering of intermediate results • Semantic zoom in DataSplash • Density width bars • Advanced Visual Interfaces ‘98 • Non-uniform data • Semi-automated construction • Goal-directed zoom
Density width barsMotivation This visualization seems to have appropriate detail...
Density width barsMotivation, cont. …but the same visualization is cluttered at a higher elevation.
Density width barsApproach • Cartographic Principle of Constant Information Density (derived from Töpfer ’66): the number of objects/area should be constant at any scale • More generally, the amount of information should remain constant as the user pans and zooms • Visual Information Density Adjuster (VIDA) gives users feedback about the density of their applications
Density width barsDensity functions • Density functions • Input: region in canvas • Output: measurement of density • System-provided functions • Number of objects, number of vertices • User-defined density functions • There are many complex density functions • The number of edges between nodes • The number of text objects that overlap • ...
Density width barsVisual feedback about density • Width bars • The width of each layer bar at a given elevation is proportional to the layer’s average density at that same elevation • Tick marks • Colored to show cumulative density (red is too dense) • User slides the bars around, trying to maximize the number of green tick marks
Outline • Data lineage • Buffering of intermediate results • Semantic zoom in DataSplash • Density width bars • Non-uniform data • In preparation • Semi-automated construction • Goal-directed zoom
Non-uniform dataMotivation • Width bars ensure uniformity in the z dimension • Many data sets are non-uniform in the x and y dimensions
Non-uniform dataApproach • Extend the Principle of Constant Information Density to subdivisions of the canvas • Break the screen into a grid • For each subdivision of the screen, choose a valid combination of layers with appropriate density • Allow user to specify constraints on combinations of layers • MUTUALLY EXCLUSIVE (city can be a dot or a circle) • ADDITIVE (states outlines can appear alone or with city circles)
Population visualization Before After
Housing cost/Income visualization Before After
Outline • Data lineage • Buffering of intermediate results • Semantic zoom in DataSplash • Density width bars • Non-uniform data • Semi-automated construction • Advanced Visual Interfaces ‘98 • Goal-directed zoom
Semi-automated constructionMotivation • Previously, we considered using the Principle of Constant Information Density to decide when to display layers • We could also try to decide the contents of layers
Semi-automated construction Operations to decrease density Original Select Aggregate Reclassify Change shape Reduce size Remove attribute association Change color
Semi-automated constructionTransformation space • These operations can generate a huge number of options (“transformations”) • We would like to present transformations visually to the user
Semi-automated constructionPortals • Windows onto other canvases • Graphical links
Semi-automated construction The transformation canvas • Transformations presented to user in a “transformation” canvas • Each transformation appears as a portal • When the user zooms, the visualizations all change • How should user navigate the transformation space?
Outline • Data lineage • Buffering of intermediate results • Semantic zoom in DataSplash • Density width bars • Non-uniform data • Semi-automated construction • Goal-directed zoom • CHI ‘98
Goal-directed zoomMotivation • Naïve zoom • Independent control of zoom and representation • Semantic zoom • Elevation implies representation • Goal-directed zoom • Representation implies elevation
Goal-directed zoomFortune 500 visualization • Displayed using VIDA’s technique for non-uniform data • Outliers have more detailed representation # employees % profit
Goal-directed zoomExample When user selects an object, a menu of representations appears When user selects an item from the menu, VIDA pans and zooms until that representation appears at appropriate visual detail