1 / 58

Software Visualization Space Filling Approach & Semantic Zooming

This article discusses the use of a space-filling approach in software visualization, specifically using treemaps to display hierarchical data. It covers topics such as subsystem information, directory information, error-prone code, recurring problems, and system evolution. The SeeSys system is introduced as a tool for implementing this visualization technique.

cosborn
Download Presentation

Software Visualization Space Filling Approach & Semantic Zooming

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Software VisualizationSpace Filling Approach & Semantic Zooming Siva Venkatachalam 03/23/2004

  2. Articles covered • Marla J. Baker, Stephen G. Eick, Space-Filling Software Visualization, Journal of Visual Languages and Computing, 6(2), 1995, p119-133. • K.L. Summers, T.E.Goldsmith, S.Kubica, T.P.Caudell, An Experimental Evaluation of Continuous Semantic Zooming in Program Visualization, IEEE Symposium on Information Visualization, 2003.

  3. Software Visualization • Software visualization is the use of computer graphics and animation to help illustrate and present computer programs, processes, and algorithms.

  4. Space filling approach • The space filling approach used in this paper is based on the generalized idea of treemaps developed by Shneirderman, which aids in showing hierarchical data. • The hierarchy for a software system is Software system  subsystems  directories  files.

  5. Motivation The main motivation for such a system is to gather information on: • Subsystem information • Directory information • Error-prone code • Recurring problems • System evolution

  6. Typical software system information • Non-commentary source lines (NCSL) • Software complexity metrics • Number and scope of modifications • Number of programmers making modifications • Number and type of bugs

  7. Approach • The approach adopted in this paper is very similar to that of a treemap approach. • A rectangular space is partitioned into subsystems depending on its total NCSL. • A subsystem is in-turn divided into directories, which in-turn contain files. • The software visualization technique developed is implemented in a system called SeeSys.

  8. Space Filling Approach • Three subsystems X, Y, Z • X subsystem has 5 directories • Filling may indicate the amount of new NCSL • In terms of %, directory 5 has the largest change in NCSL. • Subsystem Y with its internal directories and files • The leftmost directory contains 5 files and the shading may be indicative of new NCSL.

  9. 1. Subsystem information • Questions: • Which subsystems are the largest? • Where is the new development activity? • Procedure: • The outermost rectangle represents the total size of the system. • The individual rectangles denote the size of the subsystems in NCSL. • Color coding is used to encode the size of the individual subsystems.

  10. Subsystem Information Three of the largest subsystems (visually) Three subsystems with the most development activity NCSL for t during various releases Color codes

  11. 2. Directory information • Questions: • Where are the large directories? • Is there even distribution of • Large and small directories • New development between directories • Which directories are stable? • Which directories have the most activity?

  12. Back Directory level detail Almost no development Subsystems n and D have no large directories Subsystems e and Z have the largest directories Max development

  13. Zoomed view of subsystem t

  14. 3. Errors in code • Questions: • Which subsystems and directories have the most errors (bugs)? • How much of the development activity is apportioned to. • Fixing bugs. • Adding new functionality.

  15. Bug rates by subsystem and directory Max bug fix rate The size of the subsystems are based on the new NCSL during the development of the system Most development activity Lesser bug fix rate compared to new development

  16. 4. Recurring error problems • Questions: • Are the bugs really fixed or are they a recurring problem? • Are there any components that would need complete restructuring or reconstruction? • Fix-on-fix rate • A fix-on-fix bug is a software bug correction that modifies an earlier bug fix.

  17. Fix-on-fix rates The area representing each subsystem and directory is proportional to the number of bugs. The fill area represents the fix-on-fix rates. Subsystems i and K have high fix-on-fix rates

  18. 5. System Evolution • Questions: • Which were the major software releases? • Have any subsystems shrunk or disappeared during releases? • What is the rate of growth for subsystems and directories? • Where has the development work been done historically?

  19. System Evolution • SeeSys animates the display over the evolution the code. • The bounding rectangle represents the maximum size of the subsystem across all releases. The filled portion is the amount of development during that particular release. • A set of frame sliders control the display frame.

  20. Code growth animation – frame 1 Frame sliders

  21. Code growth animation – frame 2

  22. Code growth animation – frame 3

  23. Evolution changes • Subsystem O has disappeared. • Subsystems F and J have shrunk. • Subsystem D has a slow growth at first, but a faster growth during later releases. • Subsystems t, k and Z are the fastest growing subsystems.

  24. Link User interaction • SeeSys uses mouse movements to interact with the system. The active component at any particular time is indicated by a red highlighted boundary. • The available statistics are displayed on the lower left corner. • There are five buttons that can be used to change the appearance on the screen. • There are sliders that control the number of rows to be displayed and the animation.

  25. Display principles • The visualization principles used in SeeSys • The individual components can be assembled to form the whole. • Pairs of components can be compared to see how they differ. • The components can be disassembled into smaller components namely subsystems and directories.

  26. SeeSys properties • Screen real estate • The total screen space (100%) is utilized as the rectangles are placed next to each other. • To view the smaller components in greater detail, the zoom feature can be used. • Spatial Relationship • Example: Comparing the new development activity by subsystem involves making all subsystems of the same size. • Color • Color could be used to encode various attributes like NCSL, age, complexity, activity, number of programmers etc of the software system.

  27. New development by subsystem

  28. Continuous Semantic Zooming An Experimental Evaluation

  29. Zooming • Geometric Zooming • This kind of zooming simply provides a blowup of graph content. (E.g. Zooming a picture) • Semantic Zooming • This kind of zooming means that the information content changes and more details are shown when approaching a particular area of the graph. (E.g. plots in Matlab) • Issues • Providing the appropriate level of detail is a challenge.

  30. Examples of Visualization tools Flat representation of all the data items on the available screen space

  31. Pan and Zoom

  32. Overview + detail

  33. Focus + context

  34. Objective • Study the effect of viewing visual programs on the users’ understanding. • Comparison of three methods • Flat zooming • Semantic zooming • Continuous semantic zooming

  35. Semantic Zooming • Details in the zoom area become more distinct. • Changes in the representation as the result of the zoom. • Example: Procedural programming • Drawback: • When viewing the top-level program, the contents of the procedures are hidden and vice-versa.

  36. Programs without procedures

  37. Encapsulated procedure and details

  38. Continuous Semantic Zooming (CSZ) • CSZ uses the concept of semantic zooming with blending and proximity. • This allows the user to view both the procedure details and the details of the higher levels (focus + context). • The smooth transition between levels helps maintain the mental map of the representations.

  39. Continuous Semantic Zooming

  40. Zooming with top-level details hidden

  41. Evaluating CSZ • The evaluation of the CSZ method is done by comparing it with flat and semantic zooming. • Flat representation contains no procedures. • Semantic representations have procedures.

  42. Zooming Hierarchical view with procedures Flat representation with all procedures exploded

  43. Evaluation Study • The evaluation consisted of two pilot studies and one main study. • Metrics • Time to find the elements. • Accuracy • Pilot studies were mainly done to calibrate the metrics. • Compares only the flat and semantic representations.

  44. Example program • Available polygons • Lines, triangles, rectangles & ovals. • The polygon’s relationship between one another is determined by the connections between program elements. • SGPL – Simple Graphics Programming Language.

  45. Pilot Study 1 – Program output

  46. Pilot study 1 - Flat representation

  47. Pilot study 1 - Hierarchical representation

  48. Pilot Study 2 - Program output

  49. Pilot studies - Observations • The pilot studies were conducted to corroborate that the measures were sensitive enough to detect differences in time and accuracy. • Observations • Pattern matching was used for simple polygon identification. • Remedies • Add more similar features • Add complexity

  50. Main study • Comparisons between flat, SZ and CSZ representations. • 60 subjects, 20 per method • 14 tasks • The picture contained 164 polygons and the corresponding program contained around 200 elements.

More Related