1 / 13

Visualizing textual data

Visualizing textual data. CPSC 601.28 A. Butt / Feb. 26 '09. Overview. Project implications Summarize "Tilebars" Hearst / PARC (Xerox) Summarize "Visualizing the Non-Visual" Wise et al / Pacific Northwest Lab (Battelle) Key Issues Summary References. Project Implications.

shiloh
Download Presentation

Visualizing textual data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Visualizing textual data CPSC 601.28 A. Butt / Feb. 26 '09

  2. Overview • Project implications • Summarize "Tilebars" • Hearst / PARC (Xerox) • Summarize "Visualizing the Non-Visual" • Wise et al / Pacific Northwest Lab (Battelle) • Key Issues • Summary • References

  3. Project Implications • Research area is partly based on text-based environmental reports • textual reporting feeds into textual (quasi-judicial) regulatory framework • rooms of binders (e.g. >20,000 pages for Mackenzie Pipeline Project) • Vocabulary specialized / semantically complete • "no significant adverse environmental impacts"

  4. TileBars • goals are to simultaneously view: • length of a document • relative frequency of specific words • distribution of words with respect to each other • benefits include: • enhanced relevancy of search response • patterns of frequency by document / author • compactness of information

  5. Tilebars • Visual representation via • rectangular block: size equates to document length • three bars within the block: each corresponds to a query • in each bar tiles indicate location, saturation of tile indicates frequency • 5 articles, 3 search queries • 1st, 2nd, 5th appear compact / relevant • 1st and 2nd appear to have better concurrency • 3rd and 4th potentially less relevant, greater time investment to read

  6. Visualizing the Non-Visual • goals are to: • overcome time constraints in processing textual information • overcome attention constraints; avoid becoming overwhelmed by volume of textual information • benefits include: • escape limitations of traditional text • increase throughput and comprehension of information processing • feedback on text structure to enhance visualization

  7. Visualizing the Non-Visual • Employ a "natural landscape" metaphor • leverage evolutionary psychological adaptations via natural landscapes for representation • galaxy or star-fields ("night sky") • themescapes ("cartographic" or "landscape") • although statistical measures used for clustering, they are not used as directly as in tile bars • self-organizing maps

  8. Galaxies • PNL software development (DOE) • Display is a review of cancer literature • Branched to SPIRE / In-SPIRE for government documents

  9. Themescapes • PNL software development (DOE) • Branched to SPIRE / In-SPIRE for government documents (renamed "Themeview") • Branched into NVAC (National Visual and Analytics Centre) - part of the Homeland Security infrastructure

  10. Themescapes (2.0?) • Branched progeny of themescapes • Used in searching IP / Patents • Subscription service • Failed metaphors??

  11. Key Issues • Vocabulary / semantics - how do you interpret meaning from text statistics? • earlier failures of natural language processing • contingent semantics • Employing metaphors (Zhang 2008) • rely on unusual linkages (versus analogy) to highlight • degree of "unusual-ness" is critical: too much or too little leads to confusion

  12. Summary www.wordle.net

  13. References Marti A. Hearst: TileBars: Visualization of Term Distribution Information in Full Text Information Access. CHI 1995: 59-66 James A. Wise and James J. Thomas and Kelly Pennock and David Lantrip and Marc Pottier and Anne Schur and Vern Crow. Visualizing the Non-Visual: Spatial Analysis and Interaction with Information from Text Documents. Proc. IEEE Symp. Information Visualization, InfoVis, pp. 51-58, IEEE Computer Soc. Press, 30-31, October 1995. (in text pages 442-450) Jin Zhang. The Implication of Metaphors in Information Retrieval. Visualization in Information Retrieval, Elsevier, 2008. (pages 215-237)

More Related