1 / 61

Concept Switching in the Interspace: Networking Infrastructure for Community Knowledge

Concept Switching in the Interspace: Networking Infrastructure for Community Knowledge. Bruce Schatz CANIS Laboratory Graduate School of Library and Information Science University of Illinois at Urbana-Champaign Graduate School of Informatics, Kyoto University

dominique
Download Presentation

Concept Switching in the Interspace: Networking Infrastructure for Community Knowledge

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Concept Switching in the Interspace:Networking Infrastructure for Community Knowledge Bruce Schatz CANIS LaboratoryGraduate School of Library and Information ScienceUniversity of Illinois at Urbana-Champaign Graduate School of Informatics, Kyoto University schatz@kuis.kyoto-u.ac.jp, www.canis.uiuc.edu IEEE Knowledge Media Networking KMN’02 Keynote Address, CRL, Kyoto Japan, July 11, 2002

  2. THE THIRD WAVE OF NET EVOLUTION CONCEPTS OBJECTS PACKETS

  3. CONCEPT SPACES • from Objects to Concepts • from Syntax to Semantics • Infrastructure is Interaction with Abstraction Internet is packet transmission across computers Interspace is concept navigation across repositories

  4. Technology Engineering FORMAL (manual) Electrical IEEE communities INFORMAL groups (automatic) individuals LEVELS OF INDEXES

  5. THE DISTRIBUTED WORLD • Community Repositories in the Interspace • Peer to Peer Networking Infrastructure • Every Person performs Every Role USER request LIBRARIAN reference INDEXER classify PUBLISHER quality AUTHOR generate

  6. Meta Data How to Represent the Community Knowledge Automatic and Interactive Representation Techniques for Capturing the Fundamental Structure

  7. Meta Maps How to Locate the Community Knowledge Automatic and Interactive Location Techniques for Capturing the Fundamental Landscape

  8. CONCEPTS ACROSS THE INTERSPACE

  9. SCALABLE SEMANTICS • Automatic indexing • Domain-Independent indexing • Statistical clustering • Compute Context of • concepts within documents • documents within repositories

  10. CROSS-OVERS IN SEMANTIC INDEXING

  11. COMPUTING CONCEPTS ‘92: 4,000 (molecular biology) ‘93: 40,000 (molecular biology) ‘95: 400,000 (electrical engineering) ‘96: 4,000,000 (engineering) ‘98: 40,000,000 (medicine)

  12. SIMULATING A NEW WORLD • Obtain discipline-scale collection • MEDLINE from NLM, 10M bibliographic abstracts • human classification: Medical Subject Headings • Partition discipline into Community Repositories • 4 core terms per abstract for MeSH classification • 32K nodes with core terms (classification tree) • Community is all abstracts classified by core term • 40M abstracts containing 280M concepts • concept spaces took 2 days on NCSA Origin 2000 • Simulating World of Medical Communities • 10K repositories with > 1K abstracts (1K w/ > 10K)

  13. COMMUNITY PROCESSING

  14. Semantic Indexing • Extracting Concepts (AI) • Canonical noun phrases • Generic statistical parser • Computing Context (IR) • Co-occurrence frequency, in collection • Useful interactively, not strict ordering

  15. System Side Infrastructure Classification Technologies for Multimedia Documents • Phrases (multi-word nouns) • Concepts (generic phrases) • Types (identified concepts) • Clusters (grouped types) • Structures (semantic universals)

  16. INTERSPACE NAVIGATION • Semantic Indexes for Community Repositories • Navigating Abstractions within Repository • concept space & category map • Interactive browsing by Community experts *www.canis.uiuc.edu/interspace-prototype

  17. Interspace Remote Access Client

  18. Navigation in MEDSPACE For a patient with Rheumatoid Arthritis • Find a drug that reduces the pain (analgesic) • but does not cause stomach (gastrointestinal) bleeding Choose Domain

  19. Concept Search

  20. Concept Navigation

  21. Retrieve Document

  22. Navigate Document

  23. Retrieve Document

  24. Category Map

  25. Category Navigation

  26. Concept Navigation

  27. User Side Infrastructure Navigation Technologies for Search Interfaces • Exact Match (noun phrases) • Relationship List (concept suggestions) • Cluster Comparison (groups to groups) • Spreading Activation (group intersections) • Artificial Landscapes (semantic distances)

  28. SWITCHING In the Interspace… • each Community maintains its own repository • Switching is navigating Across repositories • use your vocabulary to search another specialty

  29. Medicine Session

  30. Categories and Concepts

  31. Concept Switching

  32. Document Retrieval

  33. Semantic region term Concept Space Concept Space CONCEPT SWITCHING • “Concept” versus “Term” • set of “semantically” equivalent terms • Concept switching • region to region (set to set) match

  34. ENGINEERING SESSION

  35. Engineering Categories & Concepts

  36. Further Concept Navigation

  37. Searching via Concept Suggestion

  38. Switching Across Repositories

  39. Future Technologies • Concept Switching • Spreading activation, type tagging • Dynamic Indexing • On-the-fly collections, during session • Path Matching • Aggregating indexes, many repositories

  40. Semantic Analysis of Multimedia • Collections of Objects containing Units • Text: community repository (topic proximity) document abstracts containing noun phrases • Image: aerial photograph (spatial proximity) feature regions containing texture tiles • Units -- media-dependent (statistical parsers) • Indexes -- media-independent (statistical clusters)

  41. Media Interoperability Model • text concept space & category map (geoscience) • 1M phrases in 500K abstracts from Georef and Petroleum Abstracts • image concept & category maps in aerial photos • visual thesaurus maps for 200K regions in 800 images (6M tiles) • geographic map (where) v. semantic map (what) • spatial gazetteer as bridge image<=>text<=>number

  42. Text and Number Interoperability Integrated Result: Within the bounding geography location, 2 documents and 88 AVHRR records related to the integrated query are retrieved. Text and AVHRR Query: Show me information about Santa Barbara area with mild temperature and high vegetation density.

  43. Image Concept Switching Image Query: By browsing a texture (tile) catalog, show me information about residential and farm land areas. Result: A set of related images are retrieved and shown in the Results Frame. The full-size image #368 is displayed with its place names and tile locations.

  44. INFORMATION SPACEFLIGHT • Landscape as category map visualization • Valleys are semantic clusters • Hills are semantic distances • Traversal across multiple levels of abstraction

  45. Category Maps

  46. SELF-ORGANIZING MAPS (SOMs)

  47. INFORMATION SPACEFLIGHT

  48. INFORMATION SPACEFLIGHT Flying through Cyberspace

  49. THE NET OF THE 21st CENTURY • Beyond Objects to Concepts • Beyond Search to Analysis • Problem Solving via Cross-Correlating Multimedia Information across the Net • Every community has its own special library • Every community does semantic indexing • The Interspace is true Cyberspace

More Related