1 / 32

Semi-Automatic Ontology Alignment for Geospatial Data Integration*

Semi-Automatic Ontology Alignment for Geospatial Data Integration*. Isabel F. Cruz William G. Sunna Anjli Chaudhry Department of Computer Science University of Illinois at Chicago.

tracyturner
Download Presentation

Semi-Automatic Ontology Alignment for Geospatial Data Integration*

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Semi-Automatic Ontology Alignment for Geospatial Data Integration* Isabel F. Cruz William G. Sunna Anjli Chaudhry Department of Computer Science University of Illinois at Chicago *This research was supported in part by the National Science Foundation under Awards EIA-0091489 and ITR IIS-0326284.

  2. Overview • Motivation: application, data heterogeneity, data integration • Alignment • Mapping types • User interface • Semi-automatic alignment • Propagation rules • User interface • Ontology merging • Challenges and discussion • Conclusions and future work

  3. Application • WLIS (Wisconsin LandInformation System): web-based system linking data from distributed, heterogeneous data sources • Case study: land use codes • Sample query: “Find all the agricultural lands in Dane and Racine counties.” • Different authorities use different land use coding systems leading to syntactic, schematic, and semantic heterogeneities

  4. Heterogeneity “Find all the agricultural lands in Dane and Racine counties.” Parcel-based example Each highlighted parcel has its own land use classification code

  5. Land Use Code Heterogeneity in WLIS Land Use Code Land Use Code Land Use Code Land Use Code There are 72 counties and hundreds of cities and towns in the state; each may have their own system of classifying Land Use codes

  6. Racine County Dane County Commercial Commercial Retail Sales and Services Retail Sales Retail Services Land Under Development Intensive Nonintensive Classification: Semantic Issue

  7. Land Use Codes

  8. Land Use Codes Synonyms

  9. Land Use Codes Synonyms Value heterogeneity

  10. Ontology-Based Data Integration Application Query Ontology does not change (Local as View approach) Mediator Ontology Wrapper Wrapper Wrapper Local Ontology Local Ontology Local Ontology Source Source Source [Fonseca & Egenhofer 99]

  11. Ontology-Based Data Integration Application Query Concepts in the Ontology and Local Ontology need to be aligned Mediator Ontology Wrapper Wrapper Wrapper Local Ontology Local Ontology Local Ontology Source Source Source [Fonseca & Egenhofer 99]

  12. Agreement Document • XML document that act as a wrapper layer for the underlying local data source • Stores information about how entities in the global ontology map to the entities in the local data source • Uses XML to capture the hierarchical ordering of entities and their mappings • Supports query operations using XPath/XSLT to hide details of how data is structured in local data source • Minimizes need for programmer intervention and maintenance as it is declaratively specified

  13. Ontology Alignment • Alignment is the process of mapping concepts from one ontology to concepts of another ontology • Concepts are mapped based on how “similar” they are to each other • Similarity takes different shapes: • Similarity in definition • For example,automobile and car have very similar definition in any given dictionary • Similarity in text • For example: agriculture and agricultural have the same prefix and have 4 letters in common

  14. Mapping types • Exact: the connected vertices equivalent in meaning • Subset: the vertex in the global ontology is a subset of the vertex in the local ontology, i.e. less general in meaning. • Superset: the vertex in the global ontology is a superset of the vertex in the local ontology, i.e. more general in meaning • Approximate: the connected vertices are close in meaning (e.g., they intersect in some properties) but are not equivalent in definition. • Null: the vertex in the global ontology does not have an equivalent vertex in definition in the local ontology

  15. Mapping Types Exact Industry Industry Exact Mining Manufacturing Production Mining Exact Rubber Construction Material Electrical Supplies Rubber and Glass Superset Subset

  16. Agreement Maker • Visual interface for creating agreements • Existing mappings displayed to the user • Displayed list of mappings updated as user identifies more mappings

  17. User Interface

  18. Semi-automatic Alignment • Framework that defines the values associated with the vertices of the ontology as functions of the: • values of the children vertices, or • user input • User (or system) establishes some mapping types • System propagates the mapping types along the ontologies (bottom-up) as much as possible

  19. Full vs. Partial Mappings Superset a d b e c f Exact Superset Full Mapping

  20. Full vs. Partial Mappings Subset a d b e c f g Exact Exact Partial Mapping

  21. Propagation Rules

  22. Deduction Results Interface

  23. Ontology Merging • Alignment can be used to: • Determine the need for merging ontologies • Alignment values of Null in the mappings of the global ontology to a local ontology, thus leading to loss of resolution in the query process • Guide the merging process • Combining the local ontologies with the global ontology

  24. Ontology Merging Approximate Commerce Commercial Sector User-defined Sales Services Non- Intensive Intensive NULL NULL Commerce Commercial sector Non- Intensive Intensive Function Scale Exact Exact Sales Services Non- Intensive Intensive NULL NULL

  25. Ontology Merging Approximate Commerce Commercial Sector User-defined Sales Services Non- Intensive Intensive NULL NULL Commerce Commercial sector NULL Non- Intensive Intensive Function Scale User-defined Exact Exact Sales Services Non- Intensive Intensive NULL NULL

  26. Ontology Merging Approximate Commerce Commercial Sector User-defined Sales Services Non- Intensive Intensive NULL NULL Superset Commerce Commercial sector System-defined NULL Non- Intensive Intensive Function Scale User-defined Exact Exact Sales Services Non- Intensive Intensive NULL NULL

  27. Challenges in Semi-Automatic Alignment Superset User-defined Local Global Subset Residential Buildings Apartment Buildings System-defined One-family Residence One-family Residence Two-family Residence Multi-family Residence Exact Subset

  28. Challenges in Semi-Automatic Alignment Superset • Specialization of the node Residential Buildingsin the global ontology is not total thus causing inconsistency • Solution: • Introduction of other children • Introduction of a virtual node User-defined Local Global Subset Residential Buildings Apartment Buildings System-defined One-family Residence One-family Residence Two-family Residence Multi-family Residence Exact Subset

  29. Monotonic Mappings A F Superset Superset Subset Exact B G Subset Superset Superset Subset Superset Exact C H Superset Superset Superset Superset Superset D I E J

  30. Discussion Local Global ??? Residential Buildings Superset Subset Apartment Buildings Apartment Buildings Virtual node One-family Residence One-family Residence Two-family Residence Multi-family Residence Exact Subset

  31. Conclusions • Real-world GIS problems • Semantic heterogeneities among non-spatial data • Ontologies are hierarchies of concepts, with “atomic nodes” • Data integration • Ontology-based • Alignment: mappings between ontologies are declaratively specified • Semi-automatic alignment • Automatic propagation of mappings along the ontologies, following well-defined rules • Building block for a more comprehensive approach that will integrate other approaches, e.g., similarity • Identified some challenges to be addressed

  32. Future Work • Explore layers of similarities between concepts and define new mapping types • Assign similarity scores that measures how close concepts match with each other • Use multiple dictionaries to further automate the mapping process by establishing mappings between concepts that are similar in meaning • Upgrade agreement maker to incorporate new ways of alignment process automation • Test with “large” ontologies: • Wetland ontologies • Rock classification systems • Integration with querying • Data integration across several themes

More Related