170 likes | 263 Views
First Insights into the Library Track of the OAEI. Dominique Ritze Mannheim University Library. Motivation. Ontology Mapping. Publication x. Search. 0 results. subject (thesaurus 2): ontology alignment. Thesaurus 1. Thesaurus 2. Ontology Mapping. =. Ontology Alignment.
E N D
First Insights into the Library Track of the OAEI Dominique Ritze Mannheim University Library
Motivation Ontology Mapping Publication x Search 0 results subject (thesaurus 2): ontology alignment Thesaurus 1 Thesaurus 2 Ontology Mapping = Ontology Alignment Ontology Mapping Publication x Search subject (thesaurus 1): ontology alignment
Overview • Ontology Matching • OAEI • Thesaurus vs. Ontology • OAEI Library Track 2012 • Lessons learned and Future Work
Ontology Matching Person People Author Author < Author, Author, =, 0.97 > < Paper, Paper, =, 0.94 > < reviews, reviews, =, 0.91 > < writes, writes, =, 0.7 > < Person, People, =, 0.8 > < Document, Doc, =, 0.7 > < Reviewer, Review, =, 0.6 >… CommitteeMember writes Reviewer PCMember reviews Doc reviews Document Paper Paper writes Review
Ontology Matching Evaluation O1 Tool A Test O2 m R Result
Ontology Alignment Evaluation Initiative (OAEI) • Annual campaign started 2005 • Different tracks/datasets • Benchmark, Anatomy, Conference, Multifarm, Large BioMed, Library, Instance Matching • 21 submitted systems (2012) • Goal: Improving the performances of the ontology matching field • Through comparison of algorithms • New challenges for the systems
Thesaurus = Ontology? Germany Commodities Tropical Fruit Ananas Metal Product -> Metal
OAEI Library Track Are current state-of-the-art ontology matching tools able to match thesauri? Dominique Ritze, Kai Eckert, Benjamin Zapilko, Joachim Neubert
Data Set • Thesaurus for economics (STW) • 6.000 concepts with 19.000 additional keywords (EN, DE) • Thesaurus for the Sociel Sciences (TheSoz) • 8.000 concepts with 4.000 additional keywords (EN, DE, FR) • Reference alignment manually created in 2006 • Both actively used in libraries for keyword indexing
Execution • 7GB Debian machine • Timeframe 1 week • 13 of the 21 submitted systems were able to generate an alignment • No system had a heap space problem • Evaluation: Precision, Recall, F-Measure, Runtime
Results How to evaluate the results? F-Measure of 0.67 good?
Manual Evaluation • Between 38 and 269 new correct correspondences found per matcher • Up to half of the correspondences correct • Many new correspondences are quite simple • Some more “complex” and interesting ones • Automated production = CAM • Several incorrect ones if the labels are quite similar • Difficult to distinguish the names of countries, their inhabitants and the languages
Lessons Learned • Transformation SKOS to OWL causes some problems, especially regarding the labels • Ontology matching systems are nevertheless able to match the thesauri and even discover unknown correct correspondences • Interest of the community in this topic
Future Work • Update reference alignment adapted results • SKOS import for matching systems • Use instance data to match thesauri? • Other thesauri?
Thankyouforyourattention! dominique.ritze@bib.uni-mannheim.de