300 likes | 402 Views
The Search for Meaning and Semantics: Taxonomies Get It Done. Joseph Busch – Why Semantics Matter . Agenda. Why semantics matter (… a quick review from 2001) What is semantic search, SKOS and Linked Data? Some semantic search examples?. Why Semantics Matter May 20, 2001.
E N D
The Search for Meaning and Semantics: Taxonomies Get It Done Joseph Busch – Why Semantics Matter
Agenda • Why semantics matter (… a quick review from 2001) • What is semantic search, SKOS and Linked Data? • Some semantic search examples?
Why Semantics Matter May 20, 2001
When you own a Rembrandt you can spell his name any way you want.
But when you want to find a Rembrandt … you better spell his name correctly.
Vocabulary resources can help find the right artist even if their name is typed incorrectly.
Users cannot type in the complex queries needed to find all the relevant items... But this can be done automatically.
Complex queries are even more important when you search the entire web.
Agenda • Why semantics matter • What is semantic search, SKOS and Linked Data? • Some semantic search examples?
19% 21% 40% 20% Search Failure • 19% Character errors. (Young, et al) • 40%Vocabulary errors. (Seaman. Norgard, et al) • 20% Index confusion. • 21% Successful (Nielsen)
Semantic search solution • Semantic search improves search accuracy by inferring the contextual meaning of terms via: • Disambiguation • Part of speech (POS) analysis • Synonyms, variations and quasi-synonyms • Concept matching • Natural language query analysis • Key sentence detection • Generate more consistent content to search on. • Correct user errors. • Map the language of users to the language of the target content. • Augment search results with linked data.
What is SKOS? • Provides the basis for any user, tool, or program to identify, define and link concept vocabularies.
CONCEPT Fringe parking prefLabel lc:sh85052028 Park and ride systems altLabel altLabel altLabel prefLabel Park-n- ride altLabel Park & ride trt:Brddf Park and ride altLabel altLabel broader P&R system trt:Brdd prefLabel Parking
Why SKOS? According to Alistair Miles* (SKOS co-author) • Ease of combination with other standards • Vocabularies are used in great variety of contexts. • E.g., databases, faceted navigation, website browsing, linked open data, spellcheckers, etc. • Vocabularies are re-used in combination with other vocabularies. • E.g., Library of Congress Subject Headings + Transportation Research Thesaurus; USPS states + USPS zip codes + US Congressional districts; etc. • Flexibility and extensibility to cope with variations in structure and style • Variations between types of vocabularies • E.g., list vs. classification scheme • Variations within types of vocabularies • E.g., Z39.19-2005 monolingual controlled vocabularies and the Transportation Research Thesaurus * Head of Epidemiological Informatics at Oxford University Wellcome Trust Centre for Human Genetics (formerly OUP Senior Computing Officer)
Why SKOS? (2) • Publish managed vocabularies so they can readily be consumed by applications • Identify the concepts • What are the named entities? • Describe the relationships • Labels, definitions and other properties • Publish the data • Convert data structure to standard format • Put files on an http server (or load statements into an RDF server) • Ease of integration with external applications • Use web services to use or link to a published concept, or to one or more entire vocabularies. • E.g., Google maps API, NY Times article search API, Linked open data; etc. • A W3C standardlike HTML, CSS, XML and RDF, RDFS, and OWL.
Agenda • Why semantics matter • What is semantic search, SKOS and Linked Data? • Some semantic search examples?
Oracle.com top-level taxonomy Person Organization Location Content Type Audience Products Has a Product Line Is a Technology Application Industry Solution
Oracle event finderhttp://events.oracle.com/ Filter on Location and Language More filters based on this result Subscribe to RSS feed based on the criteria set on this page Results shown on Google maps UI
Linked data example Elements of the periodic table, and common isotopes A faceted taxonomy of concepts in physics APS Taxonomy Broad Subject Areas Methods & Theories Phenomena Physical Systems Elements by Group Group 1 Group 2 Group 3 Group 4 Group 5 Group 6 Group 7 Group 8 Group 9 Group 10 Group 11 Group 12 Group 13 Group 14 Group 15 Group 16 Group 17 Group 18 Astronomical systems Atomic-scale objects Beams Complex systems Dynamical systems Electric & magnetic fields Engineered materials Fundamental particles Gases delete Information systems Liquids delete Materials Nonlinear system Nuclei Plasma Quasiparticles Cadmium Copernicium Mercury Zinc Materials by Composition Materials by Dimensionality Materials by Property Materials by Structure 194Hg 196Hg 198Hg 199Hg 200Hg 201Hg 202Hg 204Hg
Joseph A Busch Mobile 415-377-7912 jbusch@taxonomystrategies.com Questions
Session description • Semantic search – a phrase that is increasingly used in the popular as well as the professional literature. What does it look like, and how will it work. Panelists will present their visions of semantic search. Program is designed to be interactive with audience participation – suggestions for functions and features they see in the future. • What is semantic search? • What are the components of semantic search? • How can it be used in libraries?