730 likes | 901 Views
The Semantic Web an introduction. Luigi De Russis. T he Web is a Web of document for people, not for machines. The Semantic Web is a Web of data. Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch . http://lod-cloud.net/. Let’s think!.
E N D
The Semantic Web an introduction Luigi De Russis
The Web is a Web of document for people, not for machines
The Semantic Web is a Web of data Linking Open Data cloud diagram, by Richard Cyganiak and AnjaJentzsch. http://lod-cloud.net/
Exercise: Build a Music Catalog Comprehensiveguide to music across the world Web-based With always-updated information about each artist
How? What are the problems? What about data replication? What about data synchronization? …?
Solution #1 How? Site editors roam the Web for new facts and update the site manually What are the problems? A lot of people need to continuously roam the Web; the site will get soon out-of-date What about data? Data is replicated and not up-to-date with new facts
Solution #2 How? Site editors roam the Web for new data and write a program to extract the information What are the problems? Code needs to be updated each time a new site is found; the site will get out-of-date, soon or later… What about data? Data is replicated and not up-to-date
Solution #3 How? Site editors browse the Web for new data via APIs, and write some code to incorporate the information What are the problems? Code needs to be updated each time a new site is found and/or an API is changed; the site will get out-of-date, soon or later… What about data? Data is replicated and not up-to-date
Solution #4 How? Site editors choose to use some external, public datasets (e.g., Wikipedia, MusicBrainz, …) What are the problems? No problem What about data? Data is immediately available, not as APIs or hidden on a Web site. Information can be extracted using standard queries or HTTP requests.
Solution #4 In short… Use the Web of Data as a Content Management System Use the community at large as content editor An example: BBC Music http://www.bbc.co.uk/music
Data on the Web is not enough! we need a proper infrastructure Data should be available on the Web accessible via standard Web technologies Data should be interlinked over the Web i.e., data can be integrated over the Web This is where Semantic Web come in
To a computer, the Web is a flat, boring world, devoid of meaning. This is a pity, as in fact documents on the Web describe real objects and imaginary concepts. […] Adding semantics to the Web involves two things: allowing documents which have information in machine-readable forms, and allowing links to be created with relationship values. Only when we have this extra level of semantics we will be able to use computer power to help us exploit the information to a greater extent than our own reading. Tim Berners-Lee, 1994
What is the relationship with AI? Influence Some technologies in the Semantic Web benefited a lot from AI research and development (and viceversa) Different goals Artificial Intelligence approach: build smarter machines, teach computers to infer the meaning of data Semantic Web approach: have smarter data, make data easier for machines to find, access and process
Resource and description Resource every document “reachable” on the Web no matter the content, format, language, etc. Resource description independent from the format standard language (metadata)
Resource and description Resources
Resource and description Description
Resource and description Title Author Date Topic Quality Description Title Author Date Topic
URIs unambiguousnames for resources RDF a common data model to connectand describe resources SPARQL access to the data model RDFS, OWL common description languages OWL, RIF reasoning (mainly logic inference)
Example: bookstore Represent the following data about the AI book as a set of relations Title: “Artificial Intelligence: A Modern Approach” Author:Russel, Stuart and Norvig, Peter Publisher: Prentice Hall ISBN: 978-0136042594
Example: bookstore http://...isbn/9780136042594 Resource
Example: bookstore http://...isbn/9780136042594 Artificial Intelligence: A Modern Approach Literal
Example: bookstore http://...isbn/9780136042594 title Artificial Intelligence: A Modern Approach
Example: bookstore http://...isbn/9780136042594 title Artificial Intelligence: A Modern Approach publisher author Prentice Hall author Norvig, Peter Russel, Stuart
RDF: Resource Description Framework Structured in statements Subject a resource (URI) Predicate a verb, property or relationship Object a resource or a literal string
Example: bookstore Object Subject Predicate http://...isbn/9780136042594 title Artificial Intelligence: A Modern Approach Predicate Predicate publisher Object author Prentice Hall Object author Norvig, Peter Object Russel, Stuart
Example: bookstore http://...isbn/9780136042594 title Artificial Intelligence: A Modern Approach RDF in XML syntax <rdf:RDFxmlns:rdf=http://www.w3.org/…/22-rdf-syntax-ns#> <rdf:Descriptionabout=“http://... isbn/9780136042594”> <title>Artificial Intelligence: A Modern Approach</title> </rdf:Description> </RDF>
Example: bookstore http://...isbn/9780136042594 title Artificial Intelligence: A Modern Approach RDF in Turtle <http://... isbn/9780136042594> title “Artificial Intelligence: A Modern Approach”
Example: bookstore Represent the following data about the Italian translation of the AI book as a set of relations Title: “Intelligenzaartificiale. Un approcciomoderno” Author:Russel, Stuart and Norvig, Peter Publisher: Prentice Hall ISBN: 978-8871925936 Original ISBN: 978-0136042594
Example: bookstore IntelligenzaArtificiale. Un approcciomoderno title http://...isbn/9788871925936 publisher original Prentice Hall http://...isbn/9780136042594 creator creator Russel, Stuart Norvig, Peter
Example: bookstore IntelligenzaArtificiale. Un approcciomoderno title http://...isbn/9788871925936 publisher Prentice Hall original http://...isbn/9780136042594 http://...isbn/9780136042594 title Artificial Intelligence: A Modern Approach publisher Prentice Hall
Example: bookstore IntelligenzaArtificiale. Un approcciomoderno title http://...isbn/9788871925936 publisher Prentice Hall original http://...isbn/9780136042594 same URI, same resource http://...isbn/9780136042594 title Artificial Intelligence: A Modern Approach Prentice Hall creator
Example: bookstore IntelligenzaArtificiale. Un approcciomoderno title http://...isbn/9788871925936 publisher Prentice Hall original http://...isbn/9780136042594 title Artificial Intelligence: A Modern Approach publisher author creator Prentice Hall creator Russel, Stuart Norvig, Peter author Russel, Stuart Norvig, Peter
Example: bookstore IntelligenzaArtificiale. Un approcciomoderno title http://...isbn/9788871925936 What about merging creator and author? In RDF, it is not possible! publisher Prentice Hall original http://...isbn/9780136042594 title Artificial Intelligence: A Modern Approach publisher author creator Prentice Hall creator Russel, Stuart Norvig, Peter author Russel, Stuart Norvig, Peter
Problem: field names are arbitrary Synonyms : author or creator or maker or contributor or… Singular or plural: author or authors Solution: standards general or domain-specific
Dublin Core General vocabulary Dublin Core Metadata Initiative (DCMI) http://dublincore.org Building blocks to define metadata for the Semantic Web Define title, contributor, publisher, license, date, language, etc.
Problem: field values are arbitrary Value type: string, date, integer, … Value format: “Norvig, Peter” or “Norvig, P.” or “Peter Norvig” or… Value restrictions: one value or multiple values (how many?) Solutions Standards Controlled vocabulary (close list of terms) Semantically rich descriptions to support search (RDFS and/or OWL)
Friend Of A Friend (FOAF) General ontology Describe persons, their activities and their relations to other people and objects http://www.foaf-project.org Building blocks to define structured relations between people Define name, familyName, givenName, knows, age, nick, etc.
Example: bookstore http://...isbn/9780136042594 dc:title Artificial Intelligence: A Modern Approach dc:publisher dc:creator dc:creator foaf:name Prentice Hall foaf:name foaf:name Norvig, Peter Russel, Stuart foaf: http://xmlns.com/foaf/spec dc: http://purl.org/dc/terms
Why? http://...isbn/9780136042594 http://...isbn/9780136042594 dc:creator author foaf:name Norvig, Peter Norvig, Peter
RDF Schema Schema Definition of the nodes and predicates used in a RDF document Domainand Range RDFS describes properties in terms of classes of resource to which they apply (from a “domain” to a “range”)
Example RDF data http://elite.polito.it/people/derussis http.//bit.ly/lingambmult teaches
Example Person subClassOf domain range Teacher Course teaches type type RDF schema RDF data http://elite.polito.it/people/derussis http.//bit.ly/lingambmult teaches
Back to the bookstore example… http://...isbn/9780136042594 http://...isbn/9780136042594 dc:creator Why? author foaf:name Norvig, Peter Norvig, Peter
Back to the bookstore example… http://...isbn/9780136042594 http://...isbn/9780136042594 dc:creator Why? anonymous class author foaf:name Norvig, Peter Norvig, Peter dc:creator has range Agent, i.e. a class (resource), not a literal: we use an anonymous class for this scope. Finally, foaf:Name has range rdfs:Literal.
RDFS Expressivity • Simple relationship between things • RDFS provides a vocabulary to express relationship between things (e.g., subClassOf or type) • Avoid complex relationship • RDFS cannot describe data in terms of set of operations (e.g., unionOf), equivalence (e.g., sameAs) or cardinality (e.g., allValueFrom)