The Semantic Web Nottingham University 23 March 2007
The Official Definition http://www.w3.org/2001/sw/ Definition: The Semantic Web is the representation of data on the World Wide Web. It is a collaborative effort led by W3C with participation from a large number of researchers and industrial partners. It is based on the Resource Description Framework (RDF), which integrates a variety of applications using XML for syntax and URIs for naming.
Semantic = Meaning ?
Description The Semantic Web • HTML for presentation • XML for data • RDF for meaning A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities…
Semantic = Data Modelling
Semantic Web is about data modelling • To make computers useful they must interact with the real world • So they must model the real world in some way • UML/Object Oriented; classes, instances, attributes • Relational (databases); tables, keys, constraints, schema • XML; documents, semistructured information, schema • Semantic Web uses RDF – with some similarities (and some differences) to these other techniques
Semantic Web is about data modelling
• The base level of RDF is the triple: {subject, predicate, object}
• Examples
{[I], rdf:type, foaf:Person}
{[I], foaf:name, "Michael Souris"}
{[I], foaf:mbox, mailto:mm@example.com}
• And
{foaf:name, rdf:type, rdf:Property}
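As a rough illustration of the triple model, the statements above can be held as plain Python tuples. This is only a sketch: the `_:me` label standing in for `[I]` and the `objects` helper are hypothetical names chosen for the example, not part of RDF or FOAF.

```python
# Each RDF statement is a (subject, predicate, object) triple.
# "_:me" is a made-up blank-node label standing in for [I] on the slide.
ME = "_:me"

triples = {
    (ME, "rdf:type", "foaf:Person"),
    (ME, "foaf:name", "Michael Souris"),
    (ME, "foaf:mbox", "mailto:mm@example.com"),
    # Properties are resources too, so they can be described with triples.
    ("foaf:name", "rdf:type", "rdf:Property"),
}


def objects(graph, subject, predicate):
    """Return every object o such that (subject, predicate, o) is in graph."""
    return {o for (s, p, o) in graph if s == subject and p == predicate}


print(objects(triples, ME, "foaf:name"))          # {'Michael Souris'}
print(objects(triples, "foaf:name", "rdf:type"))  # {'rdf:Property'}
```

Because the whole model is a set of triples, a lookup is just a filter over that set; no table or class definition has to exist first.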
RDF triples define a graph data structure
[Graph: one node with three outgoing arcs: rdf:type to foaf:Person, foaf:name to "Michael Souris", foaf:mbox to mailto:mm@example.com]
.. which can be serialized in XML
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:foaf="http://xmlns.com/foaf/0.1/">
  <foaf:Person>
    <foaf:name>Michael Souris</foaf:name>
    <foaf:mbox rdf:resource="mailto:mm@example.com" />
  </foaf:Person>
</rdf:RDF>
.. and N3
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
[] a foaf:Person;
   foaf:name "Michael Souris";
   foaf:mbox <mailto:mm@example.com> .
Merging RDF data from different sources
Media metadata database:
[Graph: ../presentation.ppt has dc:creator pointing to a person node, dc:date 2006-04-23 and dc:description "Talk on Semantic Web"; the person node has org:email mailto:steve.cayzer@hp.com]
The second datasource
Organization database:
[Graph: a person node with rdf:type org:Engineer, vcard:FN "Steve Cayzer", org:worksOn ex:SemanticBlogging and org:email mailto:steve.cayzer@hp.com]
Two datasources merged
Merged database:
[Graph: ../presentation.ppt has dc:creator pointing to a single person node, dc:date 2006-04-23 and dc:description "Talk on Semantic Web"; that person node has rdf:type org:Engineer, vcard:FN "Steve Cayzer", org:worksOn ex:SemanticBlogging and org:email mailto:steve.cayzer@hp.com]
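The merge can be sketched in a few lines of Python. This assumes, as the diagrams suggest but do not state, that the two sources describe the same person because both record the same org:email value; the node labels `_:a` and `_:b` and the `merge_on` helper are hypothetical.

```python
# Two independent sources; "_:a" and "_:b" are made-up local node labels.
media = {
    ("../presentation.ppt", "dc:creator", "_:a"),
    ("../presentation.ppt", "dc:date", "2006-04-23"),
    ("../presentation.ppt", "dc:description", "Talk on Semantic Web"),
    ("_:a", "org:email", "mailto:steve.cayzer@hp.com"),
}
organization = {
    ("_:b", "rdf:type", "org:Engineer"),
    ("_:b", "vcard:FN", "Steve Cayzer"),
    ("_:b", "org:worksOn", "ex:SemanticBlogging"),
    ("_:b", "org:email", "mailto:steve.cayzer@hp.com"),
}


def merge_on(graph_a, graph_b, key):
    """Union two triple sets, unifying nodes that share a value for `key`."""
    union = graph_a | graph_b
    canonical = {}  # key value -> first node seen with that value
    rename = {}     # node -> node that represents it after merging
    for s, p, o in sorted(union):
        if p == key:
            rename[s] = canonical.setdefault(o, s)
    return {(rename.get(s, s), p, rename.get(o, o)) for s, p, o in union}


merged = merge_on(media, organization, "org:email")

# Follow dc:creator from the file, then vcard:FN from the creator node.
creator = next(o for s, p, o in merged
               if s == "../presentation.ppt" and p == "dc:creator")
names = {o for s, p, o in merged if s == creator and p == "vcard:FN"}
print(names)  # {'Steve Cayzer'}
```

Neither source alone can answer "what is the name of the person who created this presentation?"; the merged graph can, because the two descriptions now hang off one node.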
Linking foaf profiles …
[Graph: one node with rdf:type foaf:Person, foaf:name "Michael Souris" and foaf:mbox mailto:mm@example.com]
… into a social network …
[Graph: two foaf:Person nodes linked by foaf:knows. One has foaf:name "Michael Souris" and foaf:mbox mailto:mm@example.com; the other has foaf:name "Donald Canard" and foaf:mbox mailto:dd@example.com]
… and serialized for machines …
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:foaf="http://xmlns.com/foaf/0.1/">
  <foaf:Person>
    <foaf:name>Michael Souris</foaf:name>
    <foaf:mbox rdf:resource="mailto:mm@example.com" />
    <foaf:knows>
      <foaf:Person>
        <foaf:mbox rdf:resource="mailto:dd@example.com" />
        <foaf:name>Donald Canard</foaf:name>
      </foaf:Person>
    </foaf:knows>
  </foaf:Person>
</rdf:RDF>
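To show what "for machines" can mean in practice, here is a small Python sketch that reads the foaf:knows link out of this document using only the standard library. It handles just the nested form shown on the slide and is not a general RDF/XML parser (RDF/XML allows many equivalent spellings of the same graph, which a real RDF toolkit would handle); the helper names are hypothetical.

```python
import xml.etree.ElementTree as ET

FOAF = "http://xmlns.com/foaf/0.1/"

DOC = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:foaf="http://xmlns.com/foaf/0.1/">
  <foaf:Person>
    <foaf:name>Michael Souris</foaf:name>
    <foaf:mbox rdf:resource="mailto:mm@example.com" />
    <foaf:knows>
      <foaf:Person>
        <foaf:mbox rdf:resource="mailto:dd@example.com" />
        <foaf:name>Donald Canard</foaf:name>
      </foaf:Person>
    </foaf:knows>
  </foaf:Person>
</rdf:RDF>"""


def person_name(person):
    """Text of the foaf:name child of a foaf:Person element."""
    return person.findtext(f"{{{FOAF}}}name")


def knows_pairs(xml_text):
    """Yield (name, friend_name) for each foaf:Person nested in foaf:knows."""
    root = ET.fromstring(xml_text)
    for person in root.iter(f"{{{FOAF}}}Person"):
        for friend in person.findall(f"{{{FOAF}}}knows/{{{FOAF}}}Person"):
            yield person_name(person), person_name(friend)


print(list(knows_pairs(DOC)))  # [('Michael Souris', 'Donald Canard')]
```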
.. or visualised by people The Friend of a Friend (FOAF) project is about creating a Web of machine-readable homepages describing people, the links between them and the things they create and do. Distributed RDF/XML records describing people, who they know, projects they work on…
Semantic Web is also about global sharing of models • Share data, but also share models (or vocabularies) • Properties and classes have URIs (so we can specify their semantics: ‘knows’ = met/friend/emailed/aware_of) • URIs must be globally unique (can use URLs) • Many shareable models, e.g. documents (Dublin Core), images (EXIF!), people (FOAF), life sciences (ways of talking about genes, proteins), museums (items) • Modelling is not new; what’s new is that we can share models globally and mix them up, e.g. ‘find me photos (image metadata) about my friends (social network)’
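The "photos about my friends" query can be sketched over a toy triple set that mixes two vocabularies. The data, file names and the `photos_of_friends` helper are invented for illustration; foaf:knows, foaf:depicts and dc:title are existing FOAF and Dublin Core terms, used here in prefixed form rather than as full URIs.

```python
data = {
    # Social network (FOAF vocabulary)
    ("_:me", "foaf:knows", "_:donald"),
    ("_:donald", "foaf:name", "Donald Canard"),
    # Image metadata (Dublin Core title, FOAF depicts); file names are made up
    ("photo1.jpg", "foaf:depicts", "_:donald"),
    ("photo1.jpg", "dc:title", "Donald at the lake"),
    ("photo2.jpg", "foaf:depicts", "_:stranger"),
    ("photo2.jpg", "dc:title", "Somebody else"),
}


def photos_of_friends(graph, me):
    """Return sorted (photo, title) pairs for photos depicting people `me` knows."""
    friends = {o for s, p, o in graph if s == me and p == "foaf:knows"}
    photos = {s for s, p, o in graph if p == "foaf:depicts" and o in friends}
    return sorted((s, o) for s, p, o in graph
                  if s in photos and p == "dc:title")


print(photos_of_friends(data, "_:me"))  # [('photo1.jpg', 'Donald at the lake')]
```

The query crosses from one vocabulary to the other without any mapping step, because both sets of statements refer to the same node for the friend.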
The Fractal Semantic Web • From http://www.w3.org/2005/Talks/1110-iswc-tbl/
Semantic Web Technologies
[Layer diagram, labels as extracted from top to bottom: Trust; Proof; Rules and logic; Digital Signature; Query; OWL and RDFS; RDF; XML + Namespaces + xmlschema; URI]
• RDF: modelling language for data (and metadata)
• URI: universal (web-based) naming scheme
Semantic Web Technologies
[Layer diagram, labels as extracted from top to bottom: Trust; Proof; Rules and logic; Digital Signature; Query; OWL and RDFS; RDF; XML + Namespaces + xmlschema; URI]
• OWL and RDFS: ontology languages for representing the semantics of data
• RDF: modelling language for data (and metadata)
• URI: universal (web-based) naming scheme
• RDF, RDFS, OWL standardised February 2004
Semantic Web Ontology Languages
RDFS
• Student is a subclass of Person
• friend is a subproperty of knows
• The domain and range of knows is Person
OWL
• Lions only eat meat
• The ‘ancestor’ relationship is transitive
• People have exactly one shoesize
• An email address unambiguously identifies one person
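To make these statements concrete, the sketch below applies four of them (subclass, subproperty, domain/range and transitivity) as inference rules over a toy triple set. It is a naive fixed-point loop rather than a real RDFS or OWL reasoner, it does not cover the other OWL examples (restrictions such as "only eat meat", cardinality, or identifying properties), and every `ex:` name is hypothetical.

```python
facts = {
    # Schema statements from the slide, using made-up ex: names
    ("ex:Student", "rdfs:subClassOf", "ex:Person"),
    ("ex:friend", "rdfs:subPropertyOf", "ex:knows"),
    ("ex:knows", "rdfs:domain", "ex:Person"),
    ("ex:knows", "rdfs:range", "ex:Person"),
    ("ex:ancestor", "rdf:type", "owl:TransitiveProperty"),
    # Instance data
    ("ex:alice", "rdf:type", "ex:Student"),
    ("ex:alice", "ex:friend", "ex:bob"),
    ("ex:carol", "ex:ancestor", "ex:dave"),
    ("ex:dave", "ex:ancestor", "ex:erin"),
}


def infer(graph):
    """Apply a few RDFS/OWL rules repeatedly until no new triples appear."""
    graph = set(graph)
    while True:
        new = set()
        for s, p, o in graph:
            for s2, p2, o2 in graph:
                # Subclass: an instance of a class is an instance of its superclass
                if p == "rdf:type" and p2 == "rdfs:subClassOf" and o == s2:
                    new.add((s, "rdf:type", o2))
                # Subproperty: a statement also holds for the superproperty
                if p2 == "rdfs:subPropertyOf" and p == s2:
                    new.add((s, o2, o))
                # Domain and range: using a property types its subject and object
                if p2 == "rdfs:domain" and p == s2:
                    new.add((s, "rdf:type", o2))
                if p2 == "rdfs:range" and p == s2:
                    new.add((o, "rdf:type", o2))
                # Transitivity: chain two statements of a transitive property
                if (p == p2 and o == s2
                        and (p, "rdf:type", "owl:TransitiveProperty") in graph):
                    new.add((s, p, o2))
        if new <= graph:
            return graph
        graph |= new


closure = infer(facts)
print(("ex:alice", "ex:knows", "ex:bob") in closure)       # True
print(("ex:bob", "rdf:type", "ex:Person") in closure)      # True
print(("ex:carol", "ex:ancestor", "ex:erin") in closure)   # True
```

Note that ex:bob is typed as a Person only in a second pass: first ex:friend yields ex:knows, then the range of ex:knows applies. That chaining is why the rules run until nothing new is produced.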
Semantic Web Technologies
[Layer diagram, labels as extracted from top to bottom: Trust; Proof; Rules and logic; Digital Signature; Query; OWL and RDFS; RDF; XML + Namespaces + xmlschema; URI]
• Upper layers: current research areas
• Rules and logic: ways of transforming data between ontologies (Rule Interchange Format working group chartered 2005)
• Query: query and data access protocol (SPARQL query language: W3C Working Draft 4 October 2006)
• OWL and RDFS: ontology languages for representing the semantics of data
• RDF: modelling language for data (and metadata)
• URI: universal (web-based) naming scheme
• RDF, RDFS, OWL standardised Feb 2004
Aktors Integration of distributed… • RAE Data • Geographical Location • ACM Taxonomy • Internal Research Data • … http://www.aktors.org/akt/ http://triplestore.aktors.org/SemanticWebChallenge/
An HP Example The Semantic Web Environmental Directory (SWED) is a prototype of a new kind of directory of environmental organisations and projects. Our goal is to develop a sustainable (realistically maintainable) and easy-to-use directory of environmental organisations and projects throughout the UK. www.swed.org.uk
A side note on XML • RDF is not XML (though it can be expressed in XML) • The semantic web is about global sharing of information models • RDF describes the information model • RDF can be expressed in XML (or N3, or drawn as a graph…) • XML Schema is all about syntax and so has limited use for RDF • RDF Schema is all about the model
Take home messages • The semantic web is about globally sharing, processing and integrating information. • RDF is a model. • RDF is not XML, though it may be serialized in XML • Semantic web technologies range from production ready (the lower layers) to active research areas (the upper layers)
HP Labs semweb site http://www.hpl.hp.com/semweb/