170 likes | 277 Views
A Comprehensive Framework for Semantic Annotation of Web Content. Manuel Fiorelli 1 , Maria Teresa Pazienza 2 , Armando Stellato 2 1 Department of Civil Engineering and Computer Science, 2 Department of Enterprise Engineering University of Rome, Tor Vergata , Italy
E N D
A Comprehensive Framework forSemantic Annotation of Web Content Manuel Fiorelli1, Maria Teresa Pazienza2, Armando Stellato2 1Department of Civil Engineering and Computer Science, 2Department of Enterprise Engineering University of Rome, Tor Vergata, Italy {fiorelli,pazienza,stellato}@info.uniroma2.it
Outline • Annotation in action • The problem • Our proposal • Our choices • Combined managed of ontologies and Web content • Event-based behavioural model • Convention over configuration • End-user customizability • Conclusions
Annotation in action Annotation is a pervasive activity depending on different assumptions about granularity, level of formality and the use, if any, of ontologies. Collaborative authoring of documents Free text notes by co-authors NLP tools Structured annotations with weak semantics Semantic Web Annotations with respect to formalized knowledge to make explicit the meaning of Web resources (even Web services)
The problem The ultimate annotation system is unlikely to appear: • Even comprehensive annotation models(e.g. the draft produced by the Open Annotation W3C Community Group)maynot cover all usage scenarios • Divergent methodologies: automatic vsmanual annotation • Different applications: NLP, Web Annotation, …
The problem Supported applications Don’t take the convexity too strictly :-D Design Choices General Purpose Programming Language Full system Specification
Our proposal framework Supported applications Design Choices General Purpose Programming Language Full system Specification Low upfront commitment to specific design choices Progressive levels of customization/implementation
Our choices Narrowed Scope Applications dealing with combined management of ontologies and Web content. Behavioural Model Assignment of handlers to predetermined events. We call an assignment of semantically coherent handlers an annotation family Convention over Configuration To ease the development of common interaction patterns.
Combined management of ontologies and Web content Interaction with the domain model Interaction with the Web content cross realm interactions http://semanticturkey.uniroma2.it/
Semantic Turkey Architecture Three layeredextensiblearchitecture • Presentation Layer • An extension to one of Mozilla Applications • User Interface based on XUL, XBL and Javascript technologies • Not–only–presentation • Extensible via Mozilla Ext. Mechanism • Services Layer • Enablescommunicationbetween the client (Firefox browser extension) and the ontologypersistencelayer. • HTTP based Services accessed through the Ajax paradigm • OSGi Extensible Servicing System • PersistenceLayer • Access to ontologicalknowledge. • Based on dedicatedontology API, which can be implementedthrough use of differenttechnologies.
Event-based behavioural model From UI gestures(e.g. drag&drop) to events. selectionOverResource fired when a selection from a Web page is dropped onto an ontological resource resourceOverContent fired upon gestures for the association of Web content with an ontological resource regardless of their occurrence in the text contentLoaded triggered when Web content is loaded, in order to execute presentation related activities, e.g. highlighting the annotated fragments Creation of annotations Presentation of annotations
Families Events Handlers • selectionOverResource • resourceOverContent • contentLoaded bookmarking • selectionOverResource • resourceOverContent • contentLoaded open annotation (coarsegrain) • selectionOverResource • resourceOverContent • contentLoaded open annotation (fine grain) Event-based behavioural model Multiple handlers for offering different choices to the user Events agnostic about the resource type (e.g. class vs individual), but handlers have pre-conditions that may filter incoming events on the basis of the target resource type
Convention over configuration Event-based model makes almost no prior assumption about the application or the interaction patterns • Therefore, it is comprehensive However, to help the developer we provide a collection of predetermined handlers: • Add an annotation to a resource • Create a new instance • Create a (sub-)concept • Set a property value Which are defined up to a bunch of functions that must be implemented by the developer.
Convention over configuration Extension point
Convention over configuration Extension point
End-user customizability Most usage scenarios only concern with a subset of the possible interactions which a given family may offer. Strengthen the preconditions Select a subset of the handlers
Future Developments • Extend the set of available conventions • Evaluate our framework with respect to the coverage of the features of state-of-the-art annotation systems • Is it possible to implement the identified features in our framework? • How much effort does it require?