200 likes | 306 Views
EASTERN MEDITERRANEAN UNIVERSITY COMPUTER ENGINEERING DEPARTMENT. CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM. ONTOLOGY-DRIVEN SEMANTIC MARK UP OF UNSTRUCTURED TEXTS. Instructor: Assoc. Prof. Dr. Atilla ELCI. Faaya H. Tiwuya. OUTLINE. Introduction Defination of terms
E N D
EASTERN MEDITERRANEAN UNIVERSITY COMPUTER ENGINEERING DEPARTMENT CMPE 588 ENGINEERING THE SEMANTIC WEB INFORMATION SYSTEM ONTOLOGY-DRIVEN SEMANTIC MARK UP OF UNSTRUCTURED TEXTS Instructor: Assoc. Prof. Dr. Atilla ELCI Faaya H. Tiwuya
OUTLINE • Introduction • Defination of terms • Need to use ontologies • Annotation tools • Project Tools • Ontology driven semantic mark up/annotation process • Sample Semantic Mark up • Querying knowledge base
OUTLINE • Conclusion • References • Questions
INTRODUCTION • A recent estimates by Merrill Lynch states that more than 85% of all business information exists as unstructured data • In order to realise the goal of the Semantic Web which is to allow machines share and exploit knowledge without central authority, there is a need to mark up this bulk of data in a way that machines will be able to read them
INTRODUCTION • Blogs are a typical example of information that exists in an unstructured format. • There is a need therefore to mark-up blog sites within the context of an “ontology” so that that meaningful information can be extracted from them for the population of a knowledge base for retrieval at a later time by different applications.
DEFINATION OF TERMS • What then is a blog, ontology and semantic annotation? • Blogs: A blog is a website where entries are written in a chronological order and displayed in reverse chronological order. Blogs provide commentary or news on a particular subject such as food, politics, or local news.
DEFINATION OF TERMS • Ontology: An ontology is a data model that represents a set of concepts within a domain and the relationships between those concepts. • It is used to reason about the objects within that domain. • Annotation: Annotation is the process of marking up or adding notes/comments to something. Within the context of the semantic web, it is the process of marking up documents with meaningful tags
NEED TO USE ONTOLOGIES • Why do we need to use ontologies? • To share common understanding of the structure of information among people or software agents • To enable reuse of domain knowledge • To make domain assumptions explicit • To separate domain knowledge from the operational knowledge • To analyze domain knowledge
ANNOTATION TOOLS • SHOE Knowledge Annotator • Annotea • SMORE • Semantic Word • KIM Semantic Annotation Platform • MnM • MnM is an annotation tool which provides both automated and semi-automated support for annotating web pages with semantic contents. MnM integrates a web browser with an ontology editor and provides open APIs to link to ontology servers and for integrating information extraction tools.
PROJECT TOOLS • MnM • TinyXml • C++ BUILDER 5.0
A CELEBRITY IS A PERSON • A celebrity “is a” person who is famous ( not for something infamous ) and attracts a lot of media attention. E.g David Beckham, Angelina Jolie, Zinedane Zidane. • Sites exists where people post comments about these sort of people almost o a daily basis. • Some of these sites contain important information.
SAMPLE ONTOLOGY –DRIVEN SEMANTIC MARKUP ON A DAVID BECKHAM BLOGUSING MnM
CONCLUSION • Blogs contain a lot of valuable information about different topics • An otology driven semantic annotation of these blogs will create a unified means to share the wealth of information these blogs possess.
REFERENCES • Prof. Dr. Atilla Aelci, Lecture slides CMPE 588 Engineering Semantic Web information Systems. • Ontology Development 101: A Guide to Creating Your First Ontology Natalya F. Noy and Deborah L. McGuinness Stanford University, Stanford, CA, 94305 noy@smi.stanford.edu and dlm@ksl.stanford.edu • http://en.wikipedia.org/wiki/Unstructured_data • http://semanticweb2001.aifb.uni-karlsruhe.de/ • http://en.wikipedia.org/wiki/Blog • http://en.wikipedia.org/wiki/Ontology_(computer_science) • http://annotation.semanticweb.org/tools/ • M.Vargas-Vera, E. Motta , J. Domingue M. Lanzoni , A. Stutt, F. Ciravegna, MnM: Ontology-Driven Tool for Semantic Markup • TinyXml, http://www.grinninglizard.com/tinyxmldocs/index.html
THANK YOU FOR YOUR ATTENTION • QUESTIONS