140 likes | 265 Views
Sandesh A Semantic Mesh Over Indian Open Data Open Systems Lab IIIT Bangalore http://osl.iiitb.ac.in/ http://sandesh.iiitb.ac.in:4000/. Open Data in India: A Summary [Agrawal et al. 2013]. Open Data: Challenges and Opportunities.
E N D
Sandesh A Semantic Mesh Over Indian Open Data Open Systems Lab IIIT Bangalore http://osl.iiitb.ac.in/ http://sandesh.iiitb.ac.in:4000/
Open Data: Challenges and Opportunities 100+ and a growing number of websites providing publicly accessible, structured, utilitarian datasets Each website comprising 100s-1000s of datasets Immense opportunity and challenge Semantic challenge: Data about any entity (place, crop, disease, etc.) fragmented across several datasets from several sources Need for a semantic mesh, to interconnect data from disparate sources Aim of the Sandesh project
Sandesh A “semantic data mesh” over Indian Open Data Connecting elements from different datasets under an overarching semantic structure Challenges Open data about no single topic in particular, fits into no single ontology Contextual boundaries of open data assertions unable to model using LinkedData standards The problem of “open-ended” data [Srinivasa et al. 2013]
Utility and Boundedness Consider the following RDF statements written in the form of (subject, predicate, object) triples: Encyclopedic knowledge Valid everywhere without contextual boundaries No immediate or specific utility Utilitarian knowledge Valid only within specific contextual boundaries (market, place, time, etc.) Has immediate and/or specific utility
Open data and Open-ended data CV of student Samar Jain published on his website Exam transcripts of Samar Jain Consider the following kinds of data: Login credentials for Samar Jain Open data: Author has no knowledge and control over who knows or has access to this data Closed data: Author has complete control over who else knows or is allowed to access this data Open-ended data: Author has no knowledge or control over who else knows or can access this data. However, author shares this data within a trusted environment that can guarantee legitimacy of access. Provide examples of open-ended data that were shared as open data
Many Worlds on a Frame (MWF) Place Person Crop Institution
MWF World Structure Institution Member Office Location Person Place Components Member Office Location Member Member Associations
MWF Frame Superclass World Container World is-a is-in is-a hierarchy inherits components, associations, constraints and attributes Institution Is-in hierarchy inherits privileges, visibility, non-existence is-a and is-in relationships form a tree rooted in worlds called Concept and UoD respectively. The data structure formed by is-a and is-in connections is called the Frame Concept is-a Concept, Concept is-in UoD UoD is-a Concept, UoD is-in UoD
MWF Privileges Institution Person Credentials of a Person (User) defined by the roles played by the Person in different worlds Credentials determine privilege level in a target world Privilege-level privilege Frame-level privilege Structure-level privilege Data-level privilege Visibility privilege
RootSet A PoC partially implementing the MWF knowledge model Contains mechanisms to bulk-load data from CSV files into a conceptual world with matching components or associations Other features include: Wall messages, event notification, import of world structure from DBPedia
Sandesh Core Team Srinath Srinivasa Principal investigator
Sandesh Demo
References Sweety Agrawal, Jayati Deshmukh, Srinath Srinivasa, Chinmay Jog, Sri Sayi Bhavani Kakarla, Rahul Dhek, Sneha Deshpande, Sana Javed and Vikas Mohandoss. A Survey of Indian Open Data. Proceedings of IBM ICARE 2013. ACM Press. New Delhi, India. Oct 2013 Srinath Srinvasa, Sweety Agrawal, Chinmay Jog, Jayati Deshmukh. Characterizing Open Utilitarian Knowledge. Communicated for publication. 2013.