150 likes | 161 Views
This overview explores the principles of linking open drug data, using semantic web technologies to publish structured data and set links between different data sources. It includes the importance of hyperlinks and the challenges faced in linking algorithms and data browsers.
E N D
Linking Open Drug DataSusie Stephens,Principal Research Scientist, Eli Lilly
Overview • Linked Data Principals • Linking Open Drug Data
The Classic Web Single information space Built on URIs globally unique IDs retrieval mechanism Built on Hyperlinks are the glue that holds everything together Search Engines Web Browsers HTML HTML HTML hyper-links hyper-links A C B Source: Chris Bizer
Linked Data Linked Data Browsers Linked DataMashups Search Engines Thing Thing Thing Thing Thing Thing Thing Thing Thing Thing typedlinks typedlinks typedlinks typedlinks A E C D B • Use Semantic Web technologies to publish structured data on the Web and set links between data from one data source and data from another data sources Source: Chris Bizer
Data Objects Identified with HTTP URIs rdf:type foaf:Person pd:cygri foaf:name Richard Cyganiak foaf:based_near dbpedia:Berlin pd:cygri = http://richard.cyganiak.de/foaf.rdf#cygridbpedia:Berlin = http://dbpedia.org/resource/Berlin Forms an RDF link between two data sources Source: Chris Bizer
Dereferencing URIs over the Web 3.405.259 dp:population skos:subject dp:Cities_in_Germany rdf:type foaf:Person pd:cygri foaf:name Richard Cyganiak foaf:based_near dbpedia:Berlin Source: Chris Bizer
Dereferencing URIs over the Web 3.405.259 dp:population skos:subject dp:Cities_in_Germany rdf:type foaf:Person pd:cygri foaf:name Richard Cyganiak foaf:based_near dbpedia:Berlin skos:subject dbpedia:Hamburg skos:subject dbpedia:Meunchen Source: Chris Bizer
The Linked Data Cloud • > 2 billion RDF triples • > 3 million links
Linking Open Drug Data • HCLSIG task started October 1, 2008 • Primary Objectives • Survey publicly available data sets about drugs • Publish and interlink these data sets on the Web • Explore interesting questions that could be answered if the data sets are linked
LODD Participants • Bosse Andersson • Chris Bizer • Kei Cheung • Don Doherty • Oktie Hassanzadeh • Anja Jentzsch • Scott Marshall • Eric Prud’hommeaux • Matthias Samwald • Susie Stephens • Jun Zhao
LODD: Data Set Evaluation Source: http://esw.w3.org/topic/HCLSIG/LODD/Data/DataSetEvaluation
Characterizing Drug Data Sources Source: Mark Sharp, et al: A Framework for Characterizing Drug Information Sources, 2008
Conclusions Rapidly growing cloud of Linked Data Many data sets related to life sciences W3C’s HCLSIG has published 4 drug related data sets Improvements needed to linking algorithms and data browsers