We live in an age of information overload . . . Attwood TK et al. (2009) Calling International Rescue: knowledge lost in literature and data landslide! Biochemical Journal 424:317–333.
. . . and we need help badly
• About a million new biomedical journal articles are published annually
• Without computational assistance to filter and integrate information, we are lost
• To permit computers to help us, we need two technologies:
  • A mechanism for describing things using common machine-readable vocabularies that permit unambiguous definition of entities and their relationships (semantic integration)
  • A mechanism for merging such descriptive information by the use of a common machine-processable format (syntactic integration)
• Tim Berners-Lee founded the World Wide Web Consortium (W3C) as an organization to develop technology standards for this:
  • OWL, the Web Ontology Language, for creating ontologies (structured vocabularies) for describing things
  • RDF, the Resource Description Framework, for structuring resource descriptions in a common format
A bluffer’s guide to RDF, ontologies and linked data
• The principles are very simple
• All entities (ontology classes) and their relationships (ontology properties) are identified by unique URIs, thus being semantically defined on the Web
• The URIs reference publicly available and commonly accepted ontologies (structured vocabularies), so that the meaning of terms is unambiguous
• Each relationship is expressed as a subject – predicate – object ‘triple’
• The syntax is defined by W3C’s Resource Description Framework (RDF)
• Examples:
    :my-article rdf:type fabio:JournalArticle .
    :my-article dc:creator "Shotton, David" .
    :my-article dc:title "CiTO, the Citation Typing Ontology" .
• Such statements can be combined into interconnected information networks (RDF graphs), forming ‘linked data’
  • thereby creating a web of knowledge, the Semantic Web
  • in which the truth content of each original statement is maintained
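The way triples combine into a graph can be sketched in a few lines of plain Python. This is only an illustration, not an RDF library: each triple is a tuple, a graph is a set of tuples, and merging two graphs is a set union, so every original statement survives unchanged. The `rdf`, `dc` and `fabio` namespace URIs are the published ones; the `http://example.org/` base standing in for the `:` prefix, the helper names and the split into two graphs are assumptions made for the example.

```python
# Namespace URIs for the prefixes used in the slide's examples.
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC = "http://purl.org/dc/elements/1.1/"
FABIO = "http://purl.org/spar/fabio/"
EX = "http://example.org/"  # hypothetical base for the ':' prefix


def uri(namespace, local_name):
    """Return a full URI reference in angle brackets."""
    return f"<{namespace}{local_name}>"


def literal(text):
    """Return a quoted string literal, escaping backslashes and quotes."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


article = uri(EX, "my-article")

# Two independently produced descriptions of the same article.
graph_a = {
    (article, uri(RDF, "type"), uri(FABIO, "JournalArticle")),
    (article, uri(DC, "creator"), literal("Shotton, David")),
}
graph_b = {
    (article, uri(DC, "creator"), literal("Shotton, David")),
    (article, uri(DC, "title"), literal("CiTO, the Citation Typing Ontology")),
}

# Merging is a set union: shared statements collapse, nothing is altered.
merged = graph_a | graph_b


def to_ntriples(graph):
    """Serialize the graph as one 'subject predicate object .' line per triple."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in sorted(graph))


print(to_ntriples(merged))
```

Because both graphs use the same URI for the article, the merged graph holds three distinct statements about one subject. Real projects would normally use an RDF toolkit rather than hand-built strings.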
20 Questions for Data Management Planning http://www.miidi.org/dmp
Then, on the third tab, describe data sharing and publication
Choose to save the file, rather than open it, and specify where
Save your DMP for inclusion in your data package
• Save the XML version on your desktop as YourFamilyName.xml
• Save the RDF version on your desktop as YourFamilyName.ttl
  • This saves it in Turtle format, a human-readable version of RDF
• N.B. File names should not contain spaces! Use underscores or hyphens, e.g.
  • Stanton-Jones.xml
  • van_der_Sompel.ttl
• Later, you will use DataStage to include these files in your data package and submit them to a data repository
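The file-naming rule can be checked mechanically. The sketch below is a hypothetical helper, not part of the DMP tool or DataStage: `dmp_filenames` and `has_no_spaces` are names invented here, and the choice of underscore as the default separator is an assumption (hyphens are equally acceptable under the rule above).

```python
import re


def dmp_filenames(family_name, separator="_"):
    """Build space-free .xml and .ttl file names from a family name.

    Runs of whitespace are replaced by `separator`; existing hyphens
    and underscores are left alone.
    """
    cleaned = re.sub(r"\s+", separator, family_name.strip())
    return f"{cleaned}.xml", f"{cleaned}.ttl"


def has_no_spaces(filename):
    """Return True when the file name contains no whitespace characters."""
    return not any(ch.isspace() for ch in filename)


xml_name, ttl_name = dmp_filenames("van der Sompel")
print(xml_name, ttl_name)
```

Running the same check with `has_no_spaces` on a name before saving is a quick way to catch a stray space.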