180 likes | 323 Views
Globally Unique Identifiers for Biodiversity Informatics (GUID). Ricardo Pereira Software Engineer. What is a GUID?. A GUID system provides mechanisms to identify and access data objects on the Web. GUID systems have identifiers Identifiers are persistent
E N D
Globally Unique Identifiers for Biodiversity Informatics (GUID) Ricardo Pereira Software Engineer ricardo@tdwg.org
What is a GUID? • A GUID system provides mechanisms to identify and access data objects on the Web. • GUID systems have identifiers • Identifiers are persistent • They are permanently associated with a data object. • Identifiers are globally unique • Identifiers are actionable or locatable. • Provides mechanisms to describe objects: metadata ricardo@tdwg.org
What are GUIDs good for? Slide by Donald Hobern ricardo@tdwg.org
What are GUIDs good for? Slide by Donald Hobern ricardo@tdwg.org
TDWG GUID Group • Broad cross-section of the community and other subscribers • Met twice this year • Durham, NC, USA (Feb) • Edinburgh, UK (June) • Worked via Wiki and Mailing List throughout 2006 ricardo@tdwg.org
TDWG GUID Work • Discussed requirements • Evaluated existing GUID technologies • LSID • DOI • Handles • PURL ricardo@tdwg.org
TDWG Group Work • Recommended the use LSID urn:lsid:lsid.tdwg.org:names:pk_12564 ricardo@tdwg.org
Why LSID? • Existing standard approach for retrieving data and metadata • Doesn’t require central authorities • Easy to assign large numbers of ids • Conceptually distinct from URLs • Dissociated from location (http) • Integrates with architecture model • Returns RDF • Resolvable ricardo@tdwg.org
LSID Evaluation • The TDWG group evaluated: • LSID specification: no changes required • LSID software: • Several prototype resolvers implemented • Few clients developed (thanks to Rod Page) • Need support for other platforms (PHP, Python) • Need few fixes to documentation ricardo@tdwg.org
LSID Applicability Statement • Purpose: • Defines how to use LSID in biodiversity information applications • Add our specific requirements on top of original LSID specification, while maintaining compatibility with other domains ricardo@tdwg.org
Recommendations • What gets an LSID? • Any object typed with the TDWG ontology • According to the TDWG architecture • Default metadata response in RDF • Default metadata binding is HTTP GET • Don’t use LSID revision part • Delegate versioning information to ontology • First 3 parts of LSID must be in lowercase • urn:lsid:authority_name:... • to allow simpler id matching and graph merging operations ricardo@tdwg.org
Public Service to Provide Authority Identification • Neutral authority identification: <authority_id>.lsid.tdwg.org • For those who cannot or do not want associate their own domain names with their identifiers • To facilitate migration of third-party datasets identified with LSIDs ricardo@tdwg.org
LSID Applicability Statement • Request for comments is out http://www.tdwg.org/subgroups/guid/ Please, give us your feedback!! • Additional Applicability Statements may be required from other domains, e.g.: • Taxonomic Names and Concepts, Specimens and Collections, Character Descriptions, Images, Sound, Video, Observations. ricardo@tdwg.org
Outreach • Documents targetted at: • Managers • Biologists • Software Developers (Resources Summary) ricardo@tdwg.org
The Roadmap • Addressed outstanding infrastructure issues • Establish GUID as a TDWG task group (charter) • Review LSID Applicability Statement and ratify It • Make sure relevant data sources are serving LSID • Nomenclators, collections, publications. • Develop domain specific applicability statements • Determine whether new platforms should be covered by LSID software • Investigate requirements of client software tools for browsing LSID and RDF data • Develop software to retrieve mappings of scientific names to nomenclators LSIDdatabases • Review of Applicability Statements and software ricardo@tdwg.org
LSID & RDF Tutorial • Hands-on tutorial on LSID and RDF • By Kevin Richards • Wednesday morning ricardo@tdwg.org
Acknowledgements • Gordon and Betty Moore Foundation • The National Evolutionary Synthesis Center (NESCENT) – USA • The e-Science Institute - UK • Global Biodiversity Information Facility (GBIF) • TDWG GUID Group • TDWG ricardo@tdwg.org
Thank You! • Any questions? ricardo@tdwg.org www.tdwg.org/subgroups/guid wiki.gbif.org/guidwiki ricardo@tdwg.org