A Darwin-Core Archive solution to publishing and indexing taxonomic data within the GBIF network
GLOBAL BIODIVERSITY INFORMATION FACILITY (WWW.GBIF.ORG)
David Remsen, ECAT Program Officer, September 2010
Thanks: Peter Desmet, Canadensys (graphics)
Enabling global discovery: Objectives
• Develop capacity to document and publish taxonomic data
  • A simple exchange format
  • Suite of publication tools
• Promote the publication of taxonomic data in a common format
• Build and maintain an index of published checklists
• Build services on this index that address user needs in the GBIF network
Enabling global discovery: Outcomes
• Embed taxonomy into large-scale biodiversity data and information management
• Improved interoperability among resources
• Improved precision and recall within resources
• Increased efficiency in taxon-related linking, mapping, data mining, and data management
• Increased recognition of the value and relevance of taxonomy within all biodiversity information interchange (large and small)
Darwin Core Archive Data Format
Darwin Core
• Ratified in 2009
• Significant additions/refinements
• Set of terms: http://rs.tdwg.org/dwc/terms/index.htm
• Simple Darwin Core (subset)
• Express as text: http://rs.tdwg.org/dwc/terms/guides/text/index.htm
Core components: single file
• Classification
• Synonymy
• Publication details
• Simple to export
• Simple to manage
• Comma-separated values text file
[Diagram: a single Taxon core file]
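To make the single-file core concrete, here is a minimal sketch of a Taxon core written as CSV. The column names are Darwin Core terms. The rows, the choice of columns, and the helper name `write_core` are illustrative assumptions, not content from the slides. Classification is carried by `parentNameUsageID`, synonymy by `acceptedNameUsageID`, and publication details by `namePublishedIn`.

```python
import csv
import io

# Column names are Darwin Core terms; which ones to include is an assumption.
FIELDS = ["taxonID", "parentNameUsageID", "acceptedNameUsageID",
          "scientificName", "taxonRank", "taxonomicStatus", "namePublishedIn"]

# Illustrative rows only; the citation text is a placeholder.
ROWS = [
    # Classification: each accepted taxon points at its parent.
    {"taxonID": "1", "scientificName": "Felidae",
     "taxonRank": "family", "taxonomicStatus": "accepted"},
    {"taxonID": "2", "parentNameUsageID": "1", "scientificName": "Puma",
     "taxonRank": "genus", "taxonomicStatus": "accepted"},
    {"taxonID": "3", "parentNameUsageID": "2", "scientificName": "Puma concolor",
     "taxonRank": "species", "taxonomicStatus": "accepted",
     "namePublishedIn": "placeholder citation"},
    # Synonymy: a synonym points at its accepted name.
    {"taxonID": "4", "acceptedNameUsageID": "3", "scientificName": "Felis concolor",
     "taxonRank": "species", "taxonomicStatus": "synonym"},
]


def write_core(rows):
    """Serialize the rows as CSV text with a header line; missing cells stay empty."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, restval="", lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


core_csv = write_core(ROWS)
print(core_csv)
```

Because everything lives in one flat table, a plain database export or a spreadsheet "save as CSV" would be enough to produce such a file.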
Extending Darwin Core
• Extensions defined via simple schema
• Darwin Core or other terms
• Linked to controlled vocabularies
• One taxon, many extension records
• Simple to export
• Simple to manage
• Comma-separated values text file
[Diagram: the Taxon core links one-to-many to a Types and Specimens extension and one-to-many to a Bibliography extension]
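The one-to-many link between a core record and its extension records can be sketched as a simple grouping on the shared taxon identifier. The sample data, the vernacular-names extension used here, and the helper name `group_by_taxon` are assumptions for illustration; a real archive would use whatever identifier column its metafile declares.

```python
import csv
import io
from collections import defaultdict

# Illustrative core and extension files; contents are invented sample data.
CORE = "taxonID,scientificName\n1,Puma concolor\n2,Lynx lynx\n"
VERNACULAR = (
    "taxonID,vernacularName,language\n"
    "1,cougar,en\n"
    "1,puma,es\n"
    "2,Eurasian lynx,en\n"
)


def group_by_taxon(extension_text):
    """Index extension rows by the taxonID of the core record they belong to."""
    grouped = defaultdict(list)
    for row in csv.DictReader(io.StringIO(extension_text)):
        grouped[row["taxonID"]].append(row)
    return grouped


names = group_by_taxon(VERNACULAR)

# One core record can carry zero, one, or many extension records.
for taxon in csv.DictReader(io.StringIO(CORE)):
    labels = [r["vernacularName"] for r in names.get(taxon["taxonID"], [])]
    print(taxon["scientificName"], "->", labels)
```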
Metafile describes the set
[Diagram: the Metafile describes the Core and each extension file (Types and Specimens, Bibliography); the Core links one-to-many to each extension]
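As a rough sketch of what such a metafile could contain, the snippet below builds a small descriptor with Python's standard `xml.etree.ElementTree`. The element and attribute names follow the Darwin Core text guide as best understood here. The file names, the selected fields, the vernacular-names row type, and the helper `file_element` are assumptions chosen for illustration; check the output against the text guide and the validator before relying on it.

```python
import xml.etree.ElementTree as ET

DWC = "http://rs.tdwg.org/dwc/terms/"
NS = "http://rs.tdwg.org/dwc/text/"  # namespace of the descriptor itself


def file_element(parent, tag, row_type, location, id_tag, terms):
    """Describe one CSV file: its row type, its location, and its columns."""
    el = ET.SubElement(parent, tag, {
        "rowType": row_type,
        "fieldsTerminatedBy": ",",
        "linesTerminatedBy": "\\n",   # literal backslash-n, as written in the metafile
        "ignoreHeaderLines": "1",
        "encoding": "UTF-8",
    })
    files = ET.SubElement(el, "files")
    ET.SubElement(files, "location").text = location
    # Column 0 holds the identifier: <id> in the core, <coreid> in an extension.
    ET.SubElement(el, id_tag, {"index": "0"})
    for index, term in enumerate(terms, start=1):
        ET.SubElement(el, "field", {"index": str(index), "term": term})
    return el


archive = ET.Element("archive", {"xmlns": NS})
# Hypothetical file names and column choices.
file_element(archive, "core", DWC + "Taxon", "taxa.csv", "id",
             [DWC + "scientificName", DWC + "taxonRank"])
file_element(archive, "extension", "http://rs.gbif.org/terms/1.0/VernacularName",
             "vernacular.csv", "coreid", [DWC + "vernacularName"])

meta_xml = ET.tostring(archive, encoding="unicode")
print(meta_xml)
```

The descriptor is what lets a consumer interpret plain CSV files without guessing: it maps each column position to a term and ties every extension row back to a core record.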
Core + Set of Extensions: “GNA Simple Exchange Format”
[Diagram: the Taxa core links one-to-many to four extensions (Vernacular Names, Bibliography, Types and Specimens, Distribution); the Metafile describes the set]
Metadata documents the resource: GBIF EML profile
[Diagram: the Taxa core links one-to-many to four extensions (Vernacular Names, Bibliography, Types and Specimens, Distribution); the Metafile describes the set; a metadata document in the GBIF EML profile documents the resource]
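Putting the pieces together, an archive is the core file, its extensions, the metafile, and the metadata document bundled as one package. The sketch below zips them with Python's standard `zipfile` module. The file names `meta.xml` and `eml.xml` follow common Darwin Core Archive convention but are assumptions here, as are the helper name `build_archive` and the placeholder XML contents.

```python
import io
import zipfile


def build_archive(files):
    """Zip a mapping of file name -> text content and return the archive bytes."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        for name, text in files.items():
            zf.writestr(name, text)
    return buf.getvalue()


# Illustrative contents; the two XML documents are empty placeholders.
files = {
    "taxa.csv": "taxonID,scientificName\n1,Puma concolor\n",
    "vernacular.csv": "taxonID,vernacularName\n1,cougar\n",
    "meta.xml": "<archive/>",   # placeholder for the descriptor metafile
    "eml.xml": "<eml/>",        # placeholder for the EML metadata document
}

archive_bytes = build_archive(files)

# Read the archive back to confirm what it contains.
with zipfile.ZipFile(io.BytesIO(archive_bytes)) as zf:
    print(sorted(zf.namelist()))
```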
Validator
Status: Under evaluation
http://tools.gbif.org/dwca-validator/
Darwin Core Archive Publishing Options
Integrated Publishing Toolkit
• Compose EML metadata
• Connect to database
• Upload data
• Transform to DWC-A
• Publish via GBIF
http://ipt.gbif.org
Status: Stable release, end 2010
Guidelines and Best Practices
• DB admin skills
• Database export
• No tools required
• Successful pilots: Ireland, NBN UK, Norway, Avian Knowledge Network, IPNI, IRMNG
Status: Drafts for November campaign (see roadmap)
Authoring Descriptor XML Metafile
Status: Ready for review
http://tools.gbif.org/dwca-assistant/
Excel Spreadsheet Templates
Status: Ready for review/testing
Spreadsheet Processor
Status: Ready for review
http://tools.gbif.org/spreadsheet-processor/
Checklist Bank
http://ecat-dev.gbif.org/
Status: Dev version in place. Integration with GBIF data portal in 2011
Roadmap
• Evaluation, testing, and refinement: Q4 2010
• Consolidate docs and publishing for version 1 of the Simple Exchange Format using DWC-A
• Target current taxonomic data export publishers
• Small grants to pilot DWC-A exports
• Seed funds to GBIF Nodes
• Publish regional and thematic species checklists
• Evaluate 1.0 extensions and vocabularies