220 likes | 335 Views
INFORMATION FACILITY. GLOBAL BIODIVERSITY. 10 th Nodes Meeting. David Remsen Senior Programme Officer, ECAT 3 Oct 2009. WWW.GBIF.ORG. Use cases. Map species data to a controlled list or authority file (or system). Are my names spelled right? Taxonomy up to date?
E N D
INFORMATIONFACILITY GLOBALBIODIVERSITY 10th Nodes Meeting David Remsen Senior Programme Officer, ECAT 3 Oct 2009 WWW.GBIF.ORG
Use cases • Map species data to a controlled list or authority file (or system). Are my names spelled right? Taxonomy up to date? • Provide controlled lists for data entry • Build new regional or thematic species lists • Provide integrated/flexible taxonomic browsing of species data • Support browsing in native/other languages • Support the use of common names in search/access • Find species information regardless of misspellings, synonyms, etc. • Automated processes for identifying, extracting, validating, and linking names in documents, publications, web sites, etc.
Scope of the Global Names Architecture A global publication and discovery system for taxon names and concepts
GNA infrastructure extends GBIF infrastructure Common publication framework for GBIF and other networks
ECAT Work Programme • Define/implement the architectural framework for publishing taxonomic data • Build a global index of published “checklists” to enable discovery, integration and access • Build services and tools that use the published “checklist” data
Scope “Checklists” • Taxonomic Catalogues • Monographs, Regional Flora/Faunas, Taxonomic “Aggregates” • Species Inventories • Regional species lists • Thematically defined species lists • Red Lists, Invasive Species Lists, Medicinal, etc. • Common Name Lists/Inventories • Species indices in published content
Status of Architecture • Robust and extensible data standard in place • Documentation, examples, code • Tools and services to support global extension and use of the standard • Capacity to register, publish, and access data is in place now * • Simple Name indexes • Complex Taxonomic Data * Needs promotion, use, and refinement
Rich, Extensible, Simple Standard http://code.google.com/p/gbif-ecat/wiki/DwCArchive
Developing, Detailed documentation Source Code Schemas Tools Instructions Documentation Feedback!!! (please) http://code.google.com/p/gbif-ecat/
Extensible, International, Controlled Community authoring Controlled Vocabularies Extensions Multi-lingual Thesauri Nov 2009 release http://vocabularies.gbif.org
30% to 95% occurrence data linked to taxonomic sources Publishing Infrastructure is in place
Goal 2 of ECAT Work Build and integrate Indexes of the Published Checklist Data
Global Checklist Index A global name service brokerage to names hosted on taxonomic servers
Global Names Index Facilitates Linking URIs+Species Names Web Service “Fuzzy” name matching Simple DwC standard Publicise species data entry by species name
Goal 3 of ECAT Work Build Services & Applications that use published Checklists
New Processes/Technologies New Derived Products Enabling new and extended uses of taxonomic data
Applied to data access and integration Uses that provide benefits to GBIF participants
Questions • Does this enhanced capacity, to publish species-level data, particularly checklists, serve a need with the NODES? • Is the IPT sufficient for providing the capacity to publish these types of checklists • Would NODES like additional options? • Focus on cataloging as much Checklist content as possible • LifeDesks and Scratchpads, plus all the known databases that are "out there” • Develop the widest array of basic web services to enable their use? • More name processing tools/ mapping services?
How to contact GBIF: • Web site: www.gbif.org • Data portal: data.gbif.org • GBIF Secretariat • Universitetsparken 152100 CopenhagenDenmark • E-mail: info@gbif.org • Phone: +45 3532 1470 • Fax: +45 3532 1480