150 likes | 334 Views
Biodiversity Information Standards (TDWG: Taxonomic Databases Working Group). formed to establish international collaboration among biological database projects
E N D
Biodiversity Information Standards (TDWG: Taxonomic Databases Working Group) • formed to establish international collaboration among biological database projects • promotes the wider and more effective dissemination of information about the World's heritage of biological organisms for the benefit of the world at large • focuses on the development of standards for the exchange of biological/biodiversity data. • develop, adopt and promote standards and guidelines for the recording and exchange of data about organisms • promote the use of standards through the most appropriate and effective means and • act as a forum for discussion through holding meetings and through publications
Interest Groups • Biological Descriptions • Geospatial • Imaging / Multimedia • Invasive Species • Literature • Natural Collections • Observations and Specimen Records • Taxonomic Names and Concepts • TDWG Architecture / Infrastructure • Globally Unique Identifiers • TAPIR (TDWG Access Protocol Information Retrieval)
TDWG Activies • Annual Conference • Themed seminars • Working groups • Computer demonstrations • Self-maintained interest groups • Promotes ‘best practices’ • Propose new or updated standards • Promote wide spread use of standards
Of Interest to EOL • Globally Unique Identifiers • Annotations, de-duplication, consolidation of IDs • Life Science Identifiers (LSID) are TDWG recommended GUID format • Literature • Marking up species accounts, characteristics • TAPIR • Data access protocol allows customized response • But… not great for harvesters • Species Accounts and Characteristics • SPM can provide species descriptions and associations • SDD can provide atomized characteristics
Of Interest to EOL (cont.) • Taxon Names and Concepts • Names, nomenclature, taxonomy, and concepts • Defines RDF and XML standards for sharing data • Many options: DarwinCore, TCS XML, TDWG Ontology • Multimedia • Large percentage of EOL’s data • No overwhelmingly popular multimedia schema
EOL’s Use of Standards • TDWG Taxon Concept Schema • Hierarchy search and details API used by LifeDesks and EDIT ScratchPads • GNA accepts TCS as an import format • Species Profile Model • Plazi.org open literature repository • DarwinCore elements used in EOL Content Schema
EOL’s Use of Standards • EOL Content Schema • Taxon fields are primarily Darwin Core • Data Object metadata uses Dublin Core • Recommends the use of ISO language codes • Locations in W3C geo vocabulary • Species Profile Model Info Items
Representation - XML • Long time popular interchange format • Relatively easy to generate and consume • Well supported in programming languages and GUI applications • XML Schemas are inflexible • XML Documents can be bloated and contain more tags than data
Representation - RDF • Current specification completed in 2004 • Generic framework makes statements about resources • 3-part statements can be represented in XML or RDF triple • Integral component of today’s semantic web • Minimal support for generation and consumption • New framework still isn’t gaining momentum or widespread adoption
Other Representation • Tab delimited or CSV text files • Minimal filesize, bandwidth, parsing needed • Excellent for harvesters • GBIF pushing use of CSV files • Excel still popular with researches, exports CSV files • Minimal normalization possible, interrelationships awkward to represent