350 likes | 475 Views
The Importance and Future Trends of Sharing Biodiversity Data. Chau Chin Lin Taiwan Forestry Research Institute. Biodiersity: Tree of Life on the Planet. Chris King 2011 Genotype 1.1.9. Biodiversity: Information of Life. Class: Insecta. Taxonomic Names. Sequence Data. Order: Lepidoptera.
E N D
The Importance and Future Trends of Sharing Biodiversity Data Chau Chin Lin Taiwan Forestry Research Institute
Biodiersity: Tree of Life on the Planet Chris King 2011 Genotype 1.1.9
Biodiversity: Information of Life Class: Insecta Taxonomic Names Sequence Data Order: Lepidoptera Synonym: Pyralis nubilalis Hübner, 1796 Locus: AAL35331Definition: acyl-CoA Z/E11 desaturase 1 mvpyattadg hpekdecfed... Family: Pyralidae Genus: Ostrinia Hübner, 1825 Taxonomic Descriptions Species: Ostrinia nubilalis (Hübner, 1796) Diagnosis: Wingspan 26-30mm; sexually dimorphic;male: forewings ochreous to dark brown; female: forewings pale yellow; … Vernacular (EN): European Corn-borer Vernacular (DE): Maiszünsler Vernacular (ES): Piral del maíz Vernacular (FR): Pyrale du maïs Digital Literature and Web Resources Family: Gramineae Pheromones of Ostrinia http://www.nysaes.cornell.edu/fst/faculty/acree/pheronet/phlist/ostrinia.html Foodplant: Zea mais L. 1753 Collection: DGH Lepidoptera Record id: DGHEUR_003217 Country: France Coordinates: 03.047˚E 48.730˚N Date: 28 June 2003 Collector: Donald Hobern Individuals: 3 Richness: Biotic Interactions Abiotic Average Rainfall Location: 48.82°N 2.29°E Jan Feb Mar Apr ... 182.3 120.6 158.1 204.9 ... Spatial /Temporal Observations
Challenges and Opportunities Scientific innovation has been called on to spur economic recovery and to inform sustainability. Data collection,curation, and access are central to all of these issues Science 311:692-694, 2011
Data Informs the Lost of Biodiviersity Annual Cumulative Worm et al., Science 2006
Data Enhances Understanding of The Real World Understanding this disease requires knowledge of epidemiology, genetics, and transmission modes, along with their ecological contexts. Integrating ecologically pertinent data into the chain of information from the gene to the biosphere will significantly enhance our understanding of the natural world. Whitfield J. 2003 Ape populations decimated by hunting and Ebola virus. Nature 422:551
information Data/Raw data/Dataset Observations/experiments the real world Data Comes from Research of the Real World
The Traditional Paradigm of Data Publications Analysis and modeling Raw Data Problem Data Collection Planning
(Michener et al. 1997) Data Entropy Time of publication Specific details General details Retirement or career change Information Content Accident Death Time
used data Selection and extraction Secondary Observations Planning New Paradigm of Data Publications Analysis and modeling Collection Original Observations Problem Definition (Research Objectives) Planning
A Data-intensive Approach Data Cyberinstrstructure The Real World Collaboratories Observatory Networks Information Resources Synthesis Hubs and Nodes Fundamental Research Ecoinformatics
Adapting Cutting Edge Technologies Data Preserving and managing Data collecting Data transferring Data discovering, Integrating,analyzing,visualizing
Dealing with Data Collecting Change Interpret a number 10 x daily Interpret a pattern 1,000 x daily
Raw data Management, Archiving, & Curation Discovery, Retrieval Integrating, Analysis & Visualization Information Knowledge Towards Automation of Data Processing
Metadata? Metadata Is the Key to New Paradigm
Metadata Data Date (YYYYMMDD) Temp (°C) Precip. (mm) Obs. #1 Obs. #2 Obs. #3 What Is Metadata-An Example
Who? When? What? Where? How? Metadata Is Data about Data … and All Other Facet ofDataset!
Many Standards Can be Chosen • Standards for Metadata • ISO 19115 is a geo-spatial metadata standard developed by ISO/TC 211. ISO 19115 defines a comprehensive metadata model for geographic objects. ISO/TC 211also defined a smaller set of core metadata elements (shown on example slide.) This core contains the minimum elements that satisfy the requirements of an ISO conformant metadata record. The ISO 19115 standard does not specify storage format, but XML schemas are under development for an XML encoding of it (in full or for specialized profiles). • CSDGM/FGDC (Content Standard for Digital Geospatial Metadata) is a standard for metadata for geographic objects developed by FGDC (Federal Geographic Data Committee). However, this standard is not limited to spatial data. FGDC enables development of profiles, i.e. customization of the standard to suit the needs of a particular application domain (while staying within the framework of the standard). • SPOT imagery FGDC example • http://gcmd.nasa.gov/servlets/md/getdif.py?entry_id=[GCMD]CANEMRCCRSSPOT&xsl=dif_to_fgdc-html.xsl¤tTab=¤tItem=&portal=gcmd • EML(Ecological Metadata Language) • http://knb.ecoinformatics.org/data.html • Darwin Core • The Darwin Core (sometimes abbreviated as DwC) is a standard designed to facilitate the exchange of information about the geographic occurrence of species and the existence of specimens in collections.
What Is EML? Ecological Metadata Language is… • an ecological metadata standard • very extensible; it can be used to describe many different types of data • comprehensive and supports a rich set of constructs to fully describe data • XML and is defined by an XML Schema • exploitable by different computer applications
2005 1990 1995 2000 ‘01 ‘96 ‘91 ‘97 ‘92 ‘02 ‘03 ‘98 ‘93 ‘04 ‘99 ‘94 EML History Second EML workshop KNB Tools Workshop Michener et al. paper First EML workshop FGDC CSDGM 2.0 FGDC CSDGM RS EML ASU meeting XML 1.0 released NCEAS formed FGDC CSDGM BDP approved FGDC created NBII created FLED report ISO 19115 GML 3.0 EML 1.4.x EML 1.0.0 EML 1.3.0 EML 2.0.0 EML 2.0.1 Early ecological metadata work in LTER and elsewhere EML 2.0.0rc1-3 EML 2.0.0 beta1-9
Darwin Core – a glossary of terms higherClassification coordinatePosition specificEpithet geodeticDatum associatedSequences taxonConceptID taxonRank associatedSequences: A list (concatenated and separated) of identifiers (publication, global unique identifier, URI) of genetic sequence information associated with the Occurrence.
Species Distribution Prediction (Abies Kawakamii)
An Example of Biodiversity and Ecological Data Metadata provides data source Key words query Retrieval of dataset Data analysis Metadata list
Thank You! http://taibif.tw