180 likes | 273 Views
Joining the Dots Managing and identifying geolocated data by DOIs and IGSNs. Jens Klump | OCE Science Leader Earth Science Informatics. 20 August 2014. Mineral Resources Flagship. TERENO Terrestrial Environmental Observatory Managing data in environmental monitoring. What is TERENO?.
E N D
Joining the DotsManaging and identifying geolocated data by DOIs and IGSNs Jens Klump | OCE Science Leader Earth Science Informatics 20 August 2014 Mineral Resources Flagship
TERENO Terrestrial Environmental Observatory Managing data in environmental monitoring
What is TERENO? • TERENO is an infrastructure initiative by the Helmholtz Association to provide an environmental monitoring infrastructure for the scientific community. • Construction started in 2008, operation planned for 25 years. • 4 regional observatories • TERENO Northeast has • 8 study sites • 32 platforms • Approx. 35 M data entries from various sensors • More platforms being added • The other three regional observatories are of similar scale. Joining the dots | Jens Klump
Regions of high climate vulnerability Regions of high vulnerability • Droughts • Heat waves • Floods • Winter storms • Loss of biodiversity • Landsides From: Rüdiger Glaser (2008) Klimageschichte Mitteleuropas1200 Jahre Wetter, Klima, Katastrophen Joining the dots | Jens Klump
TERENO Regional Observatories • Northeastern German Lowland Observatory • Coordination: GFZ • Harz / Central German Lowland Observatory • Coordination: UFZ • Eifel / Lower Rhine Valley Observatory • Coordination: FZJ • Bavarian Alps / pre-Alps Observatory • Coordination: HMUG und KIT Joining the dots | Jens Klump
Atmosphere Terrestrial Hydrosphere & Pedosphere Terrestrial Biosphere TERENO Research Goals Investigate interactions and feedbacks between different compartments: Bridging the gap between measure-ment, model and management: Joining the dots | Jens Klump
TERENO Northeast Presentation title | Presenter name
Combination of geoarchives with process observations • Region impacts of Global Change on near-natural terrestrial ecosystems and landscape in space and time • Integrated system analysis of climate- and landscape development/process understanding • Combination of real-time process observations (e.g. soil moisture, hydrology, vegetation) and evaluation of geoarchives(lacustrine, colluvials, peats, soils) Field observation Geoarchive Remote Sensing Presentation title | Presenter name
TERENO data management Presentation title | Presenter name
System architecture Presentation title | Presenter name
TERENO data portal Joining the dots | Jens Klump
Looking Ahead: Future Directions Data Driven Research in the Geological Sciences
Identifiers for software www.sciforge-project.org • Similar to data an specimens, also software should be identifiable in a persistent way. • Establish the missing link between papers and data. • Make software recognisable as a scientific achievement. • Make science more transparent and reproducible. • Simply assigning DOI to software is a good start but might not be good enough. • Again, we encounter the question of identity (version) and location (repository). Joining the dots | Jens Klump
Managing Data from Sensor Networks Joining the dots | Jens Klump
Working with very large data sets • Some data sets are too large to be inspected in detail, or even to be loaded on a desktop PC. • Example: How would one check three years of meteorological radar data for anomalies? • Data mining today mainly involves numerical and textual media. • Processing will have to move from the desk top to the cloud for large data sets. Joining the dots | Jens Klump
Linked Data • Use URIs to denote things. • Use HTTP URIs so that these things can be referred to and looked up ("dereferenced") by people and user agents. • Provide useful information about the thing when its URI is dereferenced, leveraging standards such as RDF, SPARQL. • Include links to other related things (using their URIs) when publishing data on the Web. Joining the dots | Jens Klump
Summary • Persistent identifiers allow us to publish, cite, identify data, specimens and software. • Data publication is now becoming more common. • The principles of data identification can also be used with materials (e.g. IGSN) and software. • Future publications might consist of elements linked by identifiers: • Interpretation (“Paper”) • Data • Materials • Software and workflows • More and more data repositories offer API based on linked data. • Future data “publication” will also cater both for people and user agents. Joining the dots | Jens Klump
Thank you Mineral Resources Flagship Jens KlumpOCE Science Leader Earth Science Informatics t +61 8 6436 8828 e jens.klump@csiro.au w www.csiro.au Mineral Resources Flagship