1 / 19

Data management in deWELopment: experiences from other EU projects and recommendations

Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund. Data management in deWELopment: experiences from other EU projects and recommendations. Jannicke Moe (NIVA) deWELopment project meeting 01 .02.2010, Warsawa. Outline.

erik
Download Presentation

Data management in deWELopment: experiences from other EU projects and recommendations

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Polsko-Norweski Fundusz Badań Naukowych / Polish-Norwegian Research Fund Data management in deWELopment: experiences from other EU projectsand recommendations Jannicke Moe (NIVA) deWELopment project meeting 01.02.2010, Warsawa

  2. Outline • Experiences from data management in EU project WISER • Is data management in WISER relevant for deWELopment? • Data management within WPs vs. Central project database • Recommendations for deWELopment • Central project database or not? • If yes; raw values or calculated metrics?

  3. 1. Experiences from data management in EU project WISER

  4. Data management in WISER builds upon data management in EU FP6 project REBECCA WISER Data Service team

  5. Is data management in WISER relevant for deWELopment? Commonalities • Similar aims and data types • BQEs in rivers and lakes • Must combine many BQEs • different sampling methods etc. • Biological data to be combined with common abiotic data • for metric analysis • Biological data (or metrics) to be combined for different BQEs • for complete waterbody status assessment Differences • WISER is more complex: • Freshwater data + coastal data • New field data + old data • Some WPs depending on data collected by other WPs • 25 partners in ~20 countries • Data and results must be delivered to external groups (GIGs - intercalibration process) • Data management in deWELopment can be simpler

  6. Combining BQEs (Biological Quality Elements) Phytobenthos Hydrology Macroinvertebrates Acidification Organic

  7. From raw data - via database - to data analysis These steps must be done regardless of how you choose to store the data, but it is more practical to do these steps in Access than in Excel.

  8. Recommended way to store data: MS Access database with related tables

  9. Dilemma for data management: in each WPs or in central project database? Solution for WISER: Step 1: Data compilation per BQE - within each WP • Data Service provides template for recommended DB structure • WPs do data cleaning, standardisation of taxonomy and units, etc. • Data analyses will reveal errors => further data corrections Step 2: Combination of data across BQEs - by Data Service team • Necessary for making assessment with >1 BQE • BQE-specific databases should now have common features which makes it possible to combine them into a Central project DB

  10. Newdata 78-L-NC Data flow - general idea and example Data sources WP3.1 WP4.1 WP5.1 WP6.1 51-L-C Central database WP3.2 WP4.2 WP5.2 WP6.2 53-LR-C WP3.3 WP4.3 WP5.3 WP6.3 WP3.4 WP4.4 WP6.4 Meta-database ... WP5.2 needs the same data+ data from other lake BQEs Metadata for each dataset Example: WP3.4 needs data for fish in lakes ...

  11. Different WPs have very different data types, experience with data management, needs for assistance etc. Difficult to develop one common database structure which fits all needs of all WPs Current status in WISER: Data Service team has developed ”WISER common database structure” For WPs experienced with data management (f.ex. lake fish): Step 1: WP uses their own existing DBs also for storing new WISER data, and takes care of all data management themselves Step 2: WISER Data Service transfers the WP’s data (subset) to Central DB For WPs not experienced with data management (f.ex. lake phytoplankton): Step 1: WP stores new WISER data in a new DB with ”common structure”, and is offered assistance/tools from WISER Data Service Step 2: WP’s database can be imported directly into to Central DB Lessons from WISER data management

  12. 2. Recommendations for deWELopment

  13. Should biological values be stored in project DB as calculated metrics? BENEFITS • DB can have simpler structure • Easier to compile • Easier to extract tables for data analysis • NIVA experience from developing EEA DBs for biological metrics COSTS • Difficult / impossible to quality-check biological metrics after import to DB • Original project data are not easily available • Possible compromise for central project DB: BQE groups provide raw data + instructions for calculation of metrics. Data manager imports raw data, calculate metrics and stores them as metrics. • The best solution depends on the various needs of this project...

  14. Questions regarding data management to be considered during this meeting • Needs • Combination of biological data from different BQEs: Do you need to use raw biological data, or only biological metrics? • Evaluation of uncertainty: Do you need to use raw biological data, or only biological metrics? • ... • Responsibilities • Who is overall responsible for deWELopment data management? • For each BQE group, who is responsible / contact person for data management? • What should be the role of NIVA in data management? (Give recommendations? Practical assistance? Some responsibility?) • Ownership of data • Who owns the raw data? (The BQE group leader? The data collectors? The project consortium?) • Who makes final decisions regarding storage and use of raw data? (Project leader? Already regulated by project contract?)

  15. Who is responsible for which parts of data management within a BQE group? - Suggestions

  16. TAKK FOR OPPMERKSOMHETEN!

  17. ”Table format” (cross table) • Common for storing own data • Easy to read for humans • Can make summaries and figures directly in spreadsheet • ... which is not recommended! • Suitable for data with few "gaps" • Phys/chemistry: OK • Species: many empty cells • Difficult to transform to list format • requires macro • OK for data analysis in R • but can only import 1 header row; must replace empty cells, ... ”List format” • Required for databases • Easy to read for machines • Difficult to do summaries and plotting directly in spreadsheet • Suitable for all types of data • Easy to transform to table format (e.g. in Excel or Access) • OK for data analyses in R Difficult Easy

  18. Role of data holders (WPs)vs. Data Service Team ("Module 2")

  19. NEED TO HAVE for data manager Waterbody codes (not names) and station codes (not names) Taxonomic codes (not names) ”Cleaned” data NICE TO HAVE for data manager Database software (not Excel) Recommended: MS Access Standardised phys/chemical determinands and units Geographic coordinates Data reorganised into "list format" This applies regardless of how the data are managed (centrally or within each WP)

More Related