90 likes | 199 Views
Making Data discoverable & accessible Introducing Dept of Parks & Wildlife’s Marine Science information management infrastructure Florian Mayer, Dept Parks and Wildlife Presentation to ANDS round table 18 March 2014. Biodiversity Conservation
E N D
Making Data discoverable & accessible Introducing Dept of Parks & Wildlife’s Marine Science information management infrastructure Florian Mayer, Dept Parks and Wildlife Presentation to ANDS round table 18 March 2014
Biodiversity Conservation “Conserve, protect, manage native fauna & flora based on best practice science” Dept Parks & Wildlife Strategic Directions 2013-2014 Wisdom to inform policy Knowledge defensible & transparent The challenge Data management Data classification and sensitivity Digital information security Discoverability Accessibility Compliance Corporate culture and paradigm shift IT infrastructure and architecture Government agency woes – funding, agency restructure, locked out of academia Information from reproducible, automated analyses Data discoverable & accessible Research & Monitoring outcome-focused
The solution Data & metadata catalogue As used by 200+ govs&NGOs 4 years of my work Marines & OIM collaboration Works for Marines Available to Division & Dept Code https://bitbucket.org/dpaw/ Data catalog- Workflow automation - Stats work bench
Office for Information Management Scope Department-wide Mission To enable developers like me to deliver products like this data catalogue Marine Science Information Management Scope from Marine Science up to Science & Conservation Division Environment OIM’s department-wide infrastructure and policies Mission To deliver information management to Marine Science (serving as template for others)
Biodiversity Conservation “Conserve, protect, manage native fauna & flora based on best practice science” Dept Parks & Wildlife Strategic Directions 2013-2014 Wisdom to inform policy Knowledge defensible & transparent Information from reproducible, automated analyses Data discoverable & accessible Research & Monitoring outcome-focused
Biodiversity Conservation “Conserve, protect, manage native fauna & flora based on best practice science” Dept Parks & Wildlife Strategic Directions 2013-2014 Wisdom to inform policy Knowledge defensible & transparent Information from reproducible, automated analyses http:// Data discoverable & accessible R code as web app Data API Code repository Research & Monitoring outcome-focused Source code
Biodiversity Conservation “Conserve, protect, manage native fauna & flora based on best practice science” Dept Parks & Wildlife Strategic Directions 2013-2014 Wisdom to inform policy Reproducible report Knowledge defensible & transparent Information from reproducible, automated analyses Data + Code + Markup = PDF Data discoverable & accessible Sweave Data API Code repository Research & Monitoring outcome-focused Source code Collaboration
Biodiversity Conservation “Conserve, protect, manage native fauna & flora based on best practice science” Dept Parks & Wildlife Strategic Directions 2013-2014 Wisdom to inform policy Knowledge defensible & transparent Reproducible research Information from reproducible, automated analyses Data discoverable & accessible Simulate then observe (rinse&repeat) Research & Monitoring outcome-focused Mayer et al. 2010 Applying software engineering best-practice to scientific research
Internal workings of CKAN @ DPaW Ubuntu 12.04 LTS VM ~/projects/dpaw_docker/ckan dpaw_docker code repo clone First build creates docker image, copies some CKAN files from image to local file system, creates startup scripts and persistent files Modify settings and page templates Second build overlays modifications into image Your custom CKAN docker image /srv/dpaw/ckan/ Docker image scripts and persistent files http Startup container: /srv/dpaw/ckan/startup.sh Run shell in container: /srv/dpaw/ckan/shell.sh backup Database is persistent in /srv/dpaw/ckan/var/lib rsync