330 likes | 511 Views
Developing Tomorrow’s Infrastructure for Science John Womersley Director, Science Strategy Science and Technology Facilities Council presented by: Keith Jeffery Director, IT & International Strategy Science and Technology Facilities Council. Overview. Some STFC Science
E N D
Developing Tomorrow’s Infrastructure for ScienceJohn WomersleyDirector, Science StrategyScience and Technology Facilities Councilpresented by:Keith JefferyDirector, IT & International StrategyScience and Technology Facilities Council
Overview • Some STFC Science • Tomorrow’s Digital Infrastructure for Science • Supporting the Research Lifecycle • Some Policy Frameworks • Conclusion
What is STFC? • The Science and Technology Facilities Council (UK) • Created on April 1, 2007 • It is responsible for • fundamental research in particle physics, nuclear physics, astronomy, space • major UK facilities for the physical and life sciences • synchrotrons, light sources, lasers, neutrons • national laboratories at RAL, Daresbury, UKATC • international science projects • CERN, ESO, ESA, ILL, ESRF… • Over 2000 staff and an annual budget of over £700M
The Science we Address Some examples • Why is there a universe? • What is the origin of mass? • Was there ever life on Mars? • How are the chemical elements created? • How can we design better treatments for cancer? • How do cells work? • How can we create new materials to store energy?
STFC Facilities Neutrons and photons Provide complementary views of matter: Photons “see” electric charge – high atomic number nuclei Neutrons “see” nucleons – especially hydrogen atoms Imaging detector ComputingAnalysis Modelling beam data knowledge sample
Some STFC Projects Diamond phase 3 ESA centre Sapphire Materials Innovation Institute ISIS TS2 phase 3
Some STFC Projects 4GLS ESRF upgrade Hartree Centre Computational Science
Some STFC Projects HIPER ELI Future neutron sources: ESS/MW neutron source ILL 20/20 upgrade DIPOLE laser
Some STFC Projects European ELT Next generation Gravitational wave observatory FAIR SKA
Some STFC Projects International Linear Collider LHC upgrades Underground science Neutrinos, dark matter Neutrino factory
XFEL • Project “launched” on 5th June • This means DESY is now authorised to spend • XFEL GmbH to be set up by end of year • Our goal is to maximise our in-kind contributions within the £30M already allocated in LFCF • Pixel detector, streak camera…
Part 2 Tomorrow’s Digital Infrastructure for Science
The 7 C’s • Creation • Collection • Capacity • Computation • Curation • Collaboration • Communication
Its all about scale Creation: • Examining the detector arrays on the MAPs spectrometer at ISIS
Its all about scale Collection: • An ATSR image of Sicily with Mount Etna eruption; taken 24 July 2001
Its all about scale Capacity: eg at CCLRC • 20PB by 2010 • 1PB = 1015 Bytes • Billions of Floppys • Millions of CDs • Thousands of PCs (today’s)
Its all about scale Computation: • 3-D rabbit heart MRI rendered at 512 x 512 x 1400 using 12 GPUs • Data needs interpretation and analysis Picture of heart
Its all about scale Curation: • Some CCLRC based Repositories • The Atlas Datastore • The British Atmospheric Data centre • The CCLRC Data Portal • The CCLRC Publications Archive • The CCPs (Collaborative Computational Projects) • The Chemical Database Service • The Digital Curation Centre • The EUROPRACTICE Software service • The HPCx Supercomputer • The JISCmail service • The NERC Datagrid • The NERC Earth Observation Data Centre • The Starlink Software suite • The UK Grid Support Centre • The UK Grid for Particle Physics Tier 1A • The World Data Centre for Solar-Terrestrial Physics Atlas Datastore Tape Robot
Its all about scale Collaboration: • Barrel toroid magnet and detector module from ATLAS at CERN ATLAS: • 2000 scientists • 150 Universities • 30 countries
It’s all about scale Communication: “The web has changed everything...” • Technology enables: • access to everything • distributed,searchable information sources • Interlinking enables: • Revalidation of results ‘repeat experiment’ • Discovery enables: • new knowledge from old • Archiving enables: • Recording unique events • Antarctic environmental data CCLRC’s “e-pubs” Institutional Repository has records of 20,000 publications spanning 20 years
The Innovation Lifecycle Enabling Wealth Creation Enabling Knowledge Creation Strategic Direction The Body of Knowledge The Research Process The Government Process Improved Understanding Improved Quality of Life Quality Assessment Aggregation of Knowledge lies at the heart of the innovation lifecycle
Virtual Research Environment the researcher acts through ingest and access Archival Creation Access Curation Information Infrastructure Services the researcher shouldn’t have to worry about the information infrastructure Network Storage Compute The Information Infrastructure The Body of Knowledge
Distinct Infrastructures / Distinct User Experiences Data Analysis Data Analysis Data Analysis Analysed Data Analysed Data Analysed Data Publication Data Publication Data Publication Data Raw Data Raw Data Raw Data Publications Publications Publications Facility 2 Facility 1 Facility 3 Current View
Common Infrastructure / Common User Experience Data Analysis Raw Data Catalogue Publications Catalogue Analysed Data Catalogue Publication Data Catalogue Data Analysis Data Analysis Data Analysis Analysed Data Analysed Data Analysed Data Publication Data Publication Data Publication Data Raw Data Raw Data Raw Data Publications Publications Publications Facility 1 Facility 3 Facility 2 Capacity Storage Data Repositories Publications Repositories Standards/ Converters Future View
Part 4 Some Policy Frameworks
Some Policy Frameworks • UK Research Councils’ initiative on access to research outputs • 2005 and 2006 statements of principles • OECD Guidelines on Access to Research Data • 2004 Declaration, 2007 Guidelines • UK Office of Science and Innovation Report (2006) Developing the UK’s e-infrastructure for science and innovation
RCUK Policy (2005, 2006) Four principles: • Ideas and knowledge derived from publicly-funded research are made available and accessible for public use, interrogation, and scrutiny, as widely, rapidly, and effectively as practicable • Effective mechanisms are in place to ensure that published research outputs are subject to rigorous quality assurance, through peer review • The models and mechanisms for publication and access to research results are both efficient and cost-effective in the use of public funds • The outputs from current and future research can be preserved and remain accessible not only for the next few years but for future generations
OECD Recommendation (2006) OECD Recommendation on Access to research data from public funding 13 principles: A – Openness • Openness means access on equal terms for the international research community at the lowest possible cost, .... • B – Flexibility, C – Transparency, D – Legal conformity, E – Protection of intellectual property, F – Formal responsibility, G – Professionalism H – Interoperability • Technological and semantic interoperability is a key consideration in enabling and promoting international and interdisciplinary access to and use of research data. ... • I – Quality, J – Security, K – Efficiency, L – Accountability M – Sustainability • ... taking administrative responsibility for the measures to guarantee permanent access to data that have been determined to require long-term retention. http://webdomino1.oecd.org/horizontal/oecdacts.nsf/Display/3A5FB1397B5ADFB7C12572980053C9D3?OpenDocument
OSI e-Infrastructure Steering Group “Developing the UK’s e-infrastructure for science and innovation” • Cross departmental view • 6 working groups: • Data and Information creation • Preservation and curation • Search and navigation • Virtual research communities • Networks, compute power and storage hardware • Middleware, AAA and digital rights management • Reports available on UK National eScience Centre Website • http://www.nesc.ac.uk/documents/OSI/index.html • Note here report on Data and Information Creation
OSI e-Infrastructure Steering GroupData and Information Creation Key findings: • The future e-infrastructure should directly support the management of data throughout its lifecycle ’from cradle to grave’ • The future e-infrastructure should reduce the cycle time from conducting research, through analysis, publication and feedback into new research • There should be a much greater use of simulation-based research and its much closer integration with physical research • The future e-infrastructure should support the use for research purposes of data collected for other purposes • The future e-infrastructure should be based upon standards which support uniform classification, integration, certification and citation of data across all sources
Conclusion • STFC has massive holdings of data and information • The benefits of ready, online, open access to research are self-evident • wealth creation, improvement in quality of life • The data and information requires: • Preservation: making it available indefinitely • Curation: making it understandable indefinitely • This implies use of metadata • Needs to be ‘more intelligent’ (semantics on syntax) • Needs standards (for interoperation) • This is what the PARS Alliance is all about
The End