330 likes | 415 Views
DRIV(ER)ing Research Infrastructures. Yannis Ioannidis University of Athens, Hellas. 1st DRIVER Summit: Towards a Confederation of Digital Repositories, 16-17/1/2008, G ö ttingen. DRIVER. }. }. }. }. D igital R epository I nfrastructure V ision for E uropean R esearch. =? R esearch.
E N D
DRIV(ER)ing Research Infrastructures Yannis Ioannidis University of Athens, Hellas 1st DRIVER Summit: Towards a Confederation of Digital Repositories, 16-17/1/2008, Göttingen
DRIVER } } } } • Digital • Repository • Infrastructure • Vision for • European • Research =? Research
Imperatives • Comprehensive,global access to any type of scientific information • Minimum time and resources effort to access and use this information • Easy search/navigation, handling, manipulation, and re-dissemination of information • Maximum visibility to and communication with the research community, research impact • Long-term access and preservation of research results
High-Level Objectives • Develop an environment for integrating existing national, regional, or thematic repositories • Create a production-quality European DR infrastructure • Prepare the future expansion and upgrade of the DR infrastructure across Europe • Identify and promote the use of a relevant set of standards • Raise awareness among user communities
Challenges Organisation Data Software Create a European Repository Infrastructure Large number of providers and users Emphasis on content and services Hosting hardware and software Multifaceted endeavor: technology, organization Operational infrastructure, open for experimentations
Past-Present-Future Trans-National DRs (DRIVER) Universal DRs Pan-European and Inter-Thematic DRs National, Regional, and Thematic DRs
Repository Systems effortsIndividual institution site OAI-PMH • Centralized System • High installation and maintenance cost for hardware and software • Poor & limited scalability • Reuse by data and service duplication! UI Functionality resources Search … Index Index Information Space Content resources
Repository Systems effortsMultiple institution sites … … … … … … … … … … … … • Repeated efforts • High installation and maintenance cost for hardware and software • Poor & limited scalability • Reuse by data and service duplication! • Disconnected repositories
Repository Systems effortsSharing and reusing content • Centralized System • High installation and maintenance cost for hardware and software • Poor & limited scalability • Reuse by data duplication! Functionality resources UI Search … Index Index Information Space OAI-PMH Aggregator OAI-PMH OAI-PMH OAI-PMH … Content resources Institution Site Institution Site Institution Site
Repository Systems effortsSharing and reusing content … … … … … … … … … … … Genetic Data Netherlands E-Theses Germany Belgium wwPDB Greece India Italy ….. ….. … … … … … … … … … … … • Repeated efforts • High installation and maintenance cost for hardware and software • Poor & limited scalability • Reuse by data and service duplication! • Disconnected repositories • Sometimes desired policy • Often undesirable
DRIVER Infrastructure Vision Moving from building individual repositories or repository clusters, one at a time, repeating “things” again and again, to building a “generating engine”, a warehouse, an INFRASTRUCTURE, facilitating the above by offering appropriate generic, reusable services
DRIVER Infrastructure Vision • Build and maintain a sustainable European environment where content and functionality resources can be openly shared and integrated for use by any application or community • Sustainability • Maintainability • Scalability • Reusability
DRIVER Infrastructure Information Manager Manager AuthnAuthz Enabling Services Functionality Services UI UI Search Search … Index Index Index Store Content/Data Services Aggregator Aggregator Content Resources OAI-PMH OAI-PMH OAI-PMH OAI-PMH … … Institution Site Institution Site Institution Site Institution Site
Technological features • Fully flexible and dynamic • Repositories • Users • Communities • … • Services • Fully distributed System • Services are implemented as Web Services • Service Oriented Architecture (SOA) • Advantages • Scalability both on the data provided or the usage/load • Extensibility of functionalities is easily accomplished System Resources
Enabling Services Information Manager Manager AuthnAuthz • Infrastructure managementandservice/resourcegluing: handles all the nitty-gritty generic tasks (like an operating system) • Knowledge of all DRIVER Resources • Monitoring and coordination of Service interactions • Provides Authorization & Authentication mechanisms
Content/Data Services Collection OAI-Publisher • Information Space Management • Harvesting from external repositories • Aggregating: cleaning & enriching • Storage, indexing • Virtualization of content: collections • OAI-Publishing of harvested data Index Index Index Store Aggregator Aggregator
Functionality Services Alerts/Recommendations Profiling Communities • User-content based services • User Interfaces • Information (Content) Search & Browse • Personalized services • User and Communities • User Profiling • User recommendations & alerts UI UI Search Search
New Repository Scenario Enabling Services Information Manager Manager AuthnAuthz OAI-PMH OAI-PMH Functionality Services UI UI Search Search … Index Index Index Content/Data Services Store Aggregator Aggregator Content Resources OAI-PMH OAI-PMH OAI-PMH … … Institution Site Institution Site Institution Site Institution Site
New Service Scenario Enabling Services Information Manager Manager AuthnAuthz Index Store Validation OAI-PMH Functionality Services UI UI Search Search … Index Index Content/Data Services Aggregator Content Resources OAI-PMH OAI-PMH OAI-PMH … … Institution Site Institution Site Institution Site Institution Site
DRIVER European Information Space • Services for the creation, maintenance, and access to the European Information Space Functionality Layer Repositories Data Layer Enabling Layer
Data sharing & Service reuse • Belgium scenario • Use European DRIVER infra • Have a storage/Index for themselves • Provide their (Belgian) data to Europe • E-theses scenario • Include European theses documents in overall infra • Make these visible through virtual mechanisms (collections) for specialized searches • India Scenario • Deploy DRIVER infrastructure for all their repositories
DRIVER infrastructure: the benefits DLS (India?) DLS (Belgium) DRIVER Infrastructure Functionality Layer Repositories Data Layer Enabling Layer
Current DRIVER content > 200,000 documents
Current state of production • First TEST-BED released (v1.0) • Enabling Layer: Services deployed on DRIVER sites across Europe • Data Layer: now aggregating 70 Repositories from 6 Countries (FR,BE,NL,DE,UK, IT) • Functionality Layer: delivering Search User Interface with special functionalities: collections, recommendations, communities • One running DIS: “DRIVER European Information Space” counting 51 reps, for 250.000 Open Access docs
Content Resources • Focus on Institutional Repositories • Rapid progress over the last years • Inherent sustainability (e.g. libraries) • Adequate technical homogeneity (OAI-PMH) • Textual data • Selection of IRs based on • Maturity • Policies • Technologies used
Content Sources • Initially 51 institutional repositories • 15 from the Netherlands (coordinated by DARE) • 20 from the UK (coordinated by SHERPA) • 14 from Germany (adhere to the German DINI-standard) • 1 from France (CNRS) • 1 from Belgium (UGent) • Later raised to 70+ and growing • More repositories to be identified and included • Joint policies and objectives • Broad and multiple user groups • Metadata, technical, and organisational standards
Future issues • Towards release v1.1 • Addition of new DISs sharing the European Information Space • Belgium • Ireland • Electronic Theses and Dissertations • India? • more to come… • New content types, and compound documents/scientific objects • New functionality services
Simple Search Scenario Index IS Search RS UI
DRIVER Activities Raising Awareness / Outreach Programme Focussed Studies Content: Organisation and Provision Infrastructure Middleware Development/ Implementation
DRIVER Funding • DRIVER project: 18 months (6/06-11/07) • An organization and a testbed system • DRIVER2 project: 24 months (12/07-11/09) • A confederation and a production system • Research on next-generation issues • DRIVERn project • Driver Confederation members • Member states
Summary DRIVER drives Europe towards full unification of its scientific information www.driver-community.eu