380 likes | 496 Views
Networking institutional repositories in Germany – DINI / DFG projects (… and DRIVER). Frank Scholze Stuttgart University Library KUB Seminar on Open Access, Copenhagen, 29.11. 2007. Overview. General DINI certificate DINI / DFG projects OA network OA statistics OA citation DRIVER.
E N D
Networking institutional repositories in Germany – DINI / DFG projects (… and DRIVER) Frank Scholze Stuttgart University Library KUB Seminar on Open Access, Copenhagen, 29.11. 2007
Overview • General • DINI certificate • DINI / DFG projects • OA network • OA statistics • OA citation • DRIVER Frank Scholze
Open Access strategies Disciplinary strategy Disciplinary Repositories BMC, PLoS, ACP … Self-archiving „green“ OA publishing „gold“ Institutional Repositories University Presses Institutional strategy
The current situation for digital repositories • More than 1000 institutional repositories worldwide, about 120 in Germany • Many others: disciplinary, national, … • Many types: Primary data, textual documents, learning materials, multimedia objects, code … • Documents: incl. pre-prints, postprints, technical papers, dissertations, theses … • Various repository software Frank Scholze
What is known about repositories? • Many have the OAI-PMH implemented • small but relevant local specialties • Some international registries exist • OpenDOAR, ROAR … • Some national registries exist • DINI list … • Some search engines exist • BASE, OAIster, Google Scholar … Frank Scholze
Collaboration in repositories • Very few mature national repository organizations/collaborations • SURF, DINI … • No trans-national repository organization/collaboration • Lack of data harmonization, orchestration of services Frank Scholze
From the user point of view[ talking about researchers ] • Fragmented, obscure information landscape • content can be (partly) searched and found • quality and re-use differs from repository to repository Frank Scholze
Research process and repositories Frank Scholze From: e-SciDR Lisbon workshop, 4th September 2007
“Make it workable” • Focus on existing repositories and services • Focus on Institutional Repositories • Rapid progress over the last years • Inherent sustainability (e.g. libraries) • Adequate technical homogeneity (OAI-PMH) • Focus on textual materials Frank Scholze
General • DINI certificate • DINI / DFG projects • OA network • OA statistics • OA citation • DRIVER Frank Scholze
DINI • Deutsche Initiative für Netzwerk Information(German Initiative for Networked Information)) • Coalition of German Higher Education Infrastructure- or Service-Institutions • Libraries • Computing Centres • Media Centres • Scientists • 8 Working Groups Frank Scholze
DINI Certificate • Launched in 2003 by DINI Electronic Publishing working group • Quality control for Document and Publication Repositories • Organizational, technical, personal and policy aspects • Defines a set of minimum standards (requirements) for a repository and its operator(s) mandatory for modern scholarly communication • Recommends foreseeable developments that might turn into future requirements • DINI Certificate 2007 released September 2006 Frank Scholze
DINI Certificate - Content • Visibility of the Service • Policy, Guidelines • Author Support • Legal Aspects • Security, Authenticity and Data Integrity • Indexing • Subject indexing • Metadata Export • Interfaces • Logs and Statistics • Long-term Availability Frank Scholze
Certification in practice • Certificate 2004: 19 Services certified • Certificate 2007: 2 services certified, 4 in progress • Common issues during the certification process • policy • persistent identifiers • documentation • Results of certification • Certification as development of the service common experience • Certification as marketing action experiences range from very good results to no effect at all Frank Scholze
General • DINI certificate • DINI / DFG projects • OA network • OA statistics • OA citation • DRIVER (Europe) Frank Scholze
DINI / DFG projects • Cluster of proposals to the DFG (coordinated by DINI) • Network of certified open access repositories (OA network) 2y • National input to EU repository infrastructure project DRIVER • Usage statistics (OA statistics) demonstrator proposal under review • Distributed open access reference citation service (OA citation) demonstrator proposal under review • Related DINI projects • OA information (open-access.net) 18m • CARPET - Community for Academic Reviewing, Publishing and Editorial Technologyproposal under review Frank Scholze
OA network • Building a networked infrastructure for German repositories • Project just started • Builds on DINI certified services • Relationship to DRIVER • German node for DRIVER • DINI certificate more comprehensive than DRIVER guidelines • Except for harvesting recommendations • Add-on services beyond DRIVER • OA statistics • OA citation Frank Scholze
OA network - architecture Processing Enrichment Aggregation Harvesting Frank Scholze
OA statistics • Local and aggregated usage data • Transparent and standardized data • E.g. COUNTER, IFABC, LogEc • Calculation of data is comprehensible • Klick-spans, robot elimination etc. Frank Scholze
Log Repository CO CO CO CO CO CO CO CO CO Link Resolver Services Metrics Data Mining Filtering Webserver -Log Rewrite module Infrastructure for Collecting Usage Data Log Repository OpenURL ContextObjects Link Resolver Aggregated Usage Data e.g. Log DB Log harvester (Service Provider) Log Repository Aggregated Usage Data Aggregated logs e.g. Log DB Normalise OpenURL ContextObjects or SUSHI Based on: Bollen, Johan and Van de Sompel, Herbert, OAI4, Geneva Frank Scholze Normalise (optional) -> Robots, psydonymization
Usage - indicators • Indicators can be calculated quantitatively or structurally • Example for a quantitative indicator: Usage Factor • Mean value of aggregated usage over a defined period of time • Example for a structural indicator: Usage Page Rank • Reciprocal voting of nodes in a network Frank Scholze
Dr. Dobb's Journal bX project - Comparison of Journal Usage PageRank and Journal Impact Factor Journal of Molecular Graphics and Modelling Usage > IF Frank Scholze
OA citation • Builds on work done in citebase, citeseer, google scholar, CDSware, ePrints • Extraction of references • Citation indexing (CI) • Expansion of the traditional document space for CI • competing with WCI • Calculating alternative indicators (Citation Page Rank) • Cf projects MESUR (LANL), Eigenfactor (U of Washington) • http://www.mesur.org/ • http://www.eigenfactor.org/ Frank Scholze
OA network demonstrator Frank Scholze
BASE integration demonstrator Open Access and Metrics Frank Scholze
General • DINI certificate • DINI / DFG projects • OA network • OA statistics • OA citation • DRIVER Frank Scholze
DRIVER • Digital Repository Infrastructure Vision for European Research • Environment and tools for building service-based Repository Systems • Sets of servicesrunning at differentnetwork sites, possiblyinmultipleinstances, interacting,dynamic,sharable, open • DRIVER I: BE, FR, GE, NL, UK • DRIVER II: IT, PL, EL, DK, SL, PT Frank Scholze
European Information Space • Includes the DRIVER Repository System • Providing users with advanced functionalities over a uniform European Information Space formed by aggregating multiple Repositories • Repositories • Can join or leave the infrastructure at any time • Are dynamically/automatically aggregated to populate and keep updated the DRIVER Information Space Frank Scholze
DRIVER and standards • Service Resources are implemented as WebServices and accessed through the corresponding Web Service Interface • Parameters calls are enveloped into SOAP messages • The Enabling Services are also compatible with REST • XML is the lingua-franca for the whole system • Resource internal status, i.e. Resource profiles • Profiles in Information Service use eXist XML engine Frank Scholze
DRIVER and standards II • DRIVER Aggregation • Harvesting according to OAI-PMH • Adopting OAI-Provenance best practice(OAI-about DRIVER Guidelines) • To be extended to other object models and harvesting protocols • Queries to Search Service and Index Service obey to SRW/CQL standard Frank Scholze
DRIVER Guidelines • Unambiguous identification of OA content (using sets if necessary) • Direct link to the digital object (dc:identifier) • Transient or persistent information on deleted objects • ISO 639-3 language format • Well defined batch size (100-200 datasets) • Adequate lifespan of the resumption token (24h) Frank Scholze
„Human“ portal for access Frank Scholze
Repository Landscape Frank Scholze
Managing Aggregation Frank Scholze
Test of Compliance Frank Scholze
Conclusion • Bring standardization of interfaces, protocols and formats to a wider community • Services based repository infrastructure • Identification of repositories • Harvesting, searching, browsing, re-use • Integrating repository infrastructure into the research process • Other outputs: Primary data, learning objects, patents … • Linking outputs from a research process perspective Frank Scholze
Information • DINI certificatehttp://nbn-resolving.de/urn:nbn:de:kobv:11-10075687 • DINI repository listhttp://www.dini.de/wisspub/repositories/german/index.php • OA networkhttp://www.dini.de/oa-netzwerk/ • OA citation (DOARC – Demonstrator)http://doarc.projects.isn-oldenburg.de/ • DRIVERhttp://www.driver-repository.eu/ Frank Scholze
Thank you! Frank ScholzeStuttgart University Libraryscholze@ub.uni-stuttgart.de