230 likes | 242 Views
Learn about the current research infrastructure in Greece, the implementation of CERIF-based systems, tools, and lessons learned, supporting the adoption of CRIS in universities and research centers.
E N D
Implementation of CERIF-based infrastructure for enabling CRIS adoption in the Greek academia Vasilis Bonis Nikos Pougounias, Dimitris Karaiskos National Documentation Centre / National Hellenic Research Foundation, Greece
Outline • Current Research Infrastructure System in Greece • Native CERIF and CERIF related systems • CERIF Tools implemented by EKT • Lessons learned • Future Steps
Greece - Research Community Demographics • 22 Universities, 14 Technological Educational Institutes, 13 Research Centres in total • 52.358 researchers, 29.054 Full-Time Equivalents (2013) • Source: «Research and Development Expenditure and Personnel in Greece in 2013 - Main indicators, National Documentation Centre, 2015» (http://metrics.ekt.gr/sites/metrics/files/RDstatistics_2013final_Greece_en.pdf)
Greek CRIS infrastructure • Each organisation operates its own institutional CRIS system • Policies, procedures and workflows at the institutional level • All institutional CRIS systems are based on the same CRIS software platform • CRIS software developed by EKT (Native CERIF database) • A central aggregator will provide a unified point of access to the entire information • CERIF XML/REST is used for interoperability
Support to institutions • CRIS software – including documentation for installation, configuration, usage • Online helpdesk for support • http://helpdesk.cris.ekt.gr • Software distribution system • Including certification procedure, notifications for new releases, versioning • Semantics – vocabularies • Tools for data import to the CERIF database • CERIF XML • Excel / CSV and Dublin Core through an open source library (Biblio-transformation-engine, BTE)
The CRIS software platform – overview (1) • Native CERIF database - support all CERIF entities • Java, Spring, JPA, Elastic Search, Javascript/jQuery • Based on the open source CERIF-JPA library – used for the CERIF reference implementation (http://github.com/EKT/CERIF-Tools) • Search, faceted browsing • User interface (web forms) for editing all entities and semantic layer • Advanced support for data management / approval workflows
The CRIS software platform – overview (2) Researcher • Sophisticated user roles / authorization • Role-based access control (RBAC) in CERIF tables • Roles aligned with institutions’ main needs • Users stored in DB or logging in through LDAP. Single Sign-On • Reporting • Review institution’s hierarchical structure • Highly configurable (e.g. fields order, facets, permissions, etc.) Project Manager Indexer Content Manager Research Support Staff Research Administration Staff Authorized System Administrator
Aggregator - Architecture • Augmented CRIS • CERIF DB + aggregation related tables Aggregator EKT • Generic functionality • Relays messages • System agnostic • Accepts OAI-PMH data Harvester • CERIF XML payload • Per entity retrieval OAI Server • /{entity}?identifiersOnly=false&limit=X&pageSize=Y CRIS REST Institution CRIS DB • Direct access
Roadmap • 1st Phase – Early Adoption • Currently running until end of 2015 • To participate, 12 Universities, 3 TEIs and 1 Research Centre • Installations in 7 Universities, 1 TEIs and 1 Research Centre • Aggregator deployment and operation • CRIS current version 0.8.2 – next version to be released at the end of May 2015 • 2nd Phase – Complete Adoption • All institutions will install and operate CRIS • Aggregator fully operational (aggregation and reporting)
EKT’s Native CERIF Systems (1) • R&D Projects Registry (in progress) • Catalogs R&D related national and European projects and their associated organisations • Sources of data: • E-Corda (FP7 and Horizon2020) • Greek Monitoring Information System for National Strategic Reference Framework (NSRF 2007-2013) • RDI national organisations registry (supported by EKT) • Eurostat/UNESCO • More than 30.000 organisations • More than 10.000 NSRF projects and 20.000 FP7 projects
EKT’s Native CERIF Systems (2) • R&D Data Management System (DMS) • Stores, interconnects and manages • Greek R&D Expenditure and Personnel data as collected via online data collection system (Eurostat - Commission Regulation 995/2012) • Organisations’ profiles stored in the RDI national registry • Indicators as produced from primitive data • External R&D data sources as managed by EKT (e.g. FP7 projects etc.) • Goals • Single management point • Preservation and archiving (time series) • Data validation & estimation workflow • Benchmarking using external sources • Real time automated generation of indicators • Data exports (SDMX, CSV, Excel, JSON, CERIF-XML) • Reporting
EKT’s CERIF related systems (1) • Greek Researcher Directory – researchers.gr • Database of Greek Researchers - Researchers in Greek organisations • Access only to researchers – registration only by invitation • Pre-loaded information for permanent personnel of Greek public research performing organisations • Automatic loading of information from sources: • CrossRef, PubMed, Microsoft Academic Search, DBLP, Greek Books in Print, RepEc (economics), arXiv (Cornel University Library), Google Scholar (bibTex, ris, refman) • Link publications to researchers – automatic mechanisms • Automatic recognition of references from researcher CVs – Text analysis • Export profile in CERIF XML
EKT’s CERIF related systems (2) • Greek Researcher Directory – researchers.gr • Researcher Profile (exported in CERIF XML) • Affiliations • Education • Research expertise – disciplines (Frascati) and keywords • Identifiers (ORCID, MAS ID, Scopus Author ID, ResearcherID, …) • Awards / Distinctions (Best paper awards, Journal Editorships, Scientific Society Fellowships, …) • Publications • Journals, Conference Proceedings including impact figures, Books and Monographs • Projects (in development)
CERIF Tools Implemented (1) • CERIF Model • All CERIF entities as POJOs • 100% compliant to the latest CERIF version • CERIF Services • JPA data repositories for all entities • Common persistence and selection services • CERIF XML • Full support of CERIF 1.6 XML Schema • CERIF API • RESTful services based on current EuroCRIS draft recommendation • OAI-PMH – per entity retrieval
CERIF Tools Implemented (2) • Open source • http://github.com/EKT/CERIF-Tools • Mature • 4 years of continuous development • Deployed on a plethora of production systems • Ongoing Maintenance • Tools aligned to CERIF versions • New features and bug fixing
Production Systems Using EKT’s CERIF Libraries • DMS Metrics - Native CERIF DB + Custom tables • EuroRIs-Net+ Observatory - Native CERIF DB, CERIF XML • R&D Business Registry - Native CERIF DB, CERIF XML • Scopus DB - Native CERIF DB • PAERIP Inventory of Research Infrastructures - Native CERIF DB (older version) • CRIS - Native CERIF DB, CERIF XML, CERIF API • RICH - Native CERIF DB, CERIF XML • OpenAire - CERIF XML, CERIF API (Custom OAI-PMH implementation) • Greek Researcher Directory - CERIF XML
Data Migration Tool • Biblio-Transformation-Engine • Provided as Open Source https://github.com/EKT/Biblio-Transformation-Engine • Included in the core distribution of DSpace • JAVA
Using BTE for Data Migration • Identify CRIS related data • Identify per migration datasets • Map data to CERIF • Identify CERIF entities • Identify persistent identifiers • Identify relations • Identify semantics – Custom semantics may be necessary • Process, Clean, Merge source data • Implement migration (commonly OutputGenerator classes) • Run • Test
Lessons Learned • Central vs. decentralized CRIS – Keep data fresh • Minimizing the need for data migration and manual data entry – Tackle cold start problem • Horizontal need for persistent identifiers • CERIF semantic entities: • Difficulties in understanding and using • Need for Greek translations • User management – Integration of already existing user directories • Security issues – Data filtering • Policies and workflows • Communicate the value of CRIS
Future Steps (2015 - 2016) • CRIS Installations in more institutions • Collect more data in Aggregator and produce statistics/metrics • Implement visualizations for Aggregator statistics/metrics • Ongoing Maintenance of CRIS software and all CERIF tools • Fully aligned to CERIF versions • New features and bug fixing • Organize workshops and tutorials for institutions’ personnel • Completion of CERIF XML export for Terms Thesaurus • Completion of Projects Registry
Thank you for your attention Q&A • More info: • bonisv AT ekt.gr • http://metrics.ekt.gr/cris • http://helpdesk.cris.ekt.gr • http://github.com/EKT/CERIF-Tools • http://github.com/EKT/Biblio-Transformation-Engine