The Common European Research Information Format (CERIF)
Brigitte Jörg, M.A. (Information Science), Language Technology Lab, German Research Center for Artificial Intelligence (DFKI), Saarbrücken, Germany
Introduction of Speaker
Brigitte Jörg, M.A.: Information Science, Information Systems, Business Administration
• Researcher, Project Manager, DFKI GmbH, Language Technology Lab, Saarbrücken
• CERIF TG Leader, Board Member euroCRIS
• Contact: brigitte.joerg@dfki.de, http://www.dfki.de/~brigitte/
Outline
• What is CERIF?
• Grounding Explanations
  • Model
  • Metadata
  • Research Information
  • CRIS
• The Conceptual (Logical) CERIF Model
  • Entities
  • Relationships
  • Structure
• The CERIF (XML) Interchange Format
• The CERIF 2008 1.0 Release
• CERIF Examples and Related Activities
What is CERIF? Common European Research Information Format
[Diagram: research entities (Funding Programme, Organisation, Person, Project, Service, Skills, Publication, Equipment, CV, Patent, Product, Event) connected through Classification and Semantics]
What is CERIF? Common European Research Information Format
• Specification (Conceptual Level): a concept about research entities and their relationships
• Model (Logical Level): an abstract formal description of the concept about entities and their relationships
• Database Scripts (Physical Level): a formal machine-readable description of the concept
  • SQL Script: CREATE TABLE Person; CREATE TABLE Project; CREATE TABLE OrgUnit
Organisation of data/information accordingly!
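The step from the logical model to the physical level can be sketched in a few lines of Python with the standard `sqlite3` module. This is only an illustration under stated assumptions: the slide names the three tables but no columns, so the `id` and `uri` columns and the `create_schema` helper are hypothetical and are not taken from the CERIF database scripts.

```python
import sqlite3

# Hypothetical minimal columns: the slide names only the three tables.
DDL = """
CREATE TABLE Person  (id INTEGER PRIMARY KEY, uri TEXT);
CREATE TABLE Project (id INTEGER PRIMARY KEY, uri TEXT);
CREATE TABLE OrgUnit (id INTEGER PRIMARY KEY, uri TEXT);
"""


def create_schema(conn: sqlite3.Connection) -> list[str]:
    """Run the DDL and return the names of the tables that now exist."""
    conn.executescript(DDL)
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    return [name for (name,) in rows]


conn = sqlite3.connect(":memory:")  # throwaway in-memory database
tables = create_schema(conn)
print(tables)
```

The point of the sketch is only that the same three entities appear at every level: as a concept, as a model, and finally as executable table definitions.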
What is CERIF? Common European Research Information Format
• A data model (conceptual, logical, physical)
• Allows for a (metadata) representation of
  • research entities
  • their activities / interconnections (research)
  • their output (results)
• Allows for high flexibility with formal semantic relationships
• Enables quality maintenance, archiving, access and interchange of research information
• Supports knowledge transfer to decision makers, for research evaluation, research managers, strategists, researchers, editors, the general public
What is CERIF? Common European Research Information Format
• CERIF is an EU Recommendation to Member States: http://cordis.europa.eu/cerif/
• The European Commission (EC) has authorised euroCRIS to maintain and develop CERIF and its usage: http://www.eurocris.org/cerif/cerif-releases/
The CERIF Evolution
[Timeline diagram: 1987, 1991, 2000, 2006, 2008]
• 1987: Similar ideas (UN/UNESCO, OECD, CODATA); EU Working Group on Research Databases Workshop
• 1991: CERIF 91 (PROJECT)
  • Networking of DBs
  • Exchange of Records
  • Recommendation to Member States
• 2000: CERIF 2000 Model (PROJECT, PERSON, OrgUnit, EXPERTISE, RESULTS, EQUIPMENT, CLASSIFICATION, Roles)
  • Data Model (RDBMS, OO, IR)
  • Multilinguality
  • Controlled Vocabulary
  • Roles / Types
  • User-driven
  • EC Recommendation to Member States
• 2006 / 2008: CERIF 2006 / 2008 Model (Core, Link, Semantics, Language, 2nd Level)
  • Data Model (RDBMS, OO, IR)
  • Model Normalization
  • Robust Structure
  • Extensible Structure
  • Consistent Structure
  • Semantic Layer
  • XML Exchange Specification
  • Connectivity to Repositories (Elaboration on Publication)
• Example project record shown on the slide: Acronym: ERGO; Participants: Keith Jeffery, Anne Asserson, many more; Organisations: Rutherford Appleton, University of Bergen, …
What is a model?
A model …
• … is a simplified view to describe a particular area of interest
• … allows for better communication between parties (mutual understanding)
• … supports (re-)design decisions
• … supports workflow identification
• … supports documentation
• … can be exchanged, re-used, iterated, extended
[Diagram: example models showing nodes (A, B, C, D; X, Z; F, G) connected by relationships such as "is part of", "informs", "depends on" and "waits for", alongside an SQL script (CREATE TABLE Person, Project, OrgUnit)]
The CERIF Model: Common European Research Information Format
[Diagram: research entities (Funding Programme, Organisation, Person, Project, Service, Skills, Publication, Equipment, CV, Patent, Product, Event) connected through Classification and Semantics]
What is Metadata?
"Metadata is structured data which describes the characteristics of a resource." (An Introduction to Metadata, by Chris Taylor, University of Queensland)
"Metadata is sometimes defined literally as 'data about data,' but the term is normally understood to mean structured data about resources that can be used to help support a wide range of operations. These might include, for example, resource description and discovery, the management of information resources and their long-term preservation." (Metadata in a Nutshell, by Michael Day, UKOLN)
Support for a wide range of operations …
What is Metadata? Data about Data
• Structure: Type of Resource, Title, Description, Source, Date, Author, Creator, …
• Book: Title: The Hitchhiker's Guide to the Galaxy; Date of Publication: 1979
• Radio Series: Title: The Hitchhiker's Guide to the Galaxy; Description: is a science fiction comedy series created by Douglas Adams. Originally a radio comedy broadcast on BBC Radio 4 in 1978, […]; Source: Wikipedia; Date of Query: May 30, 2008
• Series of five Books: Title: The Hitchhiker's Guide to the Galaxy; Between: 1979 - 1982
• TV Series: Title: The Hitchhiker's Guide to the Galaxy; Screened: 1981
• Computer Game: Title: The Hitchhiker's Guide to the Galaxy; Released: 1984
• Game Cover Image: The Hitchhiker's Guide to the Galaxy; Source: http://egotron.com/; Retrieved: May 30, 2008
• Comic Book Adaptions: Title: The Hitchhiker's Guide to the Galaxy; Between: 1993 – 1996
• Links:
  • http://www.bbc.co.uk/cult/hitchhikers/ (HTML-Title: Cult – The Hitchhiker's Guide to the Galaxy)
  • http://en.wikipedia.org/wiki/The_Hitchhiker's_Guide_to_the_Galaxy (HTML-Title: The Hitchhiker's Guide to the Galaxy)
What is Metadata? Support for a wide range of operations …
Metadata Categories
• Descriptive Metadata [intellectual contents]
• Administrative Metadata
  • Technical [file formats ...]
  • Rights Management [permissions ...]
  • Provenance [creation, subsequent treatment, ...]
  • ...
• Structural Metadata [internal structure of items: page order ...]
• Contextual Metadata
  • Project Context [funding programme, participating organisations …]
  • Publication Context [number of authors, external authors, first …]
  • Usage Context [downloads, requests, …]
  • ...
See also: JISC Report from April 2008, "Metadata for digital libraries: state of the art and future directions" by Richard Gartner, http://www.jisc.ac.uk/media/documents/techwatch/tsw_0801pdf.pdf
What is Formal Metadata? Support for a wide range of operations …
The same metadata categories apply (descriptive, administrative, structural, contextual).
Formalization = based on a Model
What is Research Information?
Data/metadata or information about:
• Scientists
• Project Managers
• Ongoing and Completed Projects
• Research Departments
• Funding Organisations and Programmes
• Research Results
• Publications
• Equipment
• their time-dependent relationships (semantics)
• ...
What is a CRIS? Current Research Information System = CRIS
• … that means
  • Timeliness
  • Vitality
• … information about
  • People +
  • Organisations +
  • Projects +
  • Funding Programmes +
  • Research Results +
  • …
• … driven by
  • A Concept
  • A Model
• … incorporated as an
  • Implementation (ICT)
An integrated approach towards managing research information
What is a CRIS? Current Research Information System = CRIS
Annotations added to the overview above: CERIF; Metadata; heterogeneous entities; changing relationships; Integration.
An integrated approach towards managing research information
Users of CRISs?
• Researchers (find partners, track competitors, form collaborations)
• Research Managers (assess performance, assess research output, find reviewers for evaluation of proposals)
• Research Strategists (decide on priorities and resourcing, compare with other countries)
• Publication Editors (find potential authors, find reviewers for proposed papers)
• Intermediaries / Brokers (find research products, identify ideas to be carried forward)
• Media (communicate results)
• General Public (for interest)
Users of CRISs? Two observations across all user groups:
• Research is international
• Research information involves various entities
What kind of questions do we want to answer from CRISs?
• How many articles has author X published in 2007 as a first author?
• How often have articles by author X been cited?
• Did author X publish with institutionally external authors?
• In how many FP7 projects does organisation Z participate?
• How many publications have resulted from project Y?
• How many people have been employed in the course of FP6 projects from the 1st call in the NMS?
• How many PhD students have participated in FP6 projects?
• How many women have been involved in FP6 projects?
• How often have articles in journal A been requested in 2007?
• How many articles have been published in the field of B?
• …
CERIF: Common European Research Information Format
[Diagram: research entities (Funding Programme, Organisation, Person, Project, Service, Skills, Publication, Equipment, CV, Patent, Product, Event) connected through Classification and Semantics]
Concept of the CERIF Model
CERIF: a model to manage research information
• Research Entities
  • Project, Person, Organisation
  • Funding Programme, Service, Equipment
  • Publication, Patent, Product, …
• Activities / Interconnections in their Context
  • Relationships
  • Semantics / Roles / Types
-> for Exchange
-> for Interoperability
-> for Implementation of CRISs (Current Research Information Systems)
Concept of the CERIF Model - Structure
CERIF Entity Types
• Core Entities
• Result Entities
• 2nd Level Entities
• Link Entities
CERIF Features
• Multiple Language
• Semantics
Core CERIF Entities in Detail
• Person: ID, URI, Sex, FirstNames, OtherNames, FamilyNames, NameVariants, ResearchInterest, Keywords
• Project: ID, URI, Acronym, StartDate, EndDate, Title, Abstract, Keywords
• OrganisationUnit: ID, URI, Acronym, Name, HeadCount, CurrencyCode, Turnover, ResearchActivity, Keywords
CERIF Result Entities in Detail
• ResultPublication: ID, URI, Title, Subtitle, Abstract, Bibl. Note, PublicationDate, TotalPages, StartPage, EndPage, Keywords
• ResultPatent: ID, URI, PatentNumber, Title, CountryCode, RegistrationDate, ApprovalDate, Description, Keywords
• ResultProduct: ID, URI, InternationalID
CERIF 2nd Level Entities
Facility, Equipment, Funding, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language
Some CERIF 2nd Level Entities in Detail
• FundingProgramme: ID, URI, Name, CurrencyCode, Budget, StartDate, EndDate, Description, Keywords
• ResultPatent: ID, URI, PatentNumber, Title, CountryCode, RegistrationDate, ApprovalDate, Description, Keywords
• Event: ID, URI, Name, FeeOrFree, StartDate, EndDate, CityTown, CountryCode, Description, Keywords
• Facility: ID, URI, Name, Description, Keywords
• Service: ID, URI, Name, Description, Keywords
[Background diagram: the 2nd level entities Facility, Equipment, Funding, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language]
Some CERIF Multiple Language Features in Detail
• Person: ResearchInterest [language], Keywords [language]
• Project: Title [language], Abstract [language], Keywords [language]
• OrganisationUnit: Name [language], ResearchActivity [language], Keywords [language]
• ResultPublication: Title [language], Abstract [language], Keywords [language]
• ResultPatent: Name [language], Description [language], Keywords [language]
• ResultProduct: Name [language], Description [language], Keywords [language]
• Service: Name [language], Description [language], Keywords [language]
• Facility: Name [language], Description [language], Keywords [language]
Multiple Language Features are associated with Core, Result, 2nd Level and Classification Entities
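One common way to store a language-tagged attribute such as Title [language] is a separate table keyed by entity ID and language code. The sketch below illustrates that idea with Python's `sqlite3`; the table name `ProjectTitle`, the column names, the sample titles and the fallback-to-English rule in `title_in` are all assumptions made for this example, not the CERIF table definitions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Project (id INTEGER PRIMARY KEY, acronym TEXT);
-- Hypothetical table: one row per (project, language) pair
CREATE TABLE ProjectTitle (
    project_id INTEGER REFERENCES Project(id),
    lang       TEXT,
    title      TEXT,
    PRIMARY KEY (project_id, lang)
);
""")
conn.execute("INSERT INTO Project VALUES (1, 'DEMO')")
conn.executemany(
    "INSERT INTO ProjectTitle VALUES (?, ?, ?)",
    [(1, "en", "An example project"), (1, "de", "Ein Beispielprojekt")],
)


def title_in(conn, project_id, lang, fallback="en"):
    """Return the title in `lang`, else in `fallback`, else None."""
    query = "SELECT title FROM ProjectTitle WHERE project_id = ? AND lang = ?"
    for code in (lang, fallback):
        row = conn.execute(query, (project_id, code)).fetchone()
        if row is not None:
            return row[0]
    return None


print(title_in(conn, 1, "de"))
print(title_in(conn, 1, "fr"))  # no French title stored, falls back to English
```

Keeping the language in the key means a new translation is just a new row; the entity table itself never changes.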
Some CERIF Semantic Features
Semantic Features are associated with Link Entities.
Example roles: role=author, role=author1 institute, role=deliverable1.2, role=CEO, role=funder, role=coordinator
Associated Semantic Features in more Detail
Each link entity carries the IDs of the two linked entities plus Classification, ClassificationScheme, StartDate and EndDate:
• OrganisationUnit_ResultPublication: orgID, publID
• Person_ResultPublication: persID, publID
• Project_ResultPublication: projID, publID
• Project_FundingProgramme: projID, fundProgID
• Project_Person: projID, persID
• Person_OrganisationUnit: persID, orgID
• Project_Organisation: projID, orgID
Example roles: role=author, role=author1 institute, role=originator, role=co-funder, role=coordinator, role=investigatedBy, role=affiliation
Associated Formal Semantic Features in more Detail
[Same link entities and example roles as on the previous slide, now marked as part of the CERIF Model: every link entity holds two entity IDs, a Classification, a ClassificationScheme, a StartDate and an EndDate.]
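Because the role sits on the link entity, a question such as "How many articles has author X published in 2007 as a first author?" becomes a join over one link table. The following sketch uses Python's `sqlite3` with a cut-down Person_ResultPublication table. Reading `author1` as "first author", the scheme name `roles`, the ISO date strings and the sample rows are assumptions for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE ResultPublication (publID INTEGER PRIMARY KEY, PublicationDate TEXT);
CREATE TABLE Person_ResultPublication (
    persID INTEGER,
    publID INTEGER REFERENCES ResultPublication(publID),
    Classification TEXT,        -- the role, e.g. 'author1' (assumed: first author)
    ClassificationScheme TEXT,  -- the scheme the role term belongs to
    StartDate TEXT,
    EndDate TEXT
);
""")
conn.executemany(
    "INSERT INTO ResultPublication VALUES (?, ?)",
    [(10, "2007-03-01"), (11, "2007-09-15"), (12, "2006-05-01")],
)
conn.executemany(
    "INSERT INTO Person_ResultPublication VALUES (?, ?, ?, ?, NULL, NULL)",
    [(1, 10, "author1", "roles"), (1, 11, "author", "roles"), (1, 12, "author1", "roles")],
)


def first_author_count(conn, pers_id, year):
    """Count publications of `pers_id` in `year` where the link role is 'author1'."""
    row = conn.execute(
        """
        SELECT COUNT(*)
        FROM Person_ResultPublication AS link
        JOIN ResultPublication AS pub ON pub.publID = link.publID
        WHERE link.persID = ?
          AND link.Classification = 'author1'
          AND substr(pub.PublicationDate, 1, 4) = ?
        """,
        (pers_id, str(year)),
    ).fetchone()
    return row[0]


print(first_author_count(conn, 1, 2007))
```

Changing the role filter (for example to `author`) answers a different question without touching the Person or ResultPublication tables.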
CERIF Semantic Layer
• ClassificationScheme: ClassSchemeID, Description [language], URI
• Classification: ClassID, ClassSchemeID, Term [language], Description [language], StartDate, EndDate, URI
• Classification_Classification: ClassID1 (Term1), ClassID2 (Term2), ClassSchemeID1 (Schema1), ClassSchemeID2 (Schema1), ClassID (Role), ClassSchemeID (RoleSchema), StartDate, EndDate
• ClassScheme_ClassScheme: ClassSchemeID1, ClassSchemeID2, ClassID (Role), ClassSchemeID (RoleSchema), StartDate, EndDate
CERIF Semantic Layer
• Can capture any schema or structure
  • Flat Lists
  • Taxonomies
  • Ontologies
• Open / extensible in all directions
  • New Schemas
  • New Concepts / Terms
  • New Relationships
• Makes it possible to manage
  • Roles / Types Semantics
  • Subject Headings
  • Archiving (time component)
• Allows for simple mappings between schemas (interchange)
• Allows for efficient (independent) maintenance
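The "simple mappings between schemas" point can be illustrated with a small in-memory version of Classification and Classification_Classification. This is a sketch under assumptions: the scheme names, term IDs, the role value `equivalentTo` and the helper `map_term` are hypothetical, and terms are kept in a single language for brevity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Classification:
    class_id: str   # ClassID
    scheme_id: str  # ClassSchemeID
    term: str       # Term (one language only in this sketch)


@dataclass(frozen=True)
class ClassificationLink:
    """One Classification_Classification row: term 1 --role--> term 2."""
    class_id1: str
    scheme_id1: str
    class_id2: str
    scheme_id2: str
    role: str       # ClassID (Role); the value used below is hypothetical


TERMS = [
    Classification("c1", "local-roles", "first author"),
    Classification("c2", "local-roles", "coordinator"),
    Classification("x9", "partner-roles", "lead author"),
]

LINKS = [
    ClassificationLink("c1", "local-roles", "x9", "partner-roles", "equivalentTo"),
]


def map_term(class_id, from_scheme, to_scheme, links=LINKS, terms=TERMS):
    """Return the terms in `to_scheme` linked as equivalent to the given term."""
    by_key = {(t.class_id, t.scheme_id): t for t in terms}
    return [
        by_key[(link.class_id2, link.scheme_id2)].term
        for link in links
        if (link.class_id1, link.scheme_id1) == (class_id, from_scheme)
        and link.scheme_id2 == to_scheme
        and link.role == "equivalentTo"
    ]


mapped = map_term("c1", "local-roles", "partner-roles")
print(mapped)
```

New schemes, terms or relationships are added as data rows, which is what keeps the layer extensible and independently maintainable.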
CERIF 2nd Level Entities (ERM View)
[ERM diagram of the 2nd level entities: Facility, Equipment, Funding, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language]
Outline
• What is CERIF?
• Grounding Explanations
  • Model
  • Metadata
  • Research Information
  • CRIS
• The Conceptual (Logical) CERIF Model
  • Entities
  • Relationships
  • Structure
• The CERIF Interchange Format
  • Concept / Structure
  • XML
• CERIF Examples and Related Activities
CERIF Interchange Format
• According to the CERIF Model Structure
  • Core Entities
  • Result Entities
  • 2nd Level Entities
  • Link Entities
  • Multilingual Features
  • Semantic Features
[Diagram: each model entity (e.g. Person) maps 1:1 to an interchange entity]
CERIF Interchange Format: Person
Model entity Person: ID, URI, Sex
    <XML>
      <PERSON>
        <ID>1</ID>
        <URI>http://www.linkedin.com1</URI>
        <Sex>female</Sex>
      </PERSON>
      <PERSON>
        <ID>2</ID>
        <URI>http://www.linkedin.com2</URI>
        <Sex>male</Sex>
      </PERSON>
      ---
    </XML>
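Reading such an interchange file back into records takes only a few lines with Python's standard `xml.etree.ElementTree`. The sketch below makes some assumptions: the root element `Persons` and the example.org URIs are placeholders chosen for this example (the slide uses `<XML>` as its root), and `parse_persons` is a hypothetical helper, not part of any CERIF tooling.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: root tag and URIs are placeholders, not CERIF names.
SAMPLE = """<Persons>
  <PERSON><ID>1</ID><URI>http://example.org/person/1</URI><Sex>female</Sex></PERSON>
  <PERSON><ID>2</ID><URI>http://example.org/person/2</URI><Sex>male</Sex></PERSON>
</Persons>"""


def parse_persons(xml_text):
    """Turn each <PERSON> element into a dict of child tag -> text."""
    root = ET.fromstring(xml_text)
    return [
        {child.tag: (child.text or "").strip() for child in person}
        for person in root.findall("PERSON")
    ]


persons = parse_persons(SAMPLE)
for person in persons:
    print(person["ID"], person["Sex"], person["URI"])
```

Because the interchange entity mirrors the model entity 1:1, the resulting dict keys are exactly the attribute names of the Person entity.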
CERIF Interchange Format: ResultPublication
Model entity ResultPublication: ID, URI, PublicationDate, Num, Vol, Edition, Series, Issue, TotalPages, StartPage, EndPage, ISBN, ISSN
    <XML>
      <ResultPublication>
        <ID>1</ID>
        <PublicationDate>2006</PublicationDate>
        <URI>http://www.epubs.org/ID1</URI>
      </ResultPublication>
      <ResultPublication>
        <ID>2</ID>
        <PublicationDate>2005</PublicationDate>
        <URI>http://www.greynet.org/thegreyjournal.html?ID2</URI>
      </ResultPublication>
      ---
    </XML>
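The opposite direction, writing records out in this shape, can be sketched with the same standard library module. Again this is an illustration under assumptions: the root tag `Publications`, the helper `publications_to_xml` and the choice to emit only ID, PublicationDate and URI are made up for the example and do not come from the CERIF XML specification.

```python
import xml.etree.ElementTree as ET

FIELDS = ("ID", "PublicationDate", "URI")  # subset of attributes shown on the slide


def publications_to_xml(records, root_tag="Publications"):
    """Serialise dict records as <ResultPublication> elements under one root."""
    root = ET.Element(root_tag)  # hypothetical root tag
    for record in records:
        pub = ET.SubElement(root, "ResultPublication")
        for field in FIELDS:
            if field in record:  # skip attributes the record does not carry
                ET.SubElement(pub, field).text = str(record[field])
    return ET.tostring(root, encoding="unicode")


xml_text = publications_to_xml(
    [{"ID": 1, "PublicationDate": 2006, "URI": "http://www.epubs.org/ID1"}]
)
print(xml_text)
```

Each attribute of the model entity becomes one child element, so extending `FIELDS` with further attributes such as ISBN or ISSN is enough to export them as well.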