CERIF Tutorial: Release 2008 – 1.1
Brigitte Jörg, M.A. (Information Science)
Language Technology Lab, German Research Center for Artificial Intelligence (DFKI)
Berlin, Germany
Introduction of Speaker
Brigitte Jörg
M.A. Information Science
Information Systems, Business Administration
• Researcher, Project Manager, DFKI GmbH, Language Technology Lab, Berlin
• CERIF TG Leader, Board Member, euroCRIS
• Contact: brigitte.joerg @ dfki.de, http://www.dfki.de/~brigitte/
Outline
• CERIF – Common European Research Information Format
• CERIF Entity Types and Features
  • CORE Entities
  • Result Entities
  • 2nd Level Entities
  • Link Entities
  • Semantic Layer
  • (Multilinguality)
• CERIF XML Interchange Format
• Formalizing a CERIF Semantics
• Towards a CERIF Core
• Discussion
CERIF: Common European Research Information Format
[Figure: CERIF entities: Funding Programme, Organisation, Person, Project, Service, Skills, Publication, Equipment, CV, Patent, Product, Event, Classification (Semantics)]
What is CERIF? Common European Research Information Format
• Specification (Conceptual Level): a concept about research entities and their relationships
• Model (Logical Level): an abstract formal description of the concept about entities and their relationships
• Database Scripts (Physical Level): a formal machine-readable description of the concept
SQL Script
  CREATE TABLE Person
  CREATE TABLE Project
  CREATE TABLE OrgUnit
Organisation of data / information accordingly!
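The physical level can be sketched with an in-memory SQLite database. This is only an illustration: the column sets are a reduced, hypothetical selection of the attributes named in this tutorial, and the real CERIF scripts define considerably more.

```python
import sqlite3

# Hypothetical, reduced column sets; the official CERIF scripts are larger.
DDL = """
CREATE TABLE Person  (ID TEXT PRIMARY KEY, URI TEXT, Sex TEXT);
CREATE TABLE Project (ID TEXT PRIMARY KEY, URI TEXT, Acronym TEXT,
                      StartDate TEXT, EndDate TEXT);
CREATE TABLE OrgUnit (ID TEXT PRIMARY KEY, URI TEXT, Acronym TEXT,
                      HeadCount INTEGER);
"""


def create_schema(conn):
    """Create the three core tables in the given connection."""
    conn.executescript(DDL)


def table_names(conn):
    """Return the names of all tables, sorted alphabetically."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
    return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
create_schema(conn)
print(table_names(conn))
```

Each core entity becomes one table; relationships between them would live in separate link tables rather than in foreign-key columns on the core tables.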
What is CERIF? Common European Research Information Format
• A data model (conceptual, logical, physical)
• Allows for a (metadata) representation of
  • research entities
  • their activities / interconnections (research)
  • their output (results)
• Allows for high flexibility with formal semantic relationships
• Enables quality maintenance, archiving, access and interchange of research information
• Supports knowledge transfer to decision makers, for research evaluation, research managers, strategists, researchers, editors, the general public
What is CERIF? Common European Research Information Format
• CERIF is an EU Recommendation to Member States: http://cordis.europa.eu/cerif/
• The European Commission (EC) has authorised euroCRIS to maintain and develop CERIF and its usage: http://www.eurocris.org/cerif/cerif-releases/
The CERIF Evolution
[Figure: timeline with the marks 1987, 1991, 2000, 2006, 2008. Stages shown: Similar Ideas (UN/UNESCO, OECD, CODATA); EU Working Group on Research Databases; Workshop; CERIF 91; CERIF 2000 Model; CERIF 2006 / 2008 Model]
Entity labels in the figure: PROJECT; PERSON, OrgUnit, PROJECT, RESULTS, EQUIPMENT, EXPERTISE, CLASSIFICATION, Roles; CORE, Link, Semantics, Language, 2nd Level
Sample record in the figure: Acronym: ERGO; Participant: Keith Jeffery, Anne Asserson, many more; Organisations: Rutherford Appleton, University of Bergen, …
Feature lists shown along the timeline:
• Data Model (RDBMS, OO, IR); Model Normalization; Robust Structure; Extensible Structure; Consistent Structure; Semantic Layer
• XML Exchange Specification; Elaboration on Publication; (Core) CERIF Semantics
• Data Model (RDBMS, OO, IR); Multilinguality; Controlled Vocabulary; Roles / Types; User-driven; EC Recommendation to Member States
• Networking of DBs; Exchange of Records; Recommendation to Member States
Concept of the CERIF Model - Structure
CERIF Entity Types
• Core Entities
• Result Entities
• 2nd Level Entities
• Link Entities
CERIF Features
• Multiple Language
• Semantics
Core CERIF Entities in Detail
• Person: ID, URI, Sex, FirstNames, OtherNames, FamilyNames, NameVariants, ResearchInterest, Keywords
• Project: ID, URI, Acronym, StartDate, EndDate, Title, Abstract, Keywords
• OrganisationUnit: ID, URI, Acronym, Name, HeadCount, CurrencyCode, Turnover, ResearchActivity, Keywords
CERIF Result Entities in Detail
• ResultPublication: ID, URI, Title, Subtitle, Abstract, Bibl. Note, PublicationDate, TotalPages, StartPage, EndPage, Keywords
• ResultPatent: ID, URI, PatentNumber, Title, CountryCode, RegistrationDate, ApprovalDate, Description, Keywords
• ResultProduct: ID, URI, InternationalID
CERIF Core, Result and 2nd Level Entities
[Figure: 2nd level entities shown: Call, Facility, Grant, Equipment, FundingProgramme, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language]
Some CERIF 2nd Level Entities in Detail
• FundingProgramme: ID, URI, Name, CurrencyCode, Budget, StartDate, EndDate, Description, Keywords
• ResultPatent: ID, URI, PatentNumber, Title, CountryCode, RegistrationDate, ApprovalDate, Description, Keywords
• Event: ID, URI, Name, FeeOrFree, StartDate, EndDate, CityTown, CountryCode, Description, Keywords
• Facility: ID, URI, Name, Description, Keywords
• Service: ID, URI, Name, Description, Keywords
[Figure: 2nd level entities shown: Call, Facility, Grant, Equipment, FundingProgramme, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language]
Some CERIF Semantic Features
Semantic Features are associated with Link Entities.
Example roles: role=author, role=author1 institute, role=deliverable1.2, role=CEO, role=funder, role=coordinator
More CERIF Semantic Features
Semantic Features are associated with Link Entities.
Example roles:
• role=author, role=author1, role=author1-institute, role=editor, role=reviewer, role=... ?
• role=deliverable1.2, role=journal article, role=public report, role=... ?
• role=CEO, role=researcher, role=project-manager, role=member, role=manager
• role=funder, role=investigator, role=coordinator
Associated Formal Semantic Features in more Detail (CERIF Model)
Link entities and their attributes:
• OrganisationUnit_ResultPublication: OrgID, PublID, ClassificationID, ClassificationSchemeID, StartDate, EndDate
• Person_ResultPublication: PersID, PublID, ClassificationID, ClassificationSchemeID, Fraction, StartDate, EndDate
• Project_ResultPublication: ProjID, PublID, ClassificationID, ClassificationSchemeID, Fraction, StartDate, EndDate
• Project_FundingProgramme: ProjID, FundProgID, ClassificationID, ClassificationSchemeID, StartDate, EndDate
• Project_Person: ProjID, PersID, ClassificationID, ClassificationSchemeID, StartDate, EndDate
• Person_OrganisationUnit: PersID, OrgID, ClassificationID, ClassificationSchemeID, StartDate, EndDate
• Project_Organisation: ProjID, OrgID, ClassificationID, ClassificationSchemeID, StartDate, EndDate
Example roles: role=author, role=author1 institute, role=originator, role=co-funder, role=coordinator, role=affiliation, role=investigatedBy
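A minimal sketch of how a link entity carries its role, assuming a generic record shape. The `Link` class, its field names and the sample identifiers are hypothetical; only the idea (two entity IDs plus a classification term, its scheme and a validity period) comes from the slide.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Link:
    """One row of a link entity such as Person_ResultPublication."""
    left_id: str     # e.g. PersID
    right_id: str    # e.g. PublID
    class_id: str    # the role term, e.g. "author"
    scheme_id: str   # the scheme that defines the role term
    start_date: str
    end_date: str


def roles_by_pair(links):
    """Group role terms by the (left, right) pair they qualify."""
    grouped = defaultdict(set)
    for link in links:
        grouped[(link.left_id, link.right_id)].add(link.class_id)
    return dict(grouped)


# Hypothetical sample rows: one person holds two roles on the same publication.
links = [
    Link("pers-1", "publ-1", "author", "roles", "2008-01-01", "2008-12-31"),
    Link("pers-1", "publ-1", "reviewer", "roles", "2008-01-01", "2008-12-31"),
    Link("pers-2", "publ-1", "editor", "roles", "2008-01-01", "2008-12-31"),
]
roles = roles_by_pair(links)
print(roles)
```

Because the role is data in the link row, a new role needs a new classification term, not a new column or table.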
CERIF Semantic Layer
• ClassificationScheme: ClassSchemeID (Taxonomy); URI (http://www.taxonomy.org/); Description [language=EN]
• Classification: ClassID (isA); ClassSchemeID (Taxonomy); StartDate, EndDate; URI; Term [language=EN]; Description [language=EN]
• Classification_Classification: ClassID1 (Ontology); ClassID2 (SemanticWeb); ClassSchemeID1 (WebTechnologies); ClassSchemeID2 (WebTechnologies); ClassID (isA); ClassSchemeID (Taxonomy); Fraction (0.3); StartDate, EndDate
• ClassScheme_ClassScheme: ClassSchemeID1 (LT World); ClassSchemeID2 (CLARIN); ClassID (mapsWith); ClassSchemeID (LT-World Mappings); Fraction (0.3); StartDate, EndDate
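The Classification_Classification rows can be read as typed edges between terms. The sketch below follows `isA` edges transitively; the `ClassClass` record and the second sample row (including the term `ComputerScience`) are hypothetical, while the first row mirrors the example values on the slide.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClassClass:
    """One Classification_Classification row: term1 --relation--> term2."""
    class_id1: str
    class_id2: str
    relation: str    # itself a classification term, e.g. "isA"
    fraction: float


def ancestors(term, rows, relation="isA"):
    """Follow `relation` edges transitively, starting from `term`."""
    found = []
    stack = [term]
    while stack:
        current = stack.pop()
        for row in rows:
            if (row.relation == relation
                    and row.class_id1 == current
                    and row.class_id2 not in found):
                found.append(row.class_id2)
                stack.append(row.class_id2)
    return found


rows = [
    ClassClass("Ontology", "SemanticWeb", "isA", 0.3),          # values from the slide
    ClassClass("SemanticWeb", "ComputerScience", "isA", 1.0),   # hypothetical row
]
print(ancestors("Ontology", rows))
```

The relation itself is a classification term, so other relation types (for example `mapsWith`) can be traversed by passing a different `relation` argument.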
CERIF Semantic Layer
• Can capture any schema or structure
  • Flat lists
  • Taxonomies
  • Ontologies
• Open / extensible in all directions
  • New schemas
  • New concepts / terms
  • New relationships
• Enables management of
  • Roles / types semantics
  • Subject headings
  • Archiving (time component)
• Allows for simple mappings between schemas (interchange)
• Allows for efficient (independent) maintenance
CERIF Modules
[Figure: labels shown: ResultPublication, Project, Funding Programme; Time; Role X, Role Y, Role Z; 2nd level entities (Call, Facility, Grant, Equipment, FundingProgramme, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language)]
CERIF Modules
[Figure: labels shown: ResultPublication, OrganisationUnit, Project, Funding; Semantic Layer with SCHEMA 1, SCHEMA 2, SCHEMA 3; Role X, Role Y, Role Z, Role A, Role B, Role C; 2nd level entities (Call, Facility, Grant, Equipment, FundingProgramme, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language)]
CERIF 2nd Level Entities (ERM View)
[Figure: entities shown: Facility, Equipment, Funding, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language]
CERIF: Common European Research Information Format
[Figure: CERIF entities: Funding Programme, Organisation, Person, Project, Service, Skills, Publication, Equipment, CV, Patent, Product, Event, Classification (Semantics)]
CERIF Interchange Format
• According to the CERIF Model Structure
  • Core Entities
  • Result Entities
  • 2nd Level Entities
  • Link Entities
  • Multilingual Features
  • Semantic Features
[Figure: each entity (e.g. Person) corresponds 1:1 to an interchange entity]
CERIF Interchange Format: Person
Person attributes: ID, URI, Sex
<XML>
  <PERSON>
    <ID>1</ID>
    <URI>http://www.linkedin.com1</URI>
    <Sex>female</Sex>
  </PERSON>
  <PERSON>
    <ID>2</ID>
    <URI>http://www.linkedin.com2</URI>
    <Sex>male</Sex>
  </PERSON>
  ---
</XML>
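A rough sketch of the 1:1 mapping from entity records to interchange elements, using Python's standard `xml.etree.ElementTree`. The wrapper element name `Persons`, the function name and the example.org URIs are assumptions for illustration; the element names `PERSON`, `ID`, `URI` and `Sex` follow the slide.

```python
import xml.etree.ElementTree as ET


def persons_to_xml(persons):
    """Serialise person records, one <PERSON> element per record."""
    root = ET.Element("Persons")  # hypothetical wrapper element
    for person in persons:
        element = ET.SubElement(root, "PERSON")
        for field in ("ID", "URI", "Sex"):
            ET.SubElement(element, field).text = str(person[field])
    return ET.tostring(root, encoding="unicode")


# Hypothetical sample records.
people = [
    {"ID": 1, "URI": "http://example.org/person/1", "Sex": "female"},
    {"ID": 2, "URI": "http://example.org/person/2", "Sex": "male"},
]
xml_text = persons_to_xml(people)
print(xml_text)
```

Every attribute of the entity becomes one child element, so the XML file mirrors the table row by row.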
CERIF Interchange Format: ResultPublication
ResultPublication attributes: ID, URI, PublicationDate, Num, Vol, Edition, Series, Issue, TotalPages, StartPage, EndPage, ISBN, ISSN
<XML>
  <ResultPublication>
    <ID>1</ID>
    <PublicationDate>2006</PublicationDate>
    <URI>http://www.epubs.org/ID1</URI>
  </ResultPublication>
  <ResultPublication>
    <ID>2</ID>
    <PublicationDate>2005</PublicationDate>
    <URI>http://www.greynet.org/thegreyjournal.html?ID2</URI>
  </ResultPublication>
  ---
</XML>
CERIF Interchange Format: Person_Publication
Person_Publication attributes: personID, publicationID, ClassID, ClassSchemeID, Fraction, StartDate, EndDate
<XML>
  <Person_ResultPublication>
    <personID>1</personID>
    <publicationID>1</publicationID>
    <ClassID>1</ClassID>
    <ClassSchemeID>1</ClassSchemeID>
    <Fraction>0.3</Fraction>
    <StartDate>2010-01-01</StartDate>
    <EndDate>2010-12-31</EndDate>
  </Person_ResultPublication>
  <Person_ResultPublication>
    <personID>2</personID>
    <publicationID>1</publicationID>
    <ClassID>1</ClassID>
    <ClassSchemeID>1</ClassSchemeID>
    <Fraction>0.3</Fraction>
    <StartDate>2010-01-01</StartDate>
    <EndDate>2010-12-31</EndDate>
  </Person_ResultPublication>
  ---
</XML>
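Since relationships travel in their own elements, a consumer has to look them up by ID. A minimal sketch, assuming the two link records from the slide wrapped in the slide's `<XML>` element; the function name `persons_linked_to` is hypothetical.

```python
import xml.etree.ElementTree as ET

LINKS_XML = """
<XML>
  <Person_ResultPublication>
    <personID>1</personID>
    <publicationID>1</publicationID>
    <ClassID>1</ClassID>
    <ClassSchemeID>1</ClassSchemeID>
    <Fraction>0.3</Fraction>
    <StartDate>2010-01-01</StartDate>
    <EndDate>2010-12-31</EndDate>
  </Person_ResultPublication>
  <Person_ResultPublication>
    <personID>2</personID>
    <publicationID>1</publicationID>
    <ClassID>1</ClassID>
    <ClassSchemeID>1</ClassSchemeID>
    <Fraction>0.3</Fraction>
    <StartDate>2010-01-01</StartDate>
    <EndDate>2010-12-31</EndDate>
  </Person_ResultPublication>
</XML>
"""


def persons_linked_to(xml_text, publication_id):
    """Return (personID, fraction) pairs for one publication."""
    root = ET.fromstring(xml_text)
    result = []
    for link in root.findall("Person_ResultPublication"):
        if link.findtext("publicationID") == publication_id:
            fraction = float(link.findtext("Fraction"))
            result.append((link.findtext("personID"), fraction))
    return result


print(persons_linked_to(LINKS_XML, "1"))
```

The person and publication records themselves would be resolved from their own files using the returned IDs.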
CERIF Interchange Format
• According to W3C Standards
• Refers to XML Schemas for Validation
• XML files corresponding to CERIF Structure -> Entities / Separation of Relationships
• Available Specification Document as part of the CERIF 2008 Release: http://www.euroCris.org/cerif/cerif-releases/
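Schema validation needs an XSD-aware tool, which Python's standard library does not include. As a first step before validation, the sketch below only checks well-formedness; the function name and the sample fragments are illustrative assumptions.

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_text):
    """Return True if the text parses as XML; this is not schema validation."""
    try:
        ET.fromstring(xml_text)
    except ET.ParseError:
        return False
    return True


good = "<cfPers><cfPersId>p1</cfPersId><cfSex>f</cfSex></cfPers>"
bad = "<cfPers><cfPersId>p1</cfPersId><cfSex>f</Sex></cfPers>"  # mismatched close tag
print(is_well_formed(good), is_well_formed(bad))
```

A file that fails this check cannot be validated against any schema, so it is a cheap gate in front of the real validation step.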
CERIF Example (Integrated Person)
[Figure: labels shown: ResultPublication, Project, Funding Programme; 2nd level entities (Facility, Equipment, Funding, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language)]
CERIF XML Example (Person)
<cfPers>
  <cfPersId>BrigitteJoerg</cfPersId>
  <cfSex>f</cfSex>
  <cfURI>http://www.dfki.de/~brigitte</cfURI>
</cfPers>
<cfPersName>
  <cfPersId>BrigitteJoerg</cfPersId>
  <cfFamilyName>Joerg</cfFamilyName>
  <cfFirstNames>Brigitte</cfFirstNames>
</cfPersName>
<cfPers_ResPubl>
  <cfPersId>BrigitteJoerg</cfPersId>
  <cfResPublId>DataScienceJournalArticle</cfResPublId>
  <cfClassId>Article</cfClassId>
  <cfClassSchemeId>REF2010-Evaluation</cfClassSchemeId>
  <cfFraction>1.0</cfFraction>
  <cfStartDate>2010-01-01</cfStartDate>
  <cfEndDate>2010-12-31</cfEndDate>
</cfPers_ResPubl>
<cfPers_OrgUnit>
  <cfPersId>BrigitteJoerg</cfPersId>
  <cfOrgUnitId>DFKI</cfOrgUnitId>
  <cfClassId>Affiliation</cfClassId>
  <cfClassSchemeId>REF2010-Evaluation</cfClassSchemeId>
  <cfFraction>1.0</cfFraction>
  <cfStartDate>2010-01-01</cfStartDate>
  <cfEndDate>2010-12-31</cfEndDate>
</cfPers_OrgUnit>
CERIF Example (Integrated Organisation)
[Figure: labels shown: ResultPublication, Project, Funding Programme; 2nd level entities (Facility, Equipment, Funding, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language)]
CERIF XML Example (Organisation)
<cfOrgUnit>
  <cfOrgUnitId>DFKI</cfOrgUnitId>
  <cfCurrencyCode>EURO</cfCurrencyCode>
  <cfURI>http://www.dfki.de</cfURI>
</cfOrgUnit>
<cfOrgUnitName>
  <cfOrgUnitId>DFKI</cfOrgUnitId>
  <cfName lang="EN">German Research Center for Artificial Intelligence</cfName>
</cfOrgUnitName>
<cfOrgUnit_Class>
  <cfOrgUnitId>DFKI</cfOrgUnitId>
  <cfClassId>PrivateNotForProfit</cfClassId>
  <cfClassSchemeId>OrganisationTypes</cfClassSchemeId>
  <cfFraction>1.0</cfFraction>
  <cfStartDate>2007-01-01</cfStartDate>
  <cfEndDate>2099-12-31</cfEndDate>
</cfOrgUnit_Class>
<cfOrgUnit_Class>
  <cfOrgUnitId>DFKI</cfOrgUnitId>
  <cfClassId>Artificial-Intelligence</cfClassId>
  <cfClassSchemeId>Research-Fields</cfClassSchemeId>
  <cfFraction>0.1</cfFraction>
  <cfStartDate>2007-01-01</cfStartDate>
  <cfEndDate>2009-12-31</cfEndDate>
</cfOrgUnit_Class>
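The start and end dates on each classification row make it possible to ask which classifications held on a given day. A small sketch under assumptions: the wrapper element `cfRecords` and the function name `classes_on` are hypothetical, and the two rows repeat the organisation example.

```python
from datetime import date
import xml.etree.ElementTree as ET

ORG_CLASS_XML = """
<cfRecords>
  <cfOrgUnit_Class>
    <cfOrgUnitId>DFKI</cfOrgUnitId>
    <cfClassId>PrivateNotForProfit</cfClassId>
    <cfClassSchemeId>OrganisationTypes</cfClassSchemeId>
    <cfFraction>1.0</cfFraction>
    <cfStartDate>2007-01-01</cfStartDate>
    <cfEndDate>2099-12-31</cfEndDate>
  </cfOrgUnit_Class>
  <cfOrgUnit_Class>
    <cfOrgUnitId>DFKI</cfOrgUnitId>
    <cfClassId>Artificial-Intelligence</cfClassId>
    <cfClassSchemeId>Research-Fields</cfClassSchemeId>
    <cfFraction>0.1</cfFraction>
    <cfStartDate>2007-01-01</cfStartDate>
    <cfEndDate>2009-12-31</cfEndDate>
  </cfOrgUnit_Class>
</cfRecords>
"""


def classes_on(xml_text, org_id, day):
    """Return (scheme, class) pairs valid for `org_id` on `day`."""
    root = ET.fromstring(xml_text)
    active = []
    for row in root.findall("cfOrgUnit_Class"):
        start = date.fromisoformat(row.findtext("cfStartDate"))
        end = date.fromisoformat(row.findtext("cfEndDate"))
        if row.findtext("cfOrgUnitId") == org_id and start <= day <= end:
            active.append((row.findtext("cfClassSchemeId"), row.findtext("cfClassId")))
    return active


print(classes_on(ORG_CLASS_XML, "DFKI", date(2008, 6, 1)))
print(classes_on(ORG_CLASS_XML, "DFKI", date(2010, 1, 1)))
```

Expired rows stay in the data and are merely filtered out, which is what gives the model its archiving capability.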
CERIF Example (Integrated Project)
[Figure: labels shown: ResultPublication, Project, Funding Programme; 2nd level entities (Facility, Equipment, Funding, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language)]
CERIF XML Example (Project)
<cfProj>
  <cfProjId>LT-World</cfProjId>
  <cfAcro>LT World</cfAcro>
  <cfURI>http://www.lt-world.org</cfURI>
</cfProj>
<cfProjTitle>
  <cfProjId>LT-World</cfProjId>
  <cfTitle lang="EN">Language Technology World</cfTitle>
</cfProjTitle>
<cfProj_OrgUnit>
  <cfProjId>LT-World</cfProjId>
  <cfOrgUnitId>DFKI</cfOrgUnitId>
  <cfClassId>Coordinator</cfClassId>
  <cfClassSchemeId>Project-OrgUnit-Roles</cfClassSchemeId>
  <cfFraction>1.0</cfFraction>
  <cfStartDate>2001-01-01</cfStartDate>
  <cfEndDate>2099-12-31</cfEndDate>
</cfProj_OrgUnit>
<cfProj_Class>
  <cfProjId>LT-World</cfProjId>
  <cfClassId>National</cfClassId>
  <cfClassSchemeId>Project-Types</cfClassSchemeId>
  <cfFraction>1.0</cfFraction>
  <cfStartDate>2001-01-01</cfStartDate>
  <cfEndDate>2009-12-31</cfEndDate>
</cfProj_Class>
CERIF Example (Integrated Publication)
[Figure: labels shown: ResultPublication, Project, Funding Programme; 2nd level entities (Facility, Equipment, Funding, ExpertiseAndSkills, Service, Qualification, ElectronicAddress, Prize, PostalAddress, CV, Country, Citation, Currency, Metrics, Event, Language)]
CERIF XML Example (Publication)
<cfResPubl>
  <cfResPublId>JoergEtAl2008</cfResPublId>
  <cfResPublDate>2008-01-01</cfResPublDate>
  <cfStartPage>107</cfStartPage>
  <cfEndPage>123</cfEndPage>
  <cfISBN>978-961-6133-38-8</cfISBN>
  <cfURI>http://www2.dfki.de/lt/publications.php?author=brjo01</cfURI>
</cfResPubl>
<cfResPublTitle>
  <cfResPublId>JoergEtAl2008</cfResPublId>
  <cfTitle lang="EN">Analyzing European Research Competencies</cfTitle>
</cfResPublTitle>
<cfPers_ResPubl>
  <cfPersId>Brigitte-Joerg</cfPersId>
  <cfResPublId>JoergEtAl2008</cfResPublId>
  <cfClassId>FirstAuthor</cfClassId>
  <cfClassSchemeId>REF-AuthorScheme-2010</cfClassSchemeId>
  <cfFraction>0.5</cfFraction>
  <cfStartDate>2010-01-01</cfStartDate>
  <cfEndDate>2010-12-31</cfEndDate>
</cfPers_ResPubl>
<cfPers_ResPubl>
  <cfPersId>Keith-Jeffery</cfPersId>
  <cfResPublId>JoergEtAl2008</cfResPublId>
  <cfClassId>2ndAuthor</cfClassId>
  <cfClassSchemeId>REF-AuthorScheme-2010</cfClassSchemeId>
  <cfFraction>0.5</cfFraction>
  <cfStartDate>2010-01-01</cfStartDate>
  <cfEndDate>2010-12-31</cfEndDate>
</cfPers_ResPubl>
<cfResPubl_Class>
  <cfResPublId>JoergEtAl2008</cfResPublId>
  <cfClassId>Conference-Article</cfClassId>
  <cfClassSchemeId>REF-PublicationTypes-2010</cfClassSchemeId>
  <cfFraction>1.0</cfFraction>
  <cfStartDate>2010-01-01</cfStartDate>
  <cfEndDate>2010-12-31</cfEndDate>
</cfResPubl_Class>
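The two authorship links each carry a fraction of 0.5. The sketch below simply totals the fractions per publication; the slides do not state that fractions must add up to 1, so treat the total as a diagnostic, not a rule. The wrapper element `cfRecords` and the function name are hypothetical, and only the fields needed here are kept.

```python
from collections import defaultdict
import xml.etree.ElementTree as ET

AUTHORS_XML = """
<cfRecords>
  <cfPers_ResPubl>
    <cfPersId>Brigitte-Joerg</cfPersId>
    <cfResPublId>JoergEtAl2008</cfResPublId>
    <cfClassId>FirstAuthor</cfClassId>
    <cfFraction>0.5</cfFraction>
  </cfPers_ResPubl>
  <cfPers_ResPubl>
    <cfPersId>Keith-Jeffery</cfPersId>
    <cfResPublId>JoergEtAl2008</cfResPublId>
    <cfClassId>2ndAuthor</cfClassId>
    <cfFraction>0.5</cfFraction>
  </cfPers_ResPubl>
</cfRecords>
"""


def fraction_totals(xml_text):
    """Sum cfFraction over all person links, keyed by publication ID."""
    root = ET.fromstring(xml_text)
    totals = defaultdict(float)
    for link in root.findall("cfPers_ResPubl"):
        totals[link.findtext("cfResPublId")] += float(link.findtext("cfFraction"))
    return dict(totals)


print(fraction_totals(AUTHORS_XML))
```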
CERIF Semantics [Publication Types]
[Figure: publication type terms grouped around "Publication Types"]
• Book, Book Chapter, Book Chapter Abstract, Book Chapter Review, Book Review, Inbook, Otherbook
• Anthology, Monograph, Reference Book, Textbook, Encyclopedia, Manual
• Journal, Journal Article, Journal Article Abstract, Journal Article Review
• Conference Proceedings, Conference Proceedings Article, Poster, Presentation
• PhD Thesis, Doctoral Thesis, Report
• Letter, Letter to Editor, Short Communication, Commentary, Annotation, News Clipping