The CERIF-2000 and Vocabularies

Presentation Transcript


  1. The CERIF-2000 and Vocabularies
     Andrei Lopatenko, Vienna University of Technology
     http://derpi.tuwien.ac.at/~andrei

  2. Problems. Examples. The CERIF-2000 standard publication types
     'abstract', 'bibliography', 'biography', 'book', 'book chapter',
     'conference paper', 'conference proceedings', 'correspondence',
     'dictionary', 'directory', 'dissertation', 'duplicate publication',
     'editorial', 'encyclopedia', 'errata', 'guideline', 'index', 'interview',
     'journal article', 'lecture', 'meta-analysis', 'miscellaneous',
     'monograph', 'multimedia', 'news', 'overall', 'patent', 'report', 'review',
     'standard', 'textbook', 'translation', 'twin study'

  3. Problems. Examples. UiB (a real CRIS) publication types
     • http://www.ub.uib.no/avdeling/fdok/demo/UK2001/classifikasjon.htm
     • Two-level hierarchy
       • First level: seven terms
       • Second level: about 50 terms
     • The same situation holds in Austria
     • Vocabularies are dictated by the real requirements that researchers, policy-makers, and university administration place on the CRIS

  4. Research topics, expertise skills
     • CERIF-2000 suggests using ORTELIUS and other vocabularies
     • The real needs of researchers: AMS, national vocabularies, and others
     • So what should be done?

  5. Why are there different vocabularies for the same data?
     • Different societies and policies focus on different aspects or are interested in different details
     • Different countries have their own vocabularies for historical and other reasons

  6. When is a vocabulary used?
     • In information retrieval operations
       • Search by vocabulary terms
       • Browse by hierarchy
     • Reports, analysis, evaluation, visualization (usually involving some statistical calculations)
     • Information input

  7. What we need
     • As a rule, every CRIS has its own target audience, and that audience defines the vocabulary used
     • But some CRISs must be compatible: data must be exportable to EU bodies, and the problem of “domains” (in this case, vocabularies) must be solved
     • CERIF-2000 should specify how to use a custom vocabulary for the needs of a CRIS's audience while remaining compatible with other CERIF CRISs (in future, an ERIS network?)

  8. Other advantages
     • Such a framework will also be useful for CRISs that share data
     • Some CRISs have several different audiences, each using its own vocabulary for the same data. An implementation guideline for building a multi-vocabulary CRIS can help CRIS developers
     • Example: publication types for researchers versus university administration (which needs only statistical weights of publications); national and international audiences for the same CRIS

  9. So what we need
     • A framework and application which
       • Build, evolve, and store different vocabularies of different types
       • Specify the meaning of a vocabulary: which information it can be used to classify
       • Specify intervocabulary relations (mappings)
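The three requirements above can be sketched as a small data model. This is only an illustration: the class names, field names, and the local terms are assumptions, not part of CERIF-2000 or of the tool described in the slides.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Vocabulary:
    """A named vocabulary whose terms may form a hierarchy (hypothetical model)."""
    name: str
    applies_to: str  # which kind of information the vocabulary classifies
    broader: dict[str, Optional[str]] = field(default_factory=dict)  # term -> parent

    def add_term(self, term: str, parent: Optional[str] = None) -> None:
        if parent is not None and parent not in self.broader:
            raise KeyError(f"unknown parent term: {parent}")
        self.broader[term] = parent

    def path(self, term: str) -> list[str]:
        """Return the term preceded by its broader terms, top level first."""
        chain = []
        current: Optional[str] = term
        while current is not None:
            chain.append(current)
            current = self.broader[current]
        return chain[::-1]


@dataclass
class Mapping:
    """Intervocabulary relation: source term -> target term."""
    source: Vocabulary
    target: Vocabulary
    pairs: dict[str, str] = field(default_factory=dict)

    def add(self, source_term: str, target_term: str) -> None:
        # Both ends must exist, so a mapping cannot point at an undefined term.
        if source_term not in self.source.broader:
            raise KeyError(f"not in {self.source.name}: {source_term}")
        if target_term not in self.target.broader:
            raise KeyError(f"not in {self.target.name}: {target_term}")
        self.pairs[source_term] = target_term


# Hypothetical two-level local vocabulary and a one-level target vocabulary.
local = Vocabulary("local publication types", applies_to="publication")
local.add_term("scientific publication")
local.add_term("refereed article", parent="scientific publication")

cerif = Vocabulary("CERIF-2000 publication types", applies_to="publication")
cerif.add_term("journal article")

mapping = Mapping(local, cerif)
mapping.add("refereed article", "journal article")
```

Storing the parent with each term keeps browsing by hierarchy cheap, and checking both ends in `Mapping.add` is one simple way to keep mappings consistent as vocabularies evolve.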

  10. What we need
     • A framework and application which (operational stage)
       • Perform domain transformation (vocabulary mapping) in export/import operations (short-term goal)
       • Perform domain transformation (vocabulary mapping) in information retrieval operations
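A minimal sketch of domain transformation during export, assuming records are plain dictionaries with a `type` field. The local terms, the record layout, and the fallback rule are hypothetical; the target terms are a subset of the CERIF-2000 list on slide 2.

```python
# Subset of the CERIF-2000 publication types listed on slide 2.
CERIF_TYPES = {
    "journal article", "conference paper", "book", "book chapter",
    "dissertation", "report", "miscellaneous",
}

# Hypothetical local vocabulary mapped onto CERIF-2000 terms.
LOCAL_TO_CERIF = {
    "refereed article": "journal article",
    "proceedings contribution": "conference paper",
    "PhD thesis": "dissertation",
}


def export_record(record, mapping, fallback="miscellaneous"):
    """Return a copy of the record with its type translated, plus a loss flag.

    The flag is True when the local term had no mapping and the fallback
    term was used, i.e. when the export lost information.
    """
    local_type = record["type"]
    target = mapping.get(local_type, fallback)
    if target not in CERIF_TYPES:
        raise ValueError(f"mapping target is not a CERIF-2000 term: {target}")
    return dict(record, type=target), local_type not in mapping


records = [
    {"title": "Example thesis", "type": "PhD thesis"},
    {"title": "Example poster", "type": "poster"},
]
exported = [export_record(r, LOCAL_TO_CERIF) for r in records]
```

Import would use the inverse mapping, which is not always well defined: several local terms may map to the same target term.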

  11. How to implement
     • Several possible ways
       • Simple database descriptions of vocabularies and their mappings
       • More complicated descriptions of the vocabularies and mappings, for example in Description Logics

  12. Database implementation
     • Easy to implement
     • Easy to develop tools for vocabulary creation, maintenance, and mapping definition
     • More or less easy to implement transformation tools
     • But very poor at expressing the meaning of terms and term definitions
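A rough idea of how little schema the database approach needs, using Python's built-in `sqlite3` module. The table and column names are assumptions, not the schema of the implementation mentioned in these slides.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE vocabulary (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE term (
    id            INTEGER PRIMARY KEY,
    vocabulary_id INTEGER NOT NULL REFERENCES vocabulary(id),
    label         TEXT NOT NULL,
    parent_id     INTEGER REFERENCES term(id),  -- NULL for top-level terms
    UNIQUE (vocabulary_id, label)
);
CREATE TABLE term_mapping (
    source_term_id INTEGER NOT NULL REFERENCES term(id),
    target_term_id INTEGER NOT NULL REFERENCES term(id),
    PRIMARY KEY (source_term_id, target_term_id)
);
""")

# Hypothetical sample data: one local term mapped to one CERIF-2000 term.
conn.executemany("INSERT INTO vocabulary (id, name) VALUES (?, ?)",
                 [(1, "local"), (2, "CERIF-2000")])
conn.executemany(
    "INSERT INTO term (id, vocabulary_id, label, parent_id) VALUES (?, ?, ?, ?)",
    [(1, 1, "scientific publication", None),
     (2, 1, "refereed article", 1),
     (3, 2, "journal article", None)])
conn.execute("INSERT INTO term_mapping VALUES (2, 3)")


def translate(label, source_vocab, target_vocab):
    """Return the mapped label in the target vocabulary, or None if unmapped."""
    row = conn.execute("""
        SELECT t.label
        FROM term AS s
        JOIN vocabulary   AS sv ON sv.id = s.vocabulary_id
        JOIN term_mapping AS m  ON m.source_term_id = s.id
        JOIN term         AS t  ON t.id = m.target_term_id
        JOIN vocabulary   AS tv ON tv.id = t.vocabulary_id
        WHERE s.label = ? AND sv.name = ? AND tv.name = ?
    """, (label, source_vocab, target_vocab)).fetchone()
    return row[0] if row else None
```

The tables hold labels, parent links, and term-to-term links only. Nothing in them says what a term means, which is the weakness named in the last bullet.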

  13. Database implementation
     • Impossible to define terms through their attributes
     • Example: an “EU Project” is a “Project” whose “funding organization” is a “fund” of the EU or the European Commission
     • Hard to measure information loss…
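For comparison, the “EU Project” example reads naturally as a Description Logic concept definition. The concept and role names below are illustrative renderings of the slide's wording, with EUFund standing for “a fund of the EU or the European Commission”:

```latex
\[
\mathit{EUProject} \;\equiv\; \mathit{Project} \,\sqcap\, \exists\, \mathit{fundingOrganization}.\mathit{EUFund}
\]
```

A plain term table can store the label “EU Project” but not this condition, so it cannot decide from a project's attributes whether the term applies.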

  14. Database implementation
     • But it is already implemented and can be part of a free CERIF-2000 implementation
     • Currently an MS ASP application; JSP or a more open standard would be better

  15. Screenshot. Vocabulary development

  16. Screenshot. Intervocabulary mappings

  17. Description Logic
     • Much more powerful for describing taxonomies and mappings
     • But requires very specialized experience and knowledge (some logical theories and languages)
     • Not directly integrated with the database, since terms are usually not stored in a database

  18. Description Logics
     • Can DAML+OIL make the situation easier?
     • Possibly yes; this should be investigated in an EU project
     • Very important for a Europe-wide CRIS

  19. Summary
     • We can offer free tools to CERIF CRIS developers
     • Once we get feedback, the situation will be clearer
     • We need to obtain an EU project to solve the vocabulary problems
