1 / 28

Part 1: overhead presentation

GEMET, the General Environmental Multilingual Thesaurus: Development, User Perspectives and Plans for a Thesaurus System. Part 1: overhead presentation Bruno Felluga, CNR - Consiglio Nazionale delle Richerche, Rome, Italy Part 2: slide show Stefan Jensen, project leader ETC/CDS,

marcos
Download Presentation

Part 1: overhead presentation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. GEMET, the General Environmental Multilingual Thesaurus: Development, User Perspectives and Plans for a Thesaurus System Part 1: overhead presentation Bruno Felluga, CNR - Consiglio Nazionale delle Richerche, Rome, Italy Part 2: slide show Stefan Jensen, project leader ETC/CDS, Lower Saxony Ministry of the Environment Hannover, Germany Open Forum on Metadata Registries, Santa Fe, NM January 20, 2000 EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  2. Outline GEMET presentation - part 2 “linking terminology and applications” GEMET activities performed by the ETC/CDS - co-ordination of the thesaurus development - GEMET usage for indexing and retrieving environmental metadata - development of application around GEMET - assessing 3rd party user needs to incorporate into future developments EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  3. Co-ordination of the development - Encouraging and co-ordinating the translation of the core terminology into 12 languages - Contracting application development around GEMET - implement shared coding lists (value domains) - Promoting the use of GEMET through marketing activities - Distributing GEMET and supplying technical helpdesk EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  4. GEMET - usage for indexing metainformation GEMET has been used in the work of EEA to index metadata from the following resources: - The Directory of Information Resources (DIR) - The Reporting Obligation Database (ROD) to do this, 2 applications were developed: - MS-Access based tool for metadata registry (WinCDS) - Webbased JAVA tool for online registration(prototype) EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  5. Thesaurus part of WinCDS EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  6. JAVA based online registration - the indexing EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  7. GEMET - usage for indexing metainformation Directory of Information Resources Total dataset sum: 931 Controlled terms in use: 655 of ~5300 Total descriptors sum: 4714 (GEMET terms used for indexing) Term ranking: 121 of 655 terms have been used more than 10 times EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  8. Terms used more than 60 times EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  9. DIR : term ranking

  10. DIR : term ranking

  11. GEMET - usage for indexing metainformation Reporting Obligations Database ROD prototype Questions Datasets total: 22844 Controlled terms in use: 323 of 5300 have been used 86628 times Top ranking: Term ‚atmospheric emissions‘ has been used 14443 times Sources Datasets total: 42 Controlled terms in use: 65 of 5300 have been used 256 times EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  12. ROD questions: term ranking

  13. ROD questions: term ranking

  14. ROD questions: term ranking

  15. ROD sources: term ranking between 10 and 33

  16. ROD sources: term ranking

  17. GEMET - usage to browse and retrieve metadata • GEMET is used to browse or retrieve metadata within 5 applications: • The ThesShow GEMET browser • The WebCDS accessing the DIR via HTML • The WebCDS accessing the DIR via JAVA applets • The multilingual search service (MSS) • The Reporting Obligation Database (ROD) EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  18. JAVA based thesaurus browser for WebCDS EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  19. The Multilingual Search Service (MSS) • Motivation • distributed multilingual document collections of European institutions (European Environment Agency, EEA) • support query formulation in user‘s native language • search and retrieve documents in all understandable languages • Approach of EEA‘s Multilingual Search Service (MSS) • thesaurus support for query formulation(domain specific thesaurus required, e.g. GEMET) • translation by making use of multilinguality of theseaurus(GEMET is available in 12 languages) • use translations as input for off-the-shelf Web search engine (e.g., Netscape Compass Server) EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  20. Using a term for searching metadata EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  21. Search results from websites within the EERC EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  22. Questionnaire about GEMET usage Goals: - learn more about current users - get guidance from usage for future development Process: - the current ~200 GEMET users from all over the world have been addressed by e-mail - the 2 page (+annexes) questionnaire was made available digitally and as a form on internet - Survey was performed in November and December 1999 EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  23. Areas and frequency of GEMET usage 100% translation 56% translation22% indexing/retrieval 42% translation 33% indexing 22% retrieval EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  24. Current usage of languages in GEMET %

  25. Evaluation of the thesaurus content %

  26. Usage of the GEMET browser ThesShow %

  27. GEMET - conclusions from the own indexing experience and the questionnaire General guidelines: - The GEMET content should remain stable, minor improvements are justified - There is a need to add new functionalities to the tools to allow the user to customise an own “thesaurus system” EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

  28. Contact information URL: http://www.mu.niedersachsen.de or http://etc-cds.eionet.eu.int eMail: etc/cds@mu.niedersachsen.de EUROPEAN TOPIC CENTREON CATALOGUEOF DATA SOURCES (ETC/CDS) EUROPEAN ENVIRONMENT AGENCY

More Related