
INSPIRE-ing GEMET: Enhancing Metadata Creation and Discovery for Spatial Datasets

This presentation describes how the INSPIRE Spatial Data Themes were integrated into the GEneral Multilingual Environmental Thesaurus (GEMET) and linked to GEMET concepts. It outlines three approaches to metadata creation (using the GEMET structure, using metadata statistics, and a mixed approach) and three approaches to discovery (metadata statistics on keyword occurrence, metadata statistics on keyword co-occurrence, and the GEMET structure).


Presentation Transcript


  1. Overview • 1) Overview • 2) Background • 3) INSPIRE-ing GEMET • 4) Scenarios for metadata creation • 5) Scenarios for discovery

  2. 2) Background • Motivation: For spatial datasets and spatial dataset series, the INSPIRE Implementing rule for Metadata requires that “at least one keyword shall be provided from the General Environmental Multi-lingual Thesaurus (GEMET) describing the relevant spatial data theme as defined in Annex I, II or III to Directive 2007/2/EC.”

  3. 2) Background • GEMET • GEneral Multilingual Environmental Thesaurus • Developed for European Topic Centre on Catalogue of Data Sources (ETC/CDS) and European Environment Agency (EEA) • Published and managed by EIONET • More than 5000 concepts

  4. 2) Background • [Figure: concept hierarchy with the nodes Hydrosphere, Hydrologic Cycle, Lake, Sea, Ocean Circulation, Hydrologic Balance, Open Sea, Deep Sea, Ocean, Ocean Current, Runoff, Sediment Transport, Ocean Temperature]

  5. 2) Background • INSPIRE Spatial Data Themes • INSPIRE spatial data themes and descriptions are defined in Annex I, II and III of Directive 2007/2/EC* • Available in all 23 official EU languages • * http://eur-lex.europa.eu/JOHtml.do?uri=OJ:L:2007:108:SOM:EN:HTML

  6. INSPIRE-ing GEMET step 1

  7. 3) INSPIRE-ing GEMET – step 1 • Integrating INSPIRE Spatial Data Themes into GEMET • [Figure labels: INSPIRE Spatial Data Themes; GEMET languages*] • * There is a mismatch between the languages available in the current version of GEMET and the 23 official EU languages.

  8. 3) INSPIRE-ing GEMET – step 1

  9. 3) INSPIRE-ing GEMET – step 1

  10. INSPIRE-ing GEMET step 2 • But … what is the relation between GEMET concepts and INSPIRE Spatial Data Themes?

  11. 3) INSPIRE-ing GEMET – step 2 • Linking GEMET concepts and INSPIRE Themes • Link was established manually: List of concepts that are linked to an INSPIRE Spatial Data Theme

  12. 4) Metadata Creation • Metadata creation • Choice of additional keywords based on a given INSPIRE theme • Using GEMET structure • Using metadata statistics • Mixed approach

  13. 4) Metadata Creation • Using GEMET structure • The metadata creator sees a tree representation of the GEMET taxonomy of concepts linked to the INSPIRE theme. • Allows searching among all sub-concepts to choose additional keywords • Considers the GEMET taxonomy • [Figure: tree representation using GEMET structure]
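The tree-based selection could be sketched roughly as follows. This is only an illustration, not the tool shown in the presentation: the `NARROWER` mapping is a hypothetical, hand-written excerpt (not real GEMET data), and the function names are invented for the example.

```python
from typing import Dict, List

# Hypothetical excerpt of broader -> narrower links; NOT taken from GEMET.
NARROWER: Dict[str, List[str]] = {
    "hydrosphere": ["hydrologic cycle", "lake", "sea"],
    "hydrologic cycle": ["runoff"],
    "sea": ["open sea", "deep sea"],
}


def sub_concepts(concept: str, narrower: Dict[str, List[str]]) -> List[str]:
    """Return every concept below `concept`, depth-first, in listing order."""
    result: List[str] = []
    for child in narrower.get(concept, []):
        result.append(child)
        result.extend(sub_concepts(child, narrower))
    return result


def render_tree(concept: str, narrower: Dict[str, List[str]], depth: int = 0) -> List[str]:
    """Return indented text lines showing `concept` and its sub-concepts."""
    lines = ["  " * depth + concept]
    for child in narrower.get(concept, []):
        lines.extend(render_tree(child, narrower, depth + 1))
    return lines


if __name__ == "__main__":
    # Print the tree a metadata creator would browse for one linked concept.
    print("\n".join(render_tree("hydrosphere", NARROWER)))
```

A metadata creator who starts from a concept linked to an INSPIRE theme would then search or browse the list returned by `sub_concepts` to pick additional keywords.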

  14. 4) Metadata Creation • Using metadata statistics • The metadata creator sees a cloud representation of the GEMET concepts linked to INSPIRE themes. The size of each keyword depends on its frequency of use. • Independent of the GEMET classification • Will support a natural selection of terms and result in a set of frequently used keywords • [Figure: cloud representation using metadata statistics]
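One way to derive keyword sizes for such a cloud is sketched below. The slide does not say how frequency maps to size, so the linear scaling, the point sizes, and the sample records are all assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, Iterable, List


def keyword_sizes(
    records: Iterable[List[str]], min_size: int = 10, max_size: int = 32
) -> Dict[str, int]:
    """Map each keyword to a font size that grows linearly with its use.

    `records` holds one keyword list per metadata record; a keyword is
    counted once per record. Linear scaling is an assumption.
    """
    counts = Counter(kw for keywords in records for kw in set(keywords))
    if not counts:
        return {}
    lowest, highest = min(counts.values()), max(counts.values())
    span = highest - lowest
    sizes: Dict[str, int] = {}
    for kw, n in counts.items():
        # If all keywords are equally frequent, show them all at max size.
        scale = (n - lowest) / span if span else 1.0
        sizes[kw] = round(min_size + scale * (max_size - min_size))
    return sizes


# Hypothetical metadata records, each annotated with a few keywords.
SAMPLE_RECORDS = [["lake", "runoff"], ["lake"], ["lake", "sea"]]

if __name__ == "__main__":
    print(keyword_sizes(SAMPLE_RECORDS))
```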

  15. 4) Metadata Creation • Mixed approach • The metadata creator sees a tree (or graph) representation in which the size of each keyword depends on its frequency of use. • Combines the benefits of both approaches • Might be more difficult for inexperienced users to apply • [Figure: mixed representation using GEMET structure and metadata statistics]

  16. 5) Discovery • Discovery • Help requestors in selecting search term(s): 3 approaches • Using metadata statistics on keyword occurrence • Using metadata statistics on keyword co-occurrence • Using GEMET structure

  17. 5) Discovery • Using metadata statistics: occurrence of keywords • The user sees the GEMET concepts that have so far been used to annotate data (cloud representation). All other concepts are omitted. • Each presented keyword has at least one match. • The user gets an impression of the number of results before performing the search.
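A minimal sketch of the occurrence-based filter, assuming the catalogue can be read as one keyword list per record. The vocabulary, the records, and the function name are hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Set, Tuple


def used_keywords(
    records: Iterable[List[str]], vocabulary: Set[str]
) -> List[Tuple[str, int]]:
    """List thesaurus concepts that annotate at least one record.

    Returns (keyword, number of matching records) pairs, most frequent
    first. Unused concepts and keywords outside `vocabulary` are omitted.
    """
    counts = Counter(
        kw for keywords in records for kw in set(keywords) if kw in vocabulary
    )
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical thesaurus excerpt and catalogue content.
VOCABULARY = {"lake", "runoff", "sea"}
CATALOGUE = [["lake", "runoff"], ["lake"], ["free-text tag"]]

if __name__ == "__main__":
    for keyword, hits in used_keywords(CATALOGUE, VOCABULARY):
        print(f"{keyword}: {hits} record(s)")
```

Because the count is per record, it doubles as the preview of how many results a search on that keyword would return.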

  18. 5) Discovery • Using metadata statistics: co-occurrence of keywords • Can be used to suggest additional search terms that narrow down the search. After selecting one keyword, the user sees the keywords that are frequently used in combination with it. • [Figure: using metadata statistics (co-occurrence of keywords)]
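The co-occurrence suggestion could be computed along these lines. This is a sketch under the assumption that co-occurrence simply means appearing in the same metadata record; the sample data and names are hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def co_occurring(
    records: Iterable[List[str]], selected: str, top_n: int = 5
) -> List[Tuple[str, int]]:
    """Suggest keywords that share records with `selected`.

    Returns up to `top_n` (keyword, shared record count) pairs, most
    frequent first, with ties broken alphabetically.
    """
    counts: Counter = Counter()
    for keywords in records:
        unique = set(keywords)
        if selected in unique:
            counts.update(unique - {selected})
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))[:top_n]


# Hypothetical catalogue content.
ANNOTATED = [["lake", "runoff"], ["lake", "runoff", "sea"], ["sea"]]

if __name__ == "__main__":
    print(co_occurring(ANNOTATED, "lake"))
```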

  19. 5) Discovery • Using the GEMET structure • Find additional matches by expanding the query with similar terms: • list of GEMET terms in a tree structure • automatic query expansion to include similar terms. Similarity measures … ?
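The slide leaves the similarity measure open. Purely as an illustration, the sketch below treats narrower terms within a fixed number of levels as "similar"; that choice, the `HIERARCHY` excerpt, and the function name are assumptions, not the authors' method.

```python
from typing import Dict, List

# Hypothetical excerpt of broader -> narrower links; NOT taken from GEMET.
HIERARCHY: Dict[str, List[str]] = {
    "hydrosphere": ["sea", "lake"],
    "sea": ["open sea", "deep sea"],
}


def expand_query(
    term: str, narrower: Dict[str, List[str]], max_depth: int = 1
) -> List[str]:
    """Return `term` plus its narrower terms down to `max_depth` levels."""
    expanded = [term]
    frontier = [term]
    for _ in range(max_depth):
        next_frontier: List[str] = []
        for current in frontier:
            for child in narrower.get(current, []):
                if child not in expanded:  # avoid duplicates and cycles
                    expanded.append(child)
                    next_frontier.append(child)
        frontier = next_frontier
    return expanded


if __name__ == "__main__":
    # The expanded list would be sent to the catalogue as an OR query.
    print(expand_query("hydrosphere", HIERARCHY, max_depth=2))
```

A larger `max_depth` finds more matches at the cost of less precise results, which is one reason the choice of similarity measure matters.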

  20. Thank you very much for your attention • {nicole.ostlaender|michael.lutz}@jrc.it
