210 likes | 233 Views
This project aims to integrate the General Environmental Multilingual Thesaurus (GEMET) with the INSPIRE Spatial Data Themes. It provides a mixed approach for metadata creation, using the GEMET structure and metadata statistics, and offers three approaches for discovery, including using metadata statistics on keyword occurrence and co-occurrence, as well as the GEMET structure.
E N D
Overview • 1) Overview • 2) Background • 3) INSPIRE-ing GEMET • 4) Scenarios for metadata creation • 5) Scenarios for discovery
2) Background • Motivation For spatial datasets and spatial dataset series the INSPIRE Implementing rule for Metadata requires that “at least one keyword shall be provided from the General Environmental Multi-lingual Thesaurus (GEMET) describing the relevant spatial data theme as defined in Annex I, II or III to Directive 2007/2/EC."
2) Background • GEMET • GEneral Multilingual Environmental Thesaurus • Developed for European Topic Centre on Catalogue of Data Sources (ETC/CDS) and European Environment Agency (EEA) • Published and managed by EIONET • More than 5000 concepts
2) Background Hydrosphere Hydrologic Cycle Lake Sea Ocean Circulation Hydrologic Balance Open Sea Deep Sea Ocean Ocean Current Runoff Sediment Transport Ocean Temperature
2) Background • INSPIRE Spatial Data Themes • INSPIRE spatial data themes and descriptions are defined in Annex I, II and III of the Directive 2007/2/EC* • Available in all 23 official EU languages *http://eur-lex.europa.eu/JOHtml.do? uri=OJ:L:2007:108:SOM:EN:HTML)
3) INSPIRE-ing GEMET – step 1 Integrating INSPIRE Spatial Data Themes into GEMET INSPIRE Spatial Data Themes GEMET languages* * There is a mismatch between the languages available in the current version of GEMET and the 23 official EU languages.
But …. What is the relation between GEMET concepts and INSPIRE Spatial Data Themes? INSPIRE-ing GEMET step 2
3) INSPIRE-ing GEMET – step 2 • Linking GEMET concepts and INSPIRE Themes • Link was established manually: List of concepts that are linked to an INSPIRE Spatial Data Theme
4) Metadata Creation • Metadata creation • Choice of additional keywords based on a given INSPIRE theme • Using GEMET structure • Using metadata statistics • Mixed approach
4) Metadata Creation • Using GEMET structure Metadata creator sees tree representation of GEMET taxonomy of concepts linked to the INSPIRE theme. Allows search among all sub-concepts to chose additional keywords • considers GEMET taxonomy Tree representation using GEMET structure
4) Metadata Creation • Using metadata statistics Metadata creator sees cloud representation of GEMET concepts linked to INSPIRE themes. Size of each keyword depends on frequency of use • independent of the GEMET classification • will support a natural selection of terms and result in a set of frequently used keywords cloud representation using metadata statistics
4) Metadata Creation • Mixed approach Metadata creator sees tree (or graph) representation, size of each keyword depends on frequency of use. • Combines the benefits of both approaches • Might be more difficult to apply by inexperienced users Mixed representation using GEMET structure and metadata statistics
5) Discovery • Discovery • help requestors in selecting search term(s): 3 apporaches • Using metadata statistics on keyword occurrence • Using metadata statistics on keyword co-occurrence • Using Gemet structure
5) Discovery • Using metadata statistics: occurrence of keywords The user sees GEMET concepts that have so far been used to annotate data (cloud representation). All other concepts are omitted • Each presented keyword has at least one match. • The user gets an impression of the amount of results before he performs the search
5) Discovery • Using metadata statistics: co-occurrence of keywords Can be used to suggest additional search terms to narrow downhis search. After selecting one keyword, the users sees those keywords that are frequently used in combination. Using metadata statistics (co-occurrence of keywords)
5) Discovery • Using the GEMET structure • Find additional matches by expanding the query with similar terms: • list of GEMET terms in a tree structure • automatic query expansion to include similar terms. Similarity measures … ?
Thank you very much for your attention • {nicole.ostlaender|michael.lutz}@jrc.it