60 likes | 172 Views
Virtual Language Observatory Facetted Browsing. Claus Zinn Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands Claus.Zinn@mpi.nl. Clarin Information Day, Nijmegen, July 1st 2009. Facetted Navigation.
E N D
Virtual Language ObservatoryFacetted Browsing Claus Zinn Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands Claus.Zinn@mpi.nl Clarin Information Day, Nijmegen, July 1st 2009
Facetted Navigation • Help users browse/find resources based on more than one dimension, or facet • clearly defined, mutually exclusive, collectively exhaustive • E.g, book collection classified using an author facet, a subject facet, a date facet etc. • Selecting a facet refines the result set • Can see breakdown and projections of the items along the dimensions (given prior facets selection) • Helps gathering insights about the data they are exploring • Used by many commercial sites • Computationally intensive: #contexts in browsing space grows exponentially with #items, #facets, #values
Virtual Language Observatory(s) • CLARIN LRT Resources • 42 attributes • 9 facets, with up to 196 values (organisations) • 828 records • DEMO • CLARIN LRT Tools, DEMO • DFKI NLP Software Registry, DEMO • IMDI Metadata • 32 attributes • 13 facets, with up to 365 values (language) • Ca. 190.000 IMDI records, all corpora • DEMO
Challenges • Merge various DBs • Get on board the various existing content providers • Unify/map between metadata schemas • Use facets as focus points • Indicate origin of information provider • Scalability issues • #items, #facets, #values • Usability issues • Which facets, when? • Curation • Local/central & synch. • e.g., organisation names, language names • Integration with other access methods • IMDI Browser, Geographical Browser, Lexical Space (LEXUS), Conceptual Space (ViCoS)