160 likes | 287 Views
The Role of Automated Categorization in E-Government Information Retrieval. Tanja Svarre & Marianne Lykke, Aalborg University, DK ISKO conference , 8th of July , 2013. Agenda. Background of the study Theoretical framework Research methods Results Summary and closing remarks.
E N D
The Role of Automated Categorization in E-Government Information Retrieval Tanja Svarre & Marianne Lykke, Aalborg University, DK ISKO conference, 8th of July, 2013.
Agenda • Background of the study • Theoreticalframework • Research methods • Results • Summary and closing remarks
Background to the search test • Initiated and partiallycofinanced by the Danish National IT and Telecom Agency • Purpose: To investigatehowautomaticassignment of metadata cancontribute to the intention of increasedefficiency and effectivenessin (Danish) e-government
Building on indexing/categorization: Early Cranfield tests Categorization is helpful: when the query is vague, broad, general, or ambiguous whenresultrakingsaredeficient (Käki, 2005) in supportingexploratorysearches in understanding large search sets (Kules & Shneiderman, 2004; 2005)
Research methods • Case studyin the Danish TaxAuthorities • Search test: • Controlled lab test • Comparison test • Professional users • Domain specificsearchtasks • Pre test questionnaire • Log data • Post search interview
Data: Search test • System characteristics: • Prototype of the corporateintranet • www.skat.dkcontentand internal information • 2 search systems: • Freetextindexing (SYSTEM A) • Categorization (SYSTEM B) • 32 test persons • 3 controlled and 1 naturalsearchtask per session, 2 tasks per system
Success at tasklevel • At tasklevel the success of the two systems differs
System B (cat.) omissions • Highly relevant documentsarediscoveredbefore a category has beenselected • Relevant documentsarelocatedwhile waiting for B (cat.) to categorizesearchresults • Categorizationis not relevant whenfewdocumentsareretrieved
Summary • Categorization is useful: • Whenemployees do not posessextensiveknowledgeabout the task at hand • In offering new perspectives on the composition of a qury • In understanding facets of queries • Whentaskknowledge is present, categorization is used to support the assumptions of a correctsearch
Summary • Categorization is omittedwhen: • Search resultsarelimited • When relevant documentsareranked at the top of the results
National IT & Telecom Agency: Findings • The participants start out with freetextindexing and supplement with the otherwhennecessary • The indexingmethodscomparedarecomplementary • To meet the variety of information needsseveralindexingme-thodsshouldberepresentedsimultaneously