240 likes | 419 Views
Beyond Basic Faceted Search Ben-Yitzhak, et al. Fahimeh Fakour CS 572 Summer 2010. Introduction. Importance and significance Background Information Objective Related work Approach and Solutions Enhancements Contributions Pros & Cons. 1. Importance and Significance. Too much info
E N D
Beyond Basic Faceted SearchBen-Yitzhak, et al. Fahimeh Fakour CS 572 Summer 2010
Introduction • Importance and significance • Background Information • Objective • Related work • Approach and Solutions • Enhancements • Contributions • Pros & Cons Beyond Basic Faceted Search
1. Importance and Significance • Too much info • Transactions Beyond Basic Faceted Search
1. Importance and Significance (cont) • Categories, lists, and the human mind Beyond Basic Faceted Search
2. Background Information • Research done in IBM & Yahoo Research labs • Facets, buckets, and categories • Navigate multiple paths for different ordering • Free text queries • List of matching documents with count Beyond Basic Faceted Search
3. Objective • Extend traditional facet • Beyond numbers Numbers Words • Search & Index correlated documents • Similarity to OLAP: multi-dimensional data Beyond Basic Faceted Search
4. Related Work • Multifaceted search • Lexical subsumption • Synsets and hypernym • RawSugar social tagging • Online Analytical Processing (OLAP) • Multi-dimensional data • Aggregation of data: Cube • N-dimensional “group by” Exciting new technique Beyond Basic Faceted Search
5. Approach & Solutions 5.1 Technologies: Lucene & Solr 5.2 Data Model 5.3 Facet hierarchy: Forest 5.4 Creating the facet paths 5.5 Running the facet query 5.6 Example Beyond Basic Faceted Search
5.1. Technologies: Lucene & Solr • Posting element: docID, offset, payload • Matching document processing byte array of additional info (runtime accessible) + “hit collector” to organize into facets Beyond Basic Faceted Search
5.2. Data Model • Taxonomy: hierarchical relationships among facets • Predefined taxonomy • Acquired/Learned through documents • Facet-path forest • Tree: top-level facet Beyond Basic Faceted Search
5.3. Facet hierarchy: Forest Beyond Basic Faceted Search
5.4. Creating the facet paths • Posting element for document for each prefix of Pi • Add path to taxonomy index • Encode all k paths related to this document Beyond Basic Faceted Search
5.5. Running the facet query • Terms: • Faceted query string + taxonomy subtrees • Faceted result set ranked list of documents matching query + counters • Lucene: use the Taxonomy Index function to determine ordinal number of paths Beyond Basic Faceted Search
5.6. Example Facet$clothing: doc1,doc2 Facet$clothing$children’s: doc1 Beyond Basic Faceted Search
6. Enhancements Beyond Basic Faceted Search
6.1. Business Intelligence • Qualitative rather than quantitative • Best sellers rather than number of books published by author Beyond Basic Faceted Search
6.2. Dynamic Facets: Welcome to the real world • Not always independent data • Example: • Running shorts • Different sizes per color • Location & price Beyond Basic Faceted Search
6.2. Dynamic Facets: Solution • Use tree over the data Beyond Basic Faceted Search
6.2. Dynamic Facets: Solution (cont) Beyond Basic Faceted Search
7. Contributions • “rich” aggregation : qualitative • Engineering details • Correlation in facet values Independent Facets Beyond Basic Faceted Search
8.1. Pros • Detailed description of engineering aspects & design decisions • Use of implemented technologies • Clearly defines the scope of the paper • Give foundation/background information • Compatible with real life data Beyond Basic Faceted Search
8.2. Cons • Experiments and testing: No qualitative measurement • effectiveness of “qualitative” facets • Not explain relevance of some of the previous work • Criteria for display/grouping? • Key use cases & known user access patterns not explained • Build taxonomy: depth/breadth? Beyond Basic Faceted Search
Thank You Beyond Basic Faceted Search
References Ben-Yitzhak, et al. “Beyond Basic Faceted Search”. Proceedings of the international conference on Web search and web data mining. Pp.33-44, 2008. <http://nadav.harel.org.il/papers/p33-ben-yitzhak.pdf> “Faceted Search with Solr” Lucid Imagination. July 1, 2010. <http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Faceted-Search-Solr > “Faceted classification” Wikipedia. July 7, 2010 <http://en.wikipedia.org/wiki/Faceted_classification > Lemieux, Earley, and Associates. “Designing for Faceted Search” User Interface Engineering. July 6, 2010 <http://www.uie.com/articles/faceted_search/> (Originally in KM World, March 2009) Mattman, Chris. “Query Models” (presentation slides for class) Beyond Basic Faceted Search 24