1 / 1

Searching with Autocompletion cool algorithms for a cool feature Holger Bast bast@mpi-inf.mpg.de

Doc 1. Doc 2. A B. A-D. D. ADA B AA B 1110101101 00110001000001. Doc 3. Doc 4. Doc 5. A B. B C D. -. A-B. C-D. Doc 6. Doc 7. Doc 8. B B B 1010010 111. CDDC 0001111 0110. -. A D. A B D. Doc 9. Doc10. -. B C. C. A. B. D. 000. 000. 0000. 1000.

derex
Download Presentation

Searching with Autocompletion cool algorithms for a cool feature Holger Bast bast@mpi-inf.mpg.de

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Doc 1 Doc 2 A B A-D D ADA B AA B 1110101101 00110001000001 Doc 3 Doc 4 Doc 5 A B BCD - A-B C-D Doc 6 Doc 7 Doc 8 B B B 1010010 111 CDDC 0001111 0110 - A D ABD Doc 9 Doc10 - B C C A B D 000 000 0000 1000 Searching with Autocompletioncool algorithms for a cool featureHolger Bastbast@mpi-inf.mpg.de Please try it yourself on the MPII homepage. Type a?for help before you start searching! For the query adfocs rag all words are displayed that start with rag and that occur in a document that also containsa word starting with adfocs. This list of completions is updated with every new letter you type! The list of top hits is also updated with every new letter you type. It is often surprising how little one has to type to get to what one was looking for! The core algorithmic problem Given a range of words W(all completions of the last word the user has started typing) and a set of documents D (the hits of the preceding part of the query), compute the subset W' ⊆ W of words that occur in at least one document ofD and the subset D' ⊆ D of documents containing a word from W'. typically |W'| << |W| Our new data structure ~ |W'|time per query Ordinary Inverted Index ~ |W|time per query A: 1,3,7,8 B: 1,3,5,8,10 C: 5,10 D: 2,5,7,8

More Related