1 / 54

Efficient Interactive Fuzzy Keyword Search

Efficient Interactive Fuzzy Keyword Search. Shengyue Ji , Guoliang Li, Jianhua Feng , Chen Li University of California, Irvine WWW 2009 1 Dec 2011 Presentation @ IDB Lab. Seminar Presented by Jee -bum Park. Outline . Introduction Indexing Methods Single Keyword Multiple Keywords

nhu
Download Presentation

Efficient Interactive Fuzzy Keyword Search

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Efficient Interactive Fuzzy Keyword Search ShengyueJi, Guoliang Li, JianhuaFeng , Chen Li University of California, Irvine WWW 2009 1 Dec 2011 Presentation @ IDB Lab. Seminar Presented by Jee-bum Park

  2. Outline • Introduction • Indexing Methods • Single Keyword • Multiple Keywords • Experiments • Conclusions

  3. Introduction • http://searchenginewatch.com/article/2128218/Google-Searchers-Use-Autocomplete-Most-Ignore-Google-Instant-Eye-Tracking-Study

  4. Introduction • http://searchenginewatch.com/article/2128218/Google-Searchers-Use-Autocomplete-Most-Ignore-Google-Instant-Eye-Tracking-Study

  5. Introduction • http://searchenginewatch.com/article/2128218/Google-Searchers-Use-Autocomplete-Most-Ignore-Google-Instant-Eye-Tracking-Study

  6. Introduction • http://searchenginewatch.com/article/2128218/Google-Searchers-Use-Autocomplete-Most-Ignore-Google-Instant-Eye-Tracking-Study

  7. Introduction • http://searchenginewatch.com/article/2128218/Google-Searchers-Use-Autocomplete-Most-Ignore-Google-Instant-Eye-Tracking-Study

  8. Introduction • http://searchenginewatch.com/article/2128218/Google-Searchers-Use-Autocomplete-Most-Ignore-Google-Instant-Eye-Tracking-Study

  9. Introduction • http://searchenginewatch.com/article/2128218/Google-Searchers-Use-Autocomplete-Most-Ignore-Google-Instant-Eye-Tracking-Study

  10. Introduction • http://searchenginewatch.com/article/2128218/Google-Searchers-Use-Autocomplete-Most-Ignore-Google-Instant-Eye-Tracking-Study

  11. Introduction • http://searchenginewatch.com/article/2128218/Google-Searchers-Use-Autocomplete-Most-Ignore-Google-Instant-Eye-Tracking-Study

  12. Introduction • A typical directory-search form

  13. Introduction • Interactive fuzzy search

  14. Introduction • “interactive, fuzzy search” • Interactive • The system searches for the best answers on the flyas the user types in a keyword query • Fuzzy • The system tries to find relevant records that include words similar to the keywords in the query, even if they do not match exactly

  15. Outline • Introduction • Indexing Methods • Single Keyword • Multiple Keywords • Experiments • Conclusions

  16. Indexing Methods • List

  17. Indexing Methods • List • Typed “li”

  18. Indexing Methods • List • Typed “lu”

  19. Indexing Methods • Trie 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  20. Indexing Methods • Trie • Typed “li” 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  21. Indexing Methods • Trie • Typed “li” 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  22. Indexing Methods • Trie • Typed “li” 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  23. Indexing Methods • Trie • Typed “li” 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  24. Indexing Methods • Trie • Typed “li” 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  25. Indexing Methods • Trie • Typed “li” 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  26. Indexing Methods • Trie • Typed “li” 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  27. Outline • Introduction • Indexing Methods • Single Keyword • Multiple Keywords • Experiments • Conclusions

  28. Single Keyword

  29. Edit distance Single Keyword 0 1 2 • Example • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  30. Edit distance Single Keyword 0 1 2 • Initial state: “” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  31. Edit distance Single Keyword 0 1 2 • Typed: “n” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  32. Edit distance Single Keyword 0 1 2 • Typed: “n” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  33. Edit distance Single Keyword 0 1 2 • Typed: “n” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  34. Edit distance Single Keyword 0 1 2 • Typed: “nl” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  35. Edit distance Single Keyword 0 1 2 • Typed: “nl” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  36. Edit distance Single Keyword 0 1 2 • Typed: “nl” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  37. Edit distance Single Keyword 0 1 2 • Typed: “nli” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  38. Edit distance Single Keyword 0 1 2 • Typed: “nli” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  39. Edit distance Single Keyword 0 1 2 • Typed: “nli” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  40. Edit distance Single Keyword 0 1 2 • Typed: “nlis” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  41. Edit distance Single Keyword 0 1 2 • Typed: “nlis” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  42. Edit distance Single Keyword 0 1 2 • Typed: “nlis” • Query = “nlis”, edit distance threshold = 2 0: \0 10: l 1 4 11: i 14: u 12: n 13: u 15: i 16: s 3, 4 5 7

  43. Outline • Introduction • Indexing Methods • Single Keyword • Multiple Keywords • Experiments • Conclusions

  44. Multiple Keywords • Challenges in multiple keywords • Intersection of multiple lists of keywords • Each prefix query keyword has • Multiple predicted complete keywords • The union of the lists of predicted keywords includes potential answers • The union lists of multiple query keywords need to be intersected in order to compute the answers to the query • Cache-based incremental intersection

  45. Multiple Keywords • HYB (H. Bast, I. Weber. Type Less, Find More: Fast Autocompletion Search with a Succinct Index. In SIGIR 2006) • The intersections can be computed in • The union can be computed in • Total time complexity W’ = { iphone, ipv4, ipv6 } D ∩ Dw = D’ = { 21, 172, 308, 759 }

  46. Multiple Keywords • Forward lists

  47. Outline • Introduction • Indexing Methods • Single Keyword • Multiple Keywords • Experiments • Conclusions

  48. Experiments • DBLP • It included about one million computer science publication records • Authors, title, conference or journal name, year, page numbers, URL • MEDLINE • It had about 4 million latest publication records related to life sciences and biomedical information • Authors, their affiliations, article title, journal name, journal issue

  49. Experiments • Computing prefixes similar to a keyword

  50. Experiments • List intersection of multiple keywords

More Related