
Keyword Searching and Browsing in Databases using BANKS


  1. Keyword Searching and Browsing in Databases using BANKS • Charuta Nakhe, Arvind Hulgeri, Gaurav Bhalotia, Soumen Chakrabarti, S. Sudarshan • Presented by Sushanth Sivaram Vallath

  2. Motivation • Keyword search of documents on the Web has been enormously successful • Simple and intuitive, no need to learn any query language • Database querying using keywords is desirable • SQL is not appropriate for casual users • Form interfaces are cumbersome: • They require a separate form for each type of query – confusing for casual users of Web information systems • They are not suitable for ad hoc queries

  3. Motivation • Many Web documents are dynamically generated from databases • E.g. Catalog data • Keyword querying of generated Web documents • May miss answers that need to combine information on different pages • Suffers from duplication overheads

  4. Examples of Keyword Queries • Air ticket reservation database • “DFW LAX” • University database • Info on courses • Online shopping • “Canon Digital Rebel”

  5. Differences from IR/Web Search • Related data split across multiple tuples due to normalization • Different keywords may match tuples from different relations

  6. Schema

  7. Basic Model • Database: modeled as a graph • Nodes = tuples • Edges = references between tuples • foreign key • Edges are directed.
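The graph model on this slide can be sketched in a few lines. This is a minimal illustration, not the BANKS implementation: the `rows` dictionary, the relation names, and the `build_graph` helper are all hypothetical, and each tuple is identified by an assumed (relation, primary key) pair.

```python
# Hypothetical toy data: each key identifies a tuple (a graph node) and maps
# to the list of tuples it references through foreign keys.
rows = {
    ("paper", "p1"): [],
    ("author", "a1"): [],
    ("author", "a2"): [],
    ("writes", "w1"): [("author", "a1"), ("paper", "p1")],
    ("writes", "w2"): [("author", "a2"), ("paper", "p1")],
}


def build_graph(rows):
    """Return adjacency lists: node -> list of nodes it references."""
    graph = {node: [] for node in rows}
    for node, refs in rows.items():
        for target in refs:
            # Directed edge from the referencing tuple to the referenced tuple.
            graph[node].append(target)
    return graph


graph = build_graph(rows)
```

Here each `writes` tuple has two outgoing edges (to its author and its paper), while the `paper` and `author` tuples have none, which reflects the direction of the foreign key references.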

  8. The BANKS Answer Model • Query: set of keywords {k1, k2, ..., kn} • Each keyword ki matches a set of nodes Si • Answer: rooted, directed tree connecting nodes, with one node from each Si • Root node has special significance, may be restricted to some relations • May include intermediate nodes not in any Si, which makes the answer a Steiner tree • Multiple answers • Ranking based on proximity + prestige

  9. Edge Directionality • Some popular tuples are connected to many other tuples • E.g. Students -> departments -> university • Popular tuples would create misleading shortcuts from every tuple to every other • E.g. every student would be closely linked with every other student via the department/university • Solution: define different forward and backward edge weights • Forward edges: In the direction of the foreign key reference
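To see why separate forward and backward weights remove the misleading shortcuts, consider the sketch below. The slide does not give the weight formula, so the choice here is an assumption made for illustration: forward edges cost 1, and a backward edge out of a node costs that node's indegree. The `weighted_edges` helper and the node names are hypothetical.

```python
def weighted_edges(fk_edges):
    """fk_edges: list of (referencing, referenced) pairs.

    Returns a dict (u, v) -> weight containing both forward and backward edges.
    """
    indegree = {}
    for _, dst in fk_edges:
        indegree[dst] = indegree.get(dst, 0) + 1

    weights = {}
    for src, dst in fk_edges:
        weights[(src, dst)] = 1.0  # forward: follows the foreign key
        # Backward (assumed formula): leaving a popular node is expensive.
        weights[(dst, src)] = float(indegree[dst])
    return weights


# Three students all reference the same department.
fk_edges = [("s1", "dept"), ("s2", "dept"), ("s3", "dept")]
w = weighted_edges(fk_edges)
path_cost = w[("s1", "dept")] + w[("dept", "s2")]
```

Under this assumed weighting, the path from `s1` to `s2` through the department costs 1 + 3 = 4, and the cost grows with the number of students in the department, so two students in a large department no longer look closely related.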

  10. Node Weight • Nodes have prestige weights too • Nodes with greater indegree are given greater prestige
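One simple way to derive a prestige weight from indegree is sketched below. The logarithmic damping is an assumption chosen for illustration, not a formula stated on the slide, and the `prestige` helper is hypothetical.

```python
import math


def prestige(fk_edges, nodes):
    """Assumed prestige score: log2(1 + indegree) for every node."""
    indegree = {node: 0 for node in nodes}
    for _, dst in fk_edges:
        indegree[dst] += 1
    return {node: math.log2(1 + deg) for node, deg in indegree.items()}


nodes = ["s1", "s2", "s3", "dept"]
fk_edges = [("s1", "dept"), ("s2", "dept"), ("s3", "dept")]
scores = prestige(fk_edges, nodes)
```

The department, referenced by three students, receives the highest score, while the unreferenced student tuples score zero. The logarithm keeps very popular nodes from dominating the ranking entirely.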

  11. Finding Answer Trees • Backward Expanding Search Algorithm: • Intuition: find vertices from which a forward path exists to at least one node from each Si • Run concurrent single-source shortest-path searches, one from each node matching a keyword • Create an iterator for each node matching a keyword • Traverse the graph edges in the reverse direction • Output a node whenever it lies in the intersection of the sets of nodes reached from each keyword
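The steps on this slide can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation: it keeps one Dijkstra-style search per keyword set (rather than one iterator per matching node), interleaves the searches through a single shared heap, and reports only candidate roots with a summed path cost instead of building and ranking the answer trees. The function name, node names, and edge weights are hypothetical.

```python
import heapq


def backward_expanding_search(edges, keyword_sets):
    """edges: dict (u, v) -> weight for a directed edge u -> v.
    keyword_sets: list of sets of nodes, one set S_i per keyword.

    Yields (root, cost) for each node that has a forward path to some node
    in every S_i; cost is the sum of the shortest path lengths found.
    """
    # Expanding backward from v means visiting every u with an edge u -> v.
    reverse = {}
    for (u, v), weight in edges.items():
        reverse.setdefault(v, []).append((u, weight))

    n = len(keyword_sets)
    dist = [dict() for _ in range(n)]  # settled distances, one map per keyword
    heap = []
    for i, matches in enumerate(keyword_sets):
        for node in matches:
            heapq.heappush(heap, (0.0, i, node))

    while heap:
        d, i, node = heapq.heappop(heap)
        if node in dist[i]:
            continue  # already settled for keyword i
        dist[i][node] = d
        # The node was just reached by search i; emit it once all searches have it.
        if all(node in dist[j] for j in range(n)):
            yield node, sum(dist[j][node] for j in range(n))
        for pred, weight in reverse.get(node, []):
            if pred not in dist[i]:
                heapq.heappush(heap, (d + weight, i, pred))


# Paper p1 is written by authors a1 and a2 through writes tuples w1 and w2.
# Forward edges cost 1; backward edges use assumed indegree-based weights.
edges = {
    ("w1", "a1"): 1, ("w1", "p1"): 1, ("w2", "a2"): 1, ("w2", "p1"): 1,
    ("a1", "w1"): 1, ("a2", "w2"): 1, ("p1", "w1"): 2, ("p1", "w2"): 2,
}
keyword_sets = [{"a1"}, {"a2"}]
results = list(backward_expanding_search(edges, keyword_sets))
```

In this toy graph the first root emitted is the paper node `p1`, which connects both authors. Roots appear in order of discovery rather than strictly by total cost, so a ranking step would still be needed afterwards.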

  12. Finding Answer Trees • Backward Expanding Search • Intuition: travel backwards from keyword nodes until you hit a common node • Example query: “sudarshan roy” • [Figure: the paper node “MultiQuery Optimization” is linked through writes tuples to the author nodes Sudarshan and Prasan Roy]

  13. References • Keyword Searching and Browsing in Databases using BANKS • Keyword Searching and Browsing in Databases using BANKS(PPT)

  14. Thank You
