Improving Web Search Results Using Affinity Graph

Improving Web Search Results Using Affinity Graph Advisor : Dr. Hsu Presenter : Jia-Hao Yang Author :Benyu Zhang , Hua Li , Yi Liu , Wensi Xi , Weiguo Fan SIGIR

Outline • Motivation • Objective • Definition • Methods (Affinity Ranking) • Experiments • Conclusion • Opinion SIGIR

Motivation • situation • Many of the queries are ambiguous. • the user’s information needs are unknown. • Ex : “足球” , 是只想要足球還是要找足球賽 • In traditional, precision and recall are two metrics, but these didn’t consider the content of documents. • Hyperlink SIGIR

Objective • Two metrics, diversity and information richness, have been proposed to improve this problem. • Re-ranking the top search results to satisfy the user’s information needs. SIGIR

Definition • Diversity measures the variety of topics in a group of documents. • Information richness measures how many different topics a single document contains. SIGIR

Methods • AG : According to vector space model, each document can be represented , • If we consider documents as nodes, the document collection can be modeled as a graph by generating the link between documents. d2 d3 d1 d4 SIGIR d5 d6

Methods(cont.) • Information richness : • 1st • 2nd SIGIR

Methods(cont.) • Diversity penalty : • 1st : • 2nd • 3rd , • 4th • 5th 2nd • Re-ranking : • The score-combination scheme uses a linear combination of two parts: • The rank-combination scheme of re-ranking uses a linear combination of the ranks based on full-text search and Affinity Ranking : SIGIR

Experiments (In Yahoo & ODP) • Affinity Ranking vs. K-Means Clustering SIGIR

Experiments (cont.) SIGIR

Experiments (In Newsgroup) • Improve in Top 10 Search Results : • As the top 10 search results always receive the most attention of end-users, we show how Affinity Ranking affects the top 10 search results from the newsgroup data set. SIGIR

Experiments (cont.) • Improve within Top 50 Search Results SIGIR

Experiments (cont.) SIGIR

Experiments (α & β) SIGIR

A Case Study • Outlook print error : SIGIR

Conclusion • This paper proposed two new metrics, diversity and information richness, and a novel ranking scheme, Affinity Ranking, to measure the search performance. • By presenting wider topic coverage and more highly informative results in each topic in the top results, this method can effectively improve the search performance. SIGIR

Opinion • Future work : scaling the AR computation, to the Web scale. SIGIR

Improving Web Search Results Using Affinity Graph

Improving Web Search Results Using Affinity Graph

Presentation Transcript

Clustering Web Search Results

Graph Algorithms Using Depth First Search

Clustering Web Search Results

Graph Substructure Search

Improving Search

Improving Protein-Ligand Binding Affinity Prediction using Random Forest

Improving the Quality of Visual Web Browsing by Using Weighted Graph Drawing

Graph Search Methods

Improving web image search results using query-relative classifiers

Web Page Clustering using Heuristic Search in the Web Graph

Improving Query Results using Answer Corroboration

Improving Web Search Results Using Affinity Graph

Clustering Search Results Using PLSA

Improving Error Discovery using Guided Search

Clustering Personalized Web Search Results

Structural Web Search Using a Graph-Based Discovery System

Optimized Graph Search Using Multi-Level Graph Clustering

Improving full-text search results on dúchas.ie using language technology

Using Web Search Methods Refining Results

Graph Algorithms Using Depth First Search

Clustering Search Results Using PLSA

Graph Algorithms Using Depth First Search