180 likes | 295 Views
Improving Web Search Results Using Affinity Graph. Advisor : Dr. Hsu Presenter : Jia-Hao Yang Author :Benyu Zhang , Hua Li , Yi Liu , Wensi Xi , Weiguo Fan. Outline. Motivation Objective Definition Methods (Affinity Ranking) Experiments Conclusion Opinion. Motivation.
E N D
Improving Web Search Results Using Affinity Graph Advisor : Dr. Hsu Presenter : Jia-Hao Yang Author :Benyu Zhang , Hua Li , Yi Liu , Wensi Xi , Weiguo Fan SIGIR
Outline • Motivation • Objective • Definition • Methods (Affinity Ranking) • Experiments • Conclusion • Opinion SIGIR
Motivation • situation • Many of the queries are ambiguous. • the user’s information needs are unknown. • Ex : “足球” , 是只想要足球還是要找足球賽 • In traditional, precision and recall are two metrics, but these didn’t consider the content of documents. • Hyperlink SIGIR
Objective • Two metrics, diversity and information richness, have been proposed to improve this problem. • Re-ranking the top search results to satisfy the user’s information needs. SIGIR
Definition • Diversity measures the variety of topics in a group of documents. • Information richness measures how many different topics a single document contains. SIGIR
Methods • AG : According to vector space model, each document can be represented , • If we consider documents as nodes, the document collection can be modeled as a graph by generating the link between documents. d2 d3 d1 d4 SIGIR d5 d6
Methods(cont.) • Information richness : • 1st • 2nd SIGIR
Methods(cont.) • Diversity penalty : • 1st : • 2nd • 3rd , • 4th • 5th 2nd • Re-ranking : • The score-combination scheme uses a linear combination of two parts: • The rank-combination scheme of re-ranking uses a linear combination of the ranks based on full-text search and Affinity Ranking : SIGIR
Experiments (In Yahoo & ODP) • Affinity Ranking vs. K-Means Clustering SIGIR
Experiments (cont.) SIGIR
Experiments (cont.) SIGIR
Experiments (In Newsgroup) • Improve in Top 10 Search Results : • As the top 10 search results always receive the most attention of end-users, we show how Affinity Ranking affects the top 10 search results from the newsgroup data set. SIGIR
Experiments (cont.) • Improve within Top 50 Search Results SIGIR
Experiments (cont.) SIGIR
Experiments (α & β) SIGIR
A Case Study • Outlook print error : SIGIR
Conclusion • This paper proposed two new metrics, diversity and information richness, and a novel ranking scheme, Affinity Ranking, to measure the search performance. • By presenting wider topic coverage and more highly informative results in each topic in the top results, this method can effectively improve the search performance. SIGIR
Opinion • Future work : scaling the AR computation, to the Web scale. SIGIR