250 likes | 442 Views
Citation Recommendation. By: Fattane Zarrinkalam Supervisor: Dr. Mohsen Kahani. Web Technology Laboratory Ferdowsi University of Mashhad. Outline. Introduction Current Approaches Evaluation Methods References. Introduction.
E N D
Citation Recommendation By: FattaneZarrinkalamSupervisor: Dr. MohsenKahani Web Technology Laboratory Ferdowsi University of Mashhad
Outline • Introduction • Current Approaches • Evaluation Methods • References
Introduction • When starting a work in a new research topic or brainstorming for novel ideas, a researcher have to be well aware of most recent improvement in the topic. • Search for related work is an important part of writing papers • Substantial effort is wasted in rediscover ideas
Introduction (cont.) • When papers are written, many times the author wants to make some citations at a place but he is not sure which papers to cite. • the number of research paper published is exponentially growing. • This filtering process is generally tedious and time consuming.
Introduction (cont.) • Two common ways to find reference papers are: • search documents on search engines such as Google. • trace the cited references by starting with a small number of initial papers (seed-papers).
Introduction (cont.) • We wish to have a recommendation system which can recommend Citations for papers. • the user has already written a few pages about the topic, and is able to submit this document to the search system as the query. • the user wants documents that the query document might cite.
recommender systems • recommender systems emerged as an independent research area in the mid-1990s • Examples of such applications include recommending books, CDs, and other products at Amazon.com, movies by MovieLens and so on
recommendation Techniques • The Collaborative Filtering Approach (CF) • Content-based Recommendation • Hybrid Approach
Different kinds of works in citation recommendation: • Works that can only recommend papers • Works that can recommend papers for a specific position
recommender system (1) • map the citation graph onto a collaborative filtering ratings matrix. • Co-Citation Matching
Content-based Recommendation • recommend items based on the contents of the items a user has experienced before. • Text-based Analysis • These approaches use NLP and text mining methods to find papers that are semantically similar to the input paper
recommender system (2) • candidate set : • the system retrieves the top 100 most similar papers to the query document and adds them to R (base set). • all papers cited by any paper in R are added to R. • Rank the candidate set
recommender system (3) • Using a weighted sum of feature scores: • Features: • Similar terms (Tf-Idf) • Citation-count • Author-h-index • Venue-citation-count • Cited using similar terms • Similar topics • Learn the feature weights
recommender systems (5) • Candidate set • D= document corpus • {D Э d2 | d2= global context + a set of in-link context} • LC100{outlink context to c*}+G1000{abstract +title to d1} • ranking
recommender systems (6) • Input: • a query manuscript without citation placeholders • Output: • where citation are needed • a list of candidate article to be cited • Finding citation context: • Divide the query manuscript into sentences- overlapping window of 100 word • Extract citation context of corpus • Language model • n-gram • Contextual similarity • Topical relevance
recommender systems (7) • Multi-class SVM classifier • Training and test data • Training: • Feature set: local context, global context, similarity features • Input: citing paper • Output: label of cited paper
Hybrid Approach • Composed of two independent module: • Content-base filtering • Collaborative filtering
recommender system (8) • The CBF module uses the text of the active paper as input and • the CF module uses the citations from the active paper as input.
Evaluation • Automatic • a particular paper from the collection as a query and its citations as the relevant documents. • Metrics: recall, precision, rank , coverage , co-cited probability, • it is circular; system is attempting to improve the citing ability of authors, but evaluate with the papers that authors actually cite. • System Might discover citations that are more relevant than the one held out. • Such citations may have not been included in the paper’s references list because of limits on space or because they overlapped with other references, possibly the one left out.
Evaluation (cont.) • Manual • authors of papers rate the relevance of citations recommended for a paper they had written. • A full manual evaluation of retrieval accuracy was not possible
References • He, Q., Pei, J., Kifer, D., Mitra, P., Giles, C.L., 2010, Context-aware Citation Recommendation, in Proceedings of the 19th International World Wide Web Conference (WWW), pp. 421–430. • Tang, J., Zhang, J., 2009, A Discriminative Approach to Topic-Based Citation Recommendations PAKDD'09. • Gipp, B., Beel, J., Hentschel, C., 2009, Scienstein: A Research Paper Recommender System, in Proceedings of the International Conference on Emerging Trends in Computing (ICETiC’09), pp. 309-315, January 2009. • Ritchie, A., 2008, Citation context analysis for information retrieval, PhD thesis, University of Cambridge • Strohman, T., Croft, W. B., Jensen, D., 2007, Recommending citations for academic papers, in Proceedings of the 30th Annual ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)’, ACM Press, pp. 705–706. • McNee, S., Albert, I., Cosley, D., Gopalkrishnan, P., Lam, S., Rashid, A., Konstan, J., Ried, J., 2002, On the Recommending of Citations for Research Papers. CSCW'02.
References • Schafer, B., Frankowski, D., Herlocker, J., Sen, S., 2007, Collaborative filtering recommender systems, In Brusilovsky, P., Kobsa, A., Nejdl, W., eds., The Adaptive Web: Methods and Strategies of Web Personalization. Lecture Notes in Computer Science, Vol. 4321, Berlin Heidelberg New York, Springer-Verlag. • Gori, M., Pucci, A., 2006, Research Paper Recommender Systems: A Random-Walk Based Approach, in Proceedings of the 2006 International Conference on Web Intelligence, pp. 778-781. • Kessler, M. M., 1963, Bibliographic coupling between scientific papers, American Documentation 14(1), 10–25. • Small, H., 1973, Co-citation in the scientific literature: A new measurement of the relationship between two documents, Journal of the American Society of Information Science 24(4), 265–269.