By: Fattane Zarrinkalam Supervisor: Dr. Mohsen Kahani

Citation Recommendation By: Fattane Zarrinkalam Supervisor: Dr. Mohsen Kahani Web Technology Laboratory Ferdowsi University of Mashhad

Outline • Introduction • Current Approaches • Evaluation Methods • References

Introduction • When starting work on a new research topic or brainstorming novel ideas, a researcher has to be well aware of the most recent advances in the topic. • Searching for related work is an important part of writing papers. • Substantial effort is wasted in rediscovering ideas.

Introduction (cont.) • When writing a paper, an author often wants to add a citation at some point in the text but is not sure which papers to cite. • The number of published research papers is growing exponentially. • Filtering this literature for relevant papers is generally tedious and time-consuming.

Introduction (cont.) • Two common ways to find reference papers are: • Searching for documents with search engines such as Google. • Tracing the cited references, starting from a small number of initial papers (seed papers).

Introduction (cont.) • We wish to have a recommendation system that can recommend citations for papers. • The user has already written a few pages about the topic and can submit this document to the search system as the query. • The user wants documents that the query document might cite.

Recommender Systems • Recommender systems emerged as an independent research area in the mid-1990s. • Examples of such applications include recommending books, CDs, and other products at Amazon.com, movies by MovieLens, and so on.

Recommendation Techniques • Collaborative Filtering (CF) • Content-based Recommendation • Hybrid Approach

Different kinds of work in citation recommendation: • Work that recommends papers only for the manuscript as a whole • Work that recommends papers for a specific position in the text

Collaborative Filtering

Recommender system (1) • Map the citation graph onto a collaborative filtering ratings matrix. • Co-citation matching
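The mapping on this slide can be illustrated with a minimal sketch. It assumes each citing paper plays the role of a "user" and each cited paper the role of an "item", with a binary rating of 1 for "cites". The toy graph, the function names, and the scoring rule (sum of co-citation counts) are illustrative assumptions, not the method of any particular system.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy citation graph: citing paper -> set of cited papers.
citations = {
    "p1": {"a", "b"},
    "p2": {"a", "b", "c"},
    "p3": {"b", "c"},
}

def ratings_matrix(citations):
    """Binary ratings: rows are citing papers (users), columns are cited papers (items)."""
    items = sorted({c for cited in citations.values() for c in cited})
    rows = {p: [1 if i in cited else 0 for i in items] for p, cited in citations.items()}
    return rows, items

def cocitation_counts(citations):
    """Count how often each pair of papers is cited together by the same paper."""
    counts = defaultdict(int)
    for cited in citations.values():
        for x, y in combinations(sorted(cited), 2):
            counts[(x, y)] += 1
    return counts

def recommend(partial_refs, citations, k=3):
    """Rank papers not yet cited by their co-citation count with the given references."""
    scores = defaultdict(int)
    for (x, y), n in cocitation_counts(citations).items():
        if x in partial_refs and y not in partial_refs:
            scores[y] += n
        elif y in partial_refs and x not in partial_refs:
            scores[x] += n
    # Sort by descending score, breaking ties by paper id for determinism.
    return sorted(scores, key=lambda p: (-scores[p], p))[:k]
```

For a manuscript that so far cites only "a", `recommend({"a"}, citations)` ranks "b" ahead of "c", because "b" is co-cited with "a" more often in the toy graph.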

Content-based Recommendation • Recommend items based on the contents of the items a user has experienced before. • Text-based analysis • These approaches use NLP and text-mining methods to find papers that are semantically similar to the input paper.
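One common way to measure textual similarity between papers is TF-IDF weighting with cosine similarity. The sketch below is a minimal standard-library illustration; the smoothed IDF formula, the whitespace tokenisation, and the function names are assumptions chosen for brevity, not what any cited system necessarily uses.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per tokenised document."""
    n = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed IDF (an assumption): log((1 + n) / (1 + df)) + 1.
        vectors.append({t: c * (math.log((1 + n) / (1 + df[t])) + 1) for t, c in tf.items()})
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors; 0.0 if either vector is empty."""
    dot = sum(w * v[t] for t, w in u.items() if t in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

# Hypothetical toy corpus, tokenised by whitespace.
docs = [
    "citation recommendation with topic models".split(),
    "collaborative filtering for movie ratings".split(),
    "topic models for citation analysis".split(),
]
vectors = tfidf_vectors(docs)
sim_to_filtering = cosine(vectors[0], vectors[1])
sim_to_topics = cosine(vectors[0], vectors[2])
```

Here the first document shares no terms with the second, so `sim_to_filtering` is zero, while it shares three terms with the third, so `sim_to_topics` is positive.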

Recommender system (2) • Candidate set: • The system retrieves the 100 papers most similar to the query document and adds them to R (the base set). • All papers cited by any paper in R are added to R. • Rank the candidate set
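The two candidate-set steps on this slide can be sketched as follows. The sketch assumes the similarity scores to the query have already been computed and that the expansion is applied once to the base set; the function name, the argument names, and the toy data are hypothetical.

```python
def build_candidate_set(similarity, references, base_size=100):
    """Build the candidate set R for one query document.

    similarity: dict paper_id -> similarity to the query (assumed precomputed).
    references: dict paper_id -> set of paper_ids that the paper cites.
    """
    # Step 1: the base set is the base_size papers most similar to the query.
    base = sorted(similarity, key=lambda p: (-similarity[p], p))[:base_size]
    candidates = set(base)
    # Step 2: add every paper cited by a paper in the base set (single expansion pass).
    for paper in base:
        candidates |= references.get(paper, set())
    return candidates

# Hypothetical toy data, with base_size reduced to 2 to keep the example small.
similarity = {"a": 0.9, "b": 0.5, "c": 0.1}
references = {"a": {"x"}, "b": {"y", "a"}, "c": {"z"}}
candidates = build_candidate_set(similarity, references, base_size=2)
```

With these inputs the base set is "a" and "b", and the expansion adds "x" and "y"; "c" and "z" stay out because "c" did not make the base set.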

Recommender system (3) • Using a weighted sum of feature scores: • Features: • Similar terms (TF-IDF) • Citation count • Author h-index • Venue citation count • Cited using similar terms • Similar topics • Learn the feature weights
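Ranking by a weighted sum of feature scores can be sketched as below. The feature names, the weight values, and the assumption that every feature is already normalised to the range 0 to 1 are placeholders for illustration; in the approach on the slide the weights are learned rather than set by hand, and that learning step is not shown here.

```python
def score(features, weights):
    """Weighted sum of feature scores; a feature missing from a candidate counts as 0."""
    return sum(w * features.get(name, 0.0) for name, w in weights.items())

def rank(candidates, weights):
    """Order candidate ids by descending score, breaking ties by id."""
    return sorted(candidates, key=lambda pid: (-score(candidates[pid], weights), pid))

# Hypothetical hand-set weights (the slide's systems learn these from data).
weights = {"text_similarity": 0.5, "citation_count": 0.3, "author_h_index": 0.2}

# Hypothetical candidates with features normalised to [0, 1].
candidates = {
    "c1": {"text_similarity": 0.9, "citation_count": 0.1, "author_h_index": 0.2},
    "c2": {"text_similarity": 0.4, "citation_count": 0.9, "author_h_index": 0.5},
}
ranking = rank(candidates, weights)
```

The example shows why the weights matter: "c1" is textually closer to the query, but "c2" ranks first under these weights because of its citation count and author h-index.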

Recommender system (4)

Recommender systems (5) • Candidate set • D = document corpus • {d2 ∈ D | d2 = global context + a set of in-link contexts} • LC100 {out-link context to c*} + G1000 {abstract + title to d1} • Ranking

Recommender systems (6) • Input: • A query manuscript without citation placeholders • Output: • Where citations are needed • A list of candidate articles to be cited • Finding citation contexts: • Divide the query manuscript into sentences; use overlapping windows of 100 words • Extract the citation contexts of the corpus • Language model • n-gram • Contextual similarity • Topical relevance
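Splitting a manuscript into overlapping 100-word windows can be sketched as below. The slide gives only the window size; the step of 50 words between window starts, the whitespace tokenisation, and the function name are assumptions.

```python
def sliding_windows(text, size=100, step=50):
    """Split text into overlapping windows of `size` words, advancing `step` words each time."""
    if not 1 <= step <= size:
        raise ValueError("step must be between 1 and size so that no words are skipped")
    words = text.split()
    windows = []
    for start in range(0, len(words), step):
        windows.append(words[start:start + size])
        # Stop once a window has reached the end of the text.
        if start + size >= len(words):
            break
    return windows

# Small demonstration with 10 placeholder words, window size 4, step 2.
demo = sliding_windows(" ".join(f"w{i}" for i in range(10)), size=4, step=2)
```

Each window would then be treated as a candidate citation context and compared against the citation contexts extracted from the corpus.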

Recommender systems (7) • Multi-class SVM classifier • Training and test data • Training: • Feature set: local context, global context, similarity features • Input: citing paper • Output: label of cited paper

Hybrid Approach • Composed of two independent modules: • Content-based filtering (CBF) • Collaborative filtering (CF)

Recommender system (8) • The CBF module uses the text of the active paper as input. • The CF module uses the citations from the active paper as input.

Evaluation • Automatic • Use a particular paper from the collection as the query and its citations as the relevant documents. • Metrics: recall, precision, rank, coverage, co-cited probability • It is circular: the system attempts to improve on the citing ability of authors, but is evaluated against the papers that authors actually cite. • The system might discover citations that are more relevant than the one held out. • Such citations may not have been included in the paper's reference list because of limits on space or because they overlapped with other references, possibly the one left out.
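Two of the listed metrics, precision and recall, can be computed over the top k recommendations as in the sketch below. It assumes the paper's actual reference list is the set of relevant documents, as in the automatic protocol above; the function name and the toy data are hypothetical.

```python
def precision_recall_at_k(ranked, relevant, k):
    """Precision and recall of the top-k recommendations against the held-out citations."""
    hits = sum(1 for paper in ranked[:k] if paper in relevant)
    precision = hits / k if k else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

# Hypothetical example: the query paper actually cites a, b and c.
ranked = ["a", "x", "b", "y"]
relevant = {"a", "b", "c"}
p_at_2, r_at_2 = precision_recall_at_k(ranked, relevant, k=2)
p_at_4, r_at_4 = precision_recall_at_k(ranked, relevant, k=4)
```

Note that "x" and "y" count as errors here even if they would have been good citations, which is exactly the circularity the slide points out.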

Evaluation (cont.) • Manual • Authors rate the relevance of the citations recommended for a paper they have written. • A full manual evaluation of retrieval accuracy was not possible.

References • He, Q., Pei, J., Kifer, D., Mitra, P., Giles, C.L., 2010, Context-aware Citation Recommendation, in Proceedings of the 19th International World Wide Web Conference (WWW), pp. 421–430. • Tang, J., Zhang, J., 2009, A Discriminative Approach to Topic-Based Citation Recommendation, PAKDD'09. • Gipp, B., Beel, J., Hentschel, C., 2009, Scienstein: A Research Paper Recommender System, in Proceedings of the International Conference on Emerging Trends in Computing (ICETiC'09), pp. 309–315. • Ritchie, A., 2008, Citation context analysis for information retrieval, PhD thesis, University of Cambridge. • Strohman, T., Croft, W. B., Jensen, D., 2007, Recommending citations for academic papers, in Proceedings of the 30th Annual ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), ACM Press, pp. 705–706. • McNee, S., Albert, I., Cosley, D., Gopalkrishnan, P., Lam, S., Rashid, A., Konstan, J., Riedl, J., 2002, On the Recommending of Citations for Research Papers, CSCW'02.

References (cont.) • Schafer, B., Frankowski, D., Herlocker, J., Sen, S., 2007, Collaborative filtering recommender systems, in Brusilovsky, P., Kobsa, A., Nejdl, W., eds., The Adaptive Web: Methods and Strategies of Web Personalization, Lecture Notes in Computer Science, Vol. 4321, Springer-Verlag, Berlin Heidelberg New York. • Gori, M., Pucci, A., 2006, Research Paper Recommender Systems: A Random-Walk Based Approach, in Proceedings of the 2006 International Conference on Web Intelligence, pp. 778–781. • Kessler, M. M., 1963, Bibliographic coupling between scientific papers, American Documentation 14(1), 10–25. • Small, H., 1973, Co-citation in the scientific literature: A new measurement of the relationship between two documents, Journal of the American Society for Information Science 24(4), 265–269.
