A Probabilistic Approach to Personalized Tag Recommendation

A Probabilistic Approach to Personalized Tag Recommendation MeiqunHu, Ee-Peng Lim and Jing Jiang School of Information Systems Singapore Management University

Image credit @ logorunner.com Social Tagging • Social tagging allows users to annotate resources with tags. • organize • tags are keywords, serving as (personalized) index terms that group relevant resources • store • online storage gives mobility and convenience to access • share • published bookmarks can be viewed by other users • explore • to leverage collective wisdom to find interesting resources

Personalized Tag Recommendation • Personalized tag recommendation aims to recommend tags to the query user for annotating the query resource. • Recommendation eases the tagging process. ?

Why Personalize Recommendations? • Tag recommendation should be personalized. • users exhibit individualized choice of tag terms • e.g., language preference • personalized index for personal consumption and consistency

Problem Formulation and A Basic Method • Problem Formulation: p(t|rq,uq) • A Basic Method: freq-r, to recommend top frequent tags • assuming that the more people have used this tag, the more likely it will be used again • current state-of-the-art in many social tagging sites, e.g., • fails to personalize the recommendations for the query user

Three Scenarios Scenario 1: ‘foto’ is an infrequent tag for the resource. Scenario 2: ‘foto’ is has not been used for the resource, but has been used by the user for annotating other resources in the past. Scenario 3: ‘foto’ has not been used for the resource but has been used by others when annotating other resources.

Collaborative Filtering Method • A Method based on Collaborative Filtering: knn, to select top k-nearest neighbors and recommend tags used by these neighbors for annotating the resource • assuming that there are like-minded users who have annotated the same resource • classic collaborative filtering, without ratings • addresses scenario 1, but • fails scenario 2,3

Personomy Translation Method • To translate the resources tags to the user’s personal tags (trans-u) • to learn p(‘foto’|uq, ‘photo’) • addresses scenario 2, but • fails scenario 3, if uq has never used ‘foto’

To Address Scenario 3 borrow translation

Personomy Translation A Framework Measuring User Similarity A Probabilistic Framework

Borrowing Translations • To learn p(‘foto’|u,‘photo’) and sim(u,uq) borrow translation

Personomy Translation • To learn p(‘foto’|uq,‘photo’) [Wetzker et al. 2009]

Measuring Similarity between Users • sim(u,uq) • assuming that users are similar if they perform similar translations • users are profiled by sets of translation probabilities, e.g., p(‘foto’|u,‘photo’),…, p(‘image’|u,‘photo’) p(‘netz’|u,‘web’),…, p(‘internet’|u,‘web’) • we adopt distributional divergence to measure (dis)similarity between users • JS-divergence, L1-norm, such as in [Lee 1997]

Distributional Divergence between Users sim(‘photo’)(u,uq) S  sim(u,uq) … sim(‘web’)(u,uq)

Remark on the 3 Scenarios • This framework is able to address all three scenarios • addresses scenario 1 by allowing self-translation, e.g., p(‘photo’|u,‘photo’) • addresses scenario 2 by allowing self-similarity, e.g., sim(uq,uq) • addresses scenario 3 by enabling borrowed translations

Data Collection Experimental Setup Recommendation Performance Experiments

Dataset from BibSonomy Note: time order: train  validation  test users in test set must have been appeared in validation set.

Experimental Setup • Methods to compare • trans-n1, trans-n2 • k: {5,10,20,50,100,200,300,400,500} • js-divergence, l1-norm • b: {1,2,4,8} for js-divergence b: {1,2,4,8,12,16} for l1-norm • trans-u1, trans-u2 • knn-ur, knn-ut • k: {5,10,20,50,100,200,300,400,500} • interpolating with freq-r • Evaluation metric • pr-curve at top 5 • macro-average for users • Parameter optimization • macro-average f1@5 • global vs. individual settings

Recommendation PerformanceGlobal Setting

Recommendation PerformanceIndividual Setting

Recommendation Case Study scenario 3 tags

Conclusion • We propose a probabilistic framework for solving the personalized tag recommendation task, which incorporate personomy translation and borrowing translation from neighbors. • We devise to use distributional divergence to measure similarity between users. Users are similar if they exhibit similar translation behavior. • We find the proposed methods give superior performance than translation by the query user only and classic collaborative filtering.

A Probabilistic Approach to Personalized Tag Recommendation

A Probabilistic Approach to Personalized Tag Recommendation

Presentation Transcript

Music Recommendation A Data Mining Approach

A Probabilistic Approach to Logic Equivalence Checking

Personalized marketing using recommendation systems Peter A. Csikos

A Human-Centered Computing Framework to Enable Personalized News Video Recommendation

A Probabilistic Approach to Personalized Tag Recommendation

Tag Data and Personalized Information Retrieval

A Sequential Recommendation Approach for Interactive Personalized Story Generation

A probabilistic approach to language structure

A personalized recommendation procedure for Internet shopping support

Probabilistic Question Recommendation for Question Answering Communities

A Probabilistic Approach to Collaborative Multi-robot Localization

“A Systems Approach to Personalized Medicine”

A Personalized Learning Object Approach to Teaching

A probabilistic XML approach to data integration

A probabilistic approach to microRNA-target binding

A Discriminative Approach to Topic-Based Citation Recommendation

A probabilistic approach to exploring global dynamics

A Probabilistic Approach to Semantic Representation

Mining Tag Semantics for Social Tag Recommendation

A Probabilistic Approach to Vieta’s Formula

Legal Experience With A Personalized Approach