Modeling user multiple interests by an improved GCS approach

Modeling user multiple interests by an improved GCS approach Advisor : Dr. Hsu Graduate : Kuo-min Wang Authors :Wu Lihua*, Liu Lu, Li Jing, Li Zongyong 2005 Expert Systems with Applications .

Outline • Motivation • Objective • Introduction • Background and related work • IGCS approach • Experiments • Conclusion • Personal Opinions

Motivate • User interest profile is the crucial component of most personalized recommender systems. • The diversity and time-dependent evolving nature of user interests are creating difficulties in constructing and maintaining a sound user profile.

Objective • This paper presents a simple but effective model to address this problem.

Introduction • Information overload problem • Users have to spend much time and effort on information search and selection before they can find out what they want. • Personalized recommender system • Summarizes and generalizes a user’s interests, to filter away irrelevant information • Only information most catering for user’s interests is advised to him/her.

Introduction (cont.) • Statistical keyword analysis • Represents user profile as a vector of keyword and its weight pairs. The keyword in user profile can be calculated from the TF-IDF • Drawback • Combines all interests of a user into a single vector. It is insufficient and ineffective to generalize all kinds of interest by a single vector. • This method assumes that user interests change only infrequently, so the user profile is adapted at a fixed, predetermined pace. • Can not change user’s interest continuously and rapidly.

Introduction (cont.) • We present a neural network-based approach for automatically learning and maintaining user multiple interests. • The proposed model utilizes improved growing cell structures (IGCS) to group user-interested information in to separate topic categories.

Background and related work • Modeling of user multiple interests • Neural networks for information recommendation

Modeling of user multiple interests • Amalthaea (Moukas, 1997) & Fab (Balabanovic & Shoham, 1997) • Topic information collection and answered for user interest maintenance. • Both systems updated user profile by an evolution process of agents based on user’s relevant feedback. • A high user involvement is required for updating the profile in Amathaea.

Modeling of user multiple interests (cont.) • Cetintemel, Franklin, and Giles (2000) • Widyantoro, Ioerger, and Yen (2001) • Combined both the long-term interest and short-term interest of a user into the interest category representation by a weighted average scheme. • Drawback • They also cost a great deal of user’s effort since they need user’s explicitly relevant feedback to maintain user profile.

Neural networks for information recommendation (cont.) • Adaptive resonance associative map(ARAM)(Tan & Teo, 1998) • A user profile was modeled by first classing information into categories by the ARAM network. • Then each of the information categories characterized by semantic features was associated with a relevance factor indicated by the user to acquire the user profile.

Neural networks for information recommendation (cont.) • Self-organizing map • Drawback • Cannot adapt to new input pattern after the learning period • User has to define a number of parameters including learning rates and neighborhood size which decay over time in the learning process • GCS (Fritzke, 1994)

IGCS approach • Learning user unchanged interests • adding new interests to current profile • removing outdated interests from current profile

The properties of the IGCS • Category representation of user profile • user profile is represented by a collection of interest clusters recognized by the neural network. • Learning user interests in an incremental and real-time fashion • IGCS approach is incremental since it receives new case individually and modifies the user profile accordingly • Robust learning and generalization inference ability • Properly learn user interests even with partial or inconsistent information • Visualization clustering analysis

Learning user unchanged interests (1) increment input counter, N, which numbers the input vector: (2) select the best matching unit busing Euclidean distance between input x and the weight vectors of all output cells.

Learning user unchanged interests (cont.) (3) adapt weight vector w for b and its direct topological neighbors Nb[0,05 0.1] forεb[0.002 0.01] forεn (4) update the winning counter for the winner cell b (5) compute the difference between the input counter , N, and the winning counter, λ

Adding new interests • The new cell is connected to the first two winners for x and, if there is any, to their common direct neighbor too.

Removing outdated interests • Deletion of a cell • If the difference, rjmaintained for cell j exceeds a removal threshold, rd, namely then the cell j and all its adjacent edges are removed.

Experiments • Data collection • RCV1-v2(Lewis, Yang, Rose, & Li, 2004) is a new benchmark collection for text categorization. • Three category sets • Topic(103), Industries(354), and Regions(366) • contains 804,414 documents • We randomly selected 20 categories from the 103 topic categories. • Training set comprising 1500 documents • Testing set comprising the rest 500 documents

Experiments • Performance metrics • F1 measure • Intra-cluster distance • Inter-cluster distance

Experiments

Conclusion • we designed a clustering model based on the growing cell structures neural network to learn user’s multiple interests and their changes over time. • The proposed model can adaptively find a suitable category number for user interests • And capture the time-dependent evolving nature of the user interests.

Conclusion • Further research • Investigation on the online adaptability of IGCS model • Further experiments should be made on some representative samples to see whether the selection of sample data does influence the performance of our model.

Personal Opinions • …

Modeling user multiple interests by an improved GCS approach

Modeling user multiple interests by an improved GCS approach

Presentation Transcript

Multiple Intelligence Approach

Multiple Question Approach

An Improved Approach To Updating Regulatory Dispersion Models

An End-User Demonstration Approach to Support Aspect-Oriented Modeling

User Modeling

User Modeling

Protein-Ligand Interaction Prediction: An Improved Chemogenomics Approach

Improved User Security

Modeling Approach

Improved Attacks on Multiple Encryption

User Modeling

Evidences of Multiple Populations in GCs

Modeling The User

User Modeling 1

User Modeling

Modeling Multiple Securities

An Improved Search Algorithm for Optimal Multiple-Sequence Alignment

Exploratory Testing – an improved approach

Modeling Approach

User Modeling 2007

User Modeling

User Modeling