Collaborative Data Analysis and Multi-Agent Systems Robert W. Thomas CSCE 824 15 APR 2013
Agenda • Problem Description • Existing Research Overview • Limitations of Existing Solutions • Future Research Suggestions
Problem Description • Information Overload • Divide and Conquer; Reconcile • Recommender Systems and Social Media • Content Filtering • Collaborative Filtering • Collaborative Data Analysis through Agents
Content Filtering • Recommendations based on items similar to what has been preferred previously
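As a rough illustration of the idea on this slide, the sketch below recommends the item whose features are closest to one the user already liked. The item names, the three-element feature vectors, and the helper `recommend_similar` are all hypothetical, and cosine similarity is just one possible way to measure "similar".

```python
import math

def cosine(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Hypothetical item features: [action, comedy, romance]
ITEM_FEATURES = {
    "film_a": [1.0, 0.0, 0.0],
    "film_b": [0.9, 0.1, 0.0],
    "film_c": [0.0, 0.2, 1.0],
}

def recommend_similar(liked, features, k=1):
    """Return the k items most similar to an item the user preferred before."""
    profile = features[liked]
    scored = [
        (cosine(profile, vec), name)
        for name, vec in features.items()
        if name != liked
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [name for _, name in scored[:k]]

print(recommend_similar("film_a", ITEM_FEATURES))
```

Note that no other user's ratings are involved here, which is what separates content filtering from collaborative filtering.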
Collaborative Filtering (CF) • Recommendations based on what others in a network prefer • Different Techniques • Memory-Based • Model-Based • Hybrid
Memory-Based CF • Similarity Computation • Prediction and Recommendation Computation • Top-N Recommendations
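Similarity Computation • Compares Users or Items • Correlation-Based (Pearson correlation) • Vector Cosine-Based
Notation: two users $u, v$; two items $i, j$; $r_{u,i}$ = rating of user $u$ on item $i$; $I$ = items both $u$ and $v$ have rated; $\bar{r}_u$ = avg rating of co-rated items of user $u$; $U$ = users who rated both $i$ and $j$; $\bar{r}_i$ = avg rating of item $i$ by those users; $R$ = $m \times n$ user-item matrix; $\vec{i}, \vec{j}$ = $m$-dimensional vectors corresponding to the $i$th and $j$th columns of $R$
User-based Pearson correlation: $$w_{u,v} = \frac{\sum_{i \in I} (r_{u,i} - \bar{r}_u)(r_{v,i} - \bar{r}_v)}{\sqrt{\sum_{i \in I} (r_{u,i} - \bar{r}_u)^2}\,\sqrt{\sum_{i \in I} (r_{v,i} - \bar{r}_v)^2}}$$
Item-based Pearson correlation: $$w_{i,j} = \frac{\sum_{u \in U} (r_{u,i} - \bar{r}_i)(r_{u,j} - \bar{r}_j)}{\sqrt{\sum_{u \in U} (r_{u,i} - \bar{r}_i)^2}\,\sqrt{\sum_{u \in U} (r_{u,j} - \bar{r}_j)^2}}$$
Vector cosine: $$w_{i,j} = \cos(\vec{i}, \vec{j}) = \frac{\vec{i} \cdot \vec{j}}{\lVert \vec{i} \rVert \, \lVert \vec{j} \rVert}$$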
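The two similarity measures named on this slide can be sketched in a few lines. This is a minimal illustration, not a reference implementation: the nested-dict layout of `RATINGS`, the user and item names, the choice to return 0.0 when there is too little overlap, and the choice to treat a missing rating as 0 in the cosine are all assumptions made for the example.

```python
import math

# Hypothetical ratings: user -> {item: rating}
RATINGS = {
    "alice": {"i1": 5, "i2": 3, "i3": 4},
    "bob":   {"i1": 4, "i2": 2, "i3": 3},
    "carol": {"i1": 1, "i2": 5},
}

def pearson_users(ratings, u, v):
    """Pearson correlation between users u and v over their co-rated items."""
    common = set(ratings[u]) & set(ratings[v])  # items both users rated
    if len(common) < 2:
        return 0.0  # assumption: too little overlap counts as no correlation
    mean_u = sum(ratings[u][i] for i in common) / len(common)
    mean_v = sum(ratings[v][i] for i in common) / len(common)
    num = sum((ratings[u][i] - mean_u) * (ratings[v][i] - mean_v) for i in common)
    den_u = math.sqrt(sum((ratings[u][i] - mean_u) ** 2 for i in common))
    den_v = math.sqrt(sum((ratings[v][i] - mean_v) ** 2 for i in common))
    if den_u == 0 or den_v == 0:
        return 0.0  # a user with constant ratings has no defined correlation
    return num / (den_u * den_v)

def cosine_items(ratings, i, j):
    """Cosine between the rating columns of items i and j (missing rating = 0)."""
    users = sorted(ratings)
    col_i = [ratings[u].get(i, 0.0) for u in users]
    col_j = [ratings[u].get(j, 0.0) for u in users]
    dot = sum(a * b for a, b in zip(col_i, col_j))
    norm_i = math.sqrt(sum(a * a for a in col_i))
    norm_j = math.sqrt(sum(b * b for b in col_j))
    return dot / (norm_i * norm_j) if norm_i and norm_j else 0.0

print(pearson_users(RATINGS, "alice", "bob"))
print(pearson_users(RATINGS, "alice", "carol"))
print(cosine_items(RATINGS, "i1", "i2"))
```

In this toy data alice and bob rate on different levels but in the same pattern, so their correlation is close to 1, while alice and carol disagree on both co-rated items and land close to -1.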
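Prediction and Recommendation Computation • Weighted Sum of Others’ Ratings • Simple Weighted Average
Weighted sum, prediction $P_{a,i}$ for active user $a$ on item $i$: $$P_{a,i} = \bar{r}_a + \frac{\sum_{u \in U} (r_{u,i} - \bar{r}_u)\, w_{a,u}}{\sum_{u \in U} \lvert w_{a,u} \rvert}$$ where $\bar{r}_a$, $\bar{r}_u$ = avg rating of user $a$, $u$; $w_{a,u}$ = weight between user $a$ and user $u$; $U$ = users who rated item $i$
Simple weighted average, prediction $P_{u,i}$ for user $u$ on item $i$: $$P_{u,i} = \frac{\sum_{n \in N} r_{u,n}\, w_{i,n}}{\sum_{n \in N} \lvert w_{i,n} \rvert}$$ where $N$ = all other rated items for user $u$; $w_{i,n}$ = weight between items $i$ and $n$; $r_{u,n}$ = rating for user $u$ on item $n$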
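Both prediction rules on this slide reduce to a normalized weighted sum, which the sketch below spells out. The data layout, the names `predict_weighted_sum` and `predict_weighted_average`, and the precomputed weight tables are assumptions for illustration; in practice the weights would come from a similarity computation such as Pearson correlation. Each user's mean is taken over all of that user's ratings, which is a simplification.

```python
def predict_weighted_sum(ratings, weights, a, item):
    """User-based rule: mean(a) + sum((r_ui - mean(u)) * w_au) / sum(|w_au|)."""
    def mean(user):
        values = ratings[user].values()
        return sum(values) / len(values)

    # Neighbours: other users who rated the item and have a weight with a.
    neighbours = [
        u for u in ratings
        if u != a and item in ratings[u] and u in weights.get(a, {})
    ]
    denom = sum(abs(weights[a][u]) for u in neighbours)
    if denom == 0:
        return mean(a)  # assumption: fall back to the user's own average
    num = sum((ratings[u][item] - mean(u)) * weights[a][u] for u in neighbours)
    return mean(a) + num / denom

def predict_weighted_average(user_ratings, item_weights, item):
    """Item-based rule: sum(r_un * w_in) / sum(|w_in|) over the user's rated items."""
    rated = [
        n for n in user_ratings
        if n != item and n in item_weights.get(item, {})
    ]
    denom = sum(abs(item_weights[item][n]) for n in rated)
    if denom == 0:
        return None  # no usable neighbours, so no prediction
    return sum(user_ratings[n] * item_weights[item][n] for n in rated) / denom

# Hypothetical data for the user-based rule.
RATINGS = {
    "a":  {"i1": 4, "i2": 2},
    "u1": {"i1": 5, "i2": 3, "i3": 5},
    "u2": {"i1": 1, "i2": 5, "i3": 2},
}
USER_WEIGHTS = {"a": {"u1": 1.0, "u2": -1.0}}
print(predict_weighted_sum(RATINGS, USER_WEIGHTS, "a", "i3"))

# Hypothetical data for the item-based rule.
ITEM_WEIGHTS = {"i3": {"i1": 0.5, "i2": 0.5}}
print(predict_weighted_average(RATINGS["a"], ITEM_WEIGHTS, "i3"))
```

Subtracting each neighbour's mean in the user-based rule compensates for users who rate consistently high or low. A negatively weighted neighbour who disliked the item pushes the prediction up, as `u2` does here.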
Top-N Recommendations • Item-Based • User-Based
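One common reading of user-based top-N is: take the k most similar users, count how often each item appears among them, and recommend the N most frequent items the active user does not already have. The sketch below follows that reading; the purchase sets, weights, and function name are hypothetical, and the slide itself does not fix a particular procedure.

```python
from collections import Counter

def top_n_user_based(purchases, weights, user, k=2, n=2):
    """Recommend the n items most frequent among the user's k nearest neighbours."""
    # Rank the other users by their similarity weight to the active user.
    neighbours = sorted(
        (v for v in purchases if v != user),
        key=lambda v: weights[user].get(v, 0.0),
        reverse=True,
    )[:k]
    # Count items the neighbours have that the active user does not.
    counts = Counter(
        item
        for v in neighbours
        for item in purchases[v]
        if item not in purchases[user]
    )
    return [item for item, _ in counts.most_common(n)]

# Hypothetical purchase history and precomputed user-user weights.
PURCHASES = {
    "a": {"x"},
    "b": {"x", "y", "z"},
    "c": {"x", "y"},
    "d": {"w"},
}
WEIGHTS = {"a": {"b": 0.9, "c": 0.8, "d": 0.1}}

print(top_n_user_based(PURCHASES, WEIGHTS, "a"))
```

An item-based variant would instead start from the items the user already has and rank candidates by their similarity to that set.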
Model-Based CF • Bayesian Belief Net • Clustering • Regression-Based • Markov Decision Process (MDP)-Based • Latent Semantic
Bayesian Belief Net • Bayesian logic – decision making and inferential statistics • Simple Bayesian • Memory-Based • Laplace Estimator to avoid a conditional probability of 0 • Tree Augmented naïve Bayes and naïve Bayes optimized by Extended Logistic Regression (ELR) • Require extended training periods to produce results beyond simple Bayesian and Pearson correlation
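The Laplace Estimator bullet can be made concrete with a small worked example. In a naïve Bayes product, a single conditional probability of 0 wipes out the whole score, so the estimator adds a pseudo-count to every outcome. The function name and the default pseudo-count of 1 are assumptions for this sketch.

```python
def laplace_prob(count_xy, count_y, num_values, alpha=1):
    """Smoothed estimate of P(X = x | Y = y).

    count_xy   -- observations with X = x and Y = y
    count_y    -- observations with Y = y
    num_values -- number of distinct values X can take
    alpha      -- pseudo-count added to every value (assumed to be 1 here)
    """
    return (count_xy + alpha) / (count_y + alpha * num_values)

# Unsmoothed, 0 matching observations out of 2 gives a probability of 0.
unsmoothed = 0 / 2
# Smoothed, with a binary feature: (0 + 1) / (2 + 2).
smoothed = laplace_prob(0, 2, 2)
print(unsmoothed, smoothed)
```

With enough data the pseudo-counts have little effect, so the correction matters mainly in the sparse settings typical of rating data.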
Clustering • Cluster: collection of similar objects, dissimilar to objects in other clusters • Pearson correlation can be used • Three Categories • Partitioning • Density-based • Hierarchical • Often an Intermediate Step
Regression-Based • Uses an approximation of the ratings to make predictions based on a regression model • Suited to cases where two rating vectors are far apart in Euclidean distance yet score very high on similarity measures (vector cosine or Pearson correlation)
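A small numeric case shows the situation this slide describes: two rating vectors that are far apart in Euclidean distance but perfectly correlated. The vectors are made up, and the single-variable least-squares fit is a deliberately simple stand-in for a regression model, not the specific formulation used in the literature.

```python
import math

def euclidean(a, b):
    """Straight-line distance between two rating vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def pearson(a, b):
    """Pearson correlation between two equal-length rating vectors."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a)) * math.sqrt(
        sum((y - mean_b) ** 2 for y in b)
    )
    return num / den if den else 0.0

def fit_line(x, y):
    """Ordinary least squares for y ~ slope * x + intercept."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Hypothetical users: v rates every item exactly 3 points higher than u.
u = [1, 2, 1, 2]
v = [4, 5, 4, 5]

print(euclidean(u, v))  # large distance
print(pearson(u, v))    # yet perfectly correlated
slope, intercept = fit_line(u, v)
print(slope, intercept)
print(slope * 2 + intercept)  # predicted rating by v for an item u rated 2
```

The fitted line captures the constant offset between the two users, which a distance-based comparison would read as dissimilarity.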
MDP-Based • Sequential Optimization Problem • <S,A,R,Pr> • S = {states} • A = {actions} • R = {rewards} for r(s,a,s’) • Pr = {transition probabilities} for pr(s,a,s’) • Partially Observable MDP (POMDP)
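To make the four-tuple concrete, here is a toy MDP solved with value iteration. Value iteration is a standard solution technique brought in for illustration and is not named on the slide. The two states, two actions, rewards, and discount factor `gamma` are all invented for the example.

```python
def value_iteration(states, actions, pr, reward, gamma=0.5, tol=1e-9):
    """Compute optimal state values for a small, fully observable MDP.

    pr[(s, a)]          -- dict mapping next state s2 to its probability
    reward[(s, a, s2)]  -- reward r(s, a, s2); missing entries count as 0
    """
    values = {s: 0.0 for s in states}
    while True:
        delta = 0.0
        for s in states:
            # Best expected return over all actions available in s.
            best = max(
                sum(
                    p * (reward.get((s, a, s2), 0.0) + gamma * values[s2])
                    for s2, p in pr[(s, a)].items()
                )
                for a in actions
            )
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            return values

# Hypothetical recommender MDP: a user is either idle or engaged.
STATES = ["idle", "engaged"]
ACTIONS = ["wait", "recommend"]
PR = {
    ("idle", "wait"): {"idle": 1.0},
    ("idle", "recommend"): {"engaged": 1.0},
    ("engaged", "wait"): {"idle": 1.0},
    ("engaged", "recommend"): {"engaged": 1.0},
}
REWARD = {
    ("idle", "recommend", "engaged"): 1.0,
    ("engaged", "recommend", "engaged"): 1.0,
}

print(value_iteration(STATES, ACTIONS, PR, REWARD))
```

Both states converge to a value of about 2, since recommending always earns a reward of 1 and the discounted sum 1 + 0.5 + 0.25 + ... approaches 2. A POMDP would replace the known state with a belief distribution over states, which this sketch does not attempt.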
Latent Semantic • Uses statistical modeling to discover additional communities or profiles
Network Trust • We’re all mad here; I’m mad; you’re mad. • Opinions of some contacts are valued more than those of others under certain conditions • Accounting for this can increase CF accuracy • Semantic Knowledge • Social Tie-Strength
Hybrid CF • CF + Content-Based • CF + CF • CF + CF and/or Content-Based
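The simplest way to combine a CF score with a content-based score is a weighted blend, sketched below. The function name, the default weight of 0.7, and the fallback to the content-based score when no CF score exists are assumptions for illustration; real hybrids often use more elaborate schemes.

```python
def hybrid_score(cf_score, content_score, cf_weight=0.7):
    """Blend a CF prediction with a content-based prediction.

    cf_score may be None when CF has nothing to go on (for example a new
    user); in that case the content-based score is used alone.
    """
    if not 0.0 <= cf_weight <= 1.0:
        raise ValueError("cf_weight must be between 0 and 1")
    if cf_score is None:
        return content_score
    return cf_weight * cf_score + (1.0 - cf_weight) * content_score

print(hybrid_score(4.0, 2.0, cf_weight=0.5))  # equal blend
print(hybrid_score(None, 2.0))                # content-based fallback
```

The fallback branch is one reason hybrids are attractive: content-based scores remain available when rating data is too sparse for CF.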
Limitations of Existing Solutions • Time / Accuracy Trade-offs • Noisy Data • Data Sparsity (New User) • Scalability • Synonymy • Gray Sheep • Shilling Attacks • Privacy
Future Research Suggestions • Hybrids • Semantics • Trust • Parallel Processing • Multi-Agent Systems
References • Su, Xiaoyuan, and Taghi M. Khoshgoftaar. "A survey of collaborative filtering techniques." Advances in Artificial Intelligence 2009 (2009): 4. • Chen, Wei, and Simon Fong. "Social network collaborative filtering framework and online trust factors: a case study on Facebook." Digital Information Management (ICDIM), 2010 Fifth International Conference on. IEEE, 2010. • O'Donovan, John, and Barry Smyth. "Trust in recommender systems." Proceedings of the 10th international conference on Intelligent user interfaces. ACM, 2005.