Learning the parts of objects by nonnegative matrix factorization D.D. Lee from Bell Labs H.S. Seung from MIT Presenter: Zhipeng Zhao
Introduction • NMF (Nonnegative Matrix Factorization): Theory: perception of the whole is based on perception of its parts. • Comparison with two other matrix factorization methods: PCA (Principal Component Analysis) and VQ (Vector Quantization)
Comparison • Common features: • Represent a face as a linear combination of basis images. • Matrix factorization: V ≈ WH. V: n × m matrix, each column of which contains the n nonnegative pixel values of one of the m facial images. W: n × r matrix; the r columns of W are called basis images. H: r × m matrix; each column of H is called an encoding.
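As a rough sketch of the shapes involved (the dimensions below are illustrative choices, not taken from the paper), the factorization V ≈ WH can be set up in NumPy as follows:

```python
import numpy as np

# Illustrative sizes only: n pixels per image, m images, r basis images (r chosen smaller than n and m).
n, m, r = 361, 2000, 49

rng = np.random.default_rng(0)
V = rng.random((n, m))   # data matrix: each column holds the nonnegative pixels of one face
W = rng.random((n, r))   # basis images: each column is one nonnegative basis image
H = rng.random((r, m))   # encodings: each column encodes one face

assert (W @ H).shape == V.shape   # the product W H has the same n x m shape as the data V
```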
Comparison (cont’d) • Representation: NMF is parts-based; PCA and VQ are holistic. • Basis images: NMF learns localized features; PCA learns eigenfaces; VQ learns whole faces. • Constraints on W and H: NMF allows multiple basis images to represent a face, but only through additive combinations; PCA approximates each face by a linear combination of all the eigenfaces; VQ constrains each column of H to be a unary vector, so every face is approximated by a single basis image.
Implementation of NMF • Iterative algorithm: W and H are initialized with nonnegative values and then updated alternately by multiplicative rules.
Implementation (cont’d) • Objective function: related to the likelihood of generating the images in V from the basis W and the encodings H. • Updates: the multiplicative updates converge to a local maximum of the objective function.
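For reference, the objective function maximized in the paper and its multiplicative update rules are:

```latex
F = \sum_{i=1}^{n}\sum_{\mu=1}^{m}\bigl[\, V_{i\mu}\log (WH)_{i\mu} - (WH)_{i\mu} \,\bigr]

W_{ia} \leftarrow W_{ia}\sum_{\mu}\frac{V_{i\mu}}{(WH)_{i\mu}}H_{a\mu}, \qquad
W_{ia} \leftarrow \frac{W_{ia}}{\sum_{j}W_{ja}}, \qquad
H_{a\mu} \leftarrow H_{a\mu}\sum_{i}W_{ia}\frac{V_{i\mu}}{(WH)_{i\mu}}
```

A minimal NumPy sketch of these updates (the function name, iteration count, and the small eps added for numerical stability are my own choices, not part of the paper):

```python
import numpy as np

def nmf_updates(V, r, n_iter=500, eps=1e-9, seed=0):
    """Multiplicative updates for V ~ W H; eps guards against division by zero."""
    n, m = V.shape
    rng = np.random.default_rng(seed)
    W = rng.random((n, r))
    H = rng.random((r, m))
    for _ in range(n_iter):
        R = V / (W @ H + eps)                       # ratio V_{i mu} / (WH)_{i mu}
        W *= R @ H.T                                # W_{ia} <- W_{ia} * sum_mu R_{i mu} H_{a mu}
        W /= W.sum(axis=0, keepdims=True) + eps     # normalize each column of W to sum to 1
        R = V / (W @ H + eps)
        H *= W.T @ R                                # H_{a mu} <- H_{a mu} * sum_i W_{ia} R_{i mu}
    return W, H
```

Calling W, H = nmf_updates(V, r=49) on the face matrix from the earlier sketch would produce 49 basis images in the columns of W and one encoding per face in the columns of H.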
Semantic analysis of text doc. using NMF • A corpus of documents is summarized by a matrix V, where V_{iμ} is the number of times the ith word in the vocabulary appears in the μth document. • The NMF algorithm involves finding the approximate factorization of this matrix, V ≈ WH, into a feature set W and hidden variables H, in the same way as was done for faces.
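A rough sketch of this for a toy corpus, using scikit-learn's NMF with a KL-divergence loss as a stand-in for the paper's own algorithm (the corpus, the choice of two topics, and all solver settings below are illustrative assumptions):

```python
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import NMF

# Toy corpus; V_{i,mu} counts occurrences of word i in document mu.
docs = [
    "the court heard the case",
    "the judge ruled on the court case",
    "the team won the game",
    "the game went to overtime and the team lost",
]

X = CountVectorizer().fit_transform(docs)      # shape: (documents, words)
V = X.T                                        # transpose to (words, documents), as in the paper

# KL-divergence loss with multiplicative updates is the closest scikit-learn analogue
model = NMF(n_components=2, solver="mu", beta_loss="kullback-leibler",
            init="random", random_state=0, max_iter=1000)
W = model.fit_transform(V)                     # (words x topics): semantic features
H = model.components_                          # (topics x documents): hidden variables per document

print(np.round(H, 2))                          # each column shows which topics a document activates
```

Each column of H indicates which of the two topics a document activates, which is the sense in which the hidden variables group semantically related documents.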
Semantic analysis of text doc. using NMF (cont’d) • VQ: a single hidden variable is active for each document. If the same variable is active for a group of documents, they are semantically related. • PCA: allows activation of multiple semantic variables, but they are difficult to interpret. • NMF: it is natural for each document to be associated with a small subset of a large array of possible topics.
Limitations of NMF • Not suitable for learning parts in complex cases, which require fully hierarchical models with multiple levels of hidden variables. • NMF does not learn anything about the “syntactic” relationships between parts: NMF assumes that the hidden variables are nonnegative, but makes no assumptions about their statistical dependencies.