Markov Chain Sampling Methods for Dirichlet Process Mixture Models R.M. Neal

Markov Chain Sampling Methods for Dirichlet Process Mixture ModelsR.M. Neal Summarized by Joon Shik Kim 12.03.15.(Thu) Computational Models of Intelligence

Abstract • This article reviews Markov chain methods for sampling from the posterior distribution of a Dirichlet process mixture model and presents two new classes of methods. • One new approach is to make Metropolis-Hastings updates of the indicators specifying which mixture component is associated with each observation, perhaps supplemented with a partial form of Gibbs sampling.

Chinese Restaurant Process (1/2)

Chinese Restaurant Process (2/2)

Introduction (1/2) • Modeling a distribution as a mixture of simpler distribution is useful both as a nonparametric density estimation method and as a way of identifying latent classes that can explain the dependencies observed between variables. • Use of Dirichlet process mixture models has become computationally feasible with the development of Markov chain methods for sampling from the posterior distribution of the parameters of the component distribution and/or of the associations of mixture components with observations.

Introduction (2/2) • In this article, I present two new approaches to Markov chain sampling. • A very simple method for handling non-conjugate priors is to use Metropolis-Hastings updates with the conditional prior as the proposal distribution. • A variation of this method may sometimes sample more efficiently, particularly when combined with a partial form of Gibbs sampling.

Dirichlet Process Mixture Models (1/5) • The basic model applies to data y1,…,yn which we regard as part of an indefinite exchangeable sequence, or equivalent, as being independently drawn from some unknown distribution.

Dirichlet Process Mixture Models (2/5) • We model the distribution from which the yi are drawn as a mixture of distributions of the form F(θ), with the mixing distribution over θ given G. We let the prior for this mixing distribution be a Dirichlet process, with concentration parameter αand base distribution G0.

Dirichlet Process Mixture Models (3/5)

Dirichlet Process Mixture Models (4/5)

Dirichlet Process Mixture Models (5/5) • If we let K go to infinity, the conditional probabilities reach the following limits:

Gibbs Sampling when Conjugate Priors are used (3/4)

Nested CRP Day 1 Day 2 Day 3

Markov Chain Sampling Methods for Dirichlet Process Mixture Models R.M. Neal

Markov Chain Sampling Methods for Dirichlet Process Mixture Models R.M. Neal

Presentation Transcript

Collapsed Variational Dirichlet Process Mixture Models

Hierarchical Dirichlet Process and Infinite Hidden Markov Model

Markov Models

Variational Inference for Dirichlet Process Mixture

New Geometric Methods of Mixture Models for Interactive Visualization

Markov Models

Markov Models

Markov Models

Algebraic-Geometric Methods for Learning Gaussian Mixture Models

Markov Models

Markov Chain

Nonlinear Mixture Autoregressive Hidden Markov Models for Speech Recognition

Dirichlet process tutorial

Generalized Spatial Dirichlet Process Models

Hierarchical Double Dirichlet Process Mixture of Gaussian Processes

Memoized Online Variational Inference for Dirichlet Process Mixture Models

Markov Chain Models

Markov-chain Monte Carlo methods for flood data analysis

Markov Chain Monte Carlo Methods

Markov Models

Markov Chain Models

Document Clustering via Dirichlet Process Mixture Model with Feature Selection