
A Collapsed Variational Bayesian Inference Algorithm for Latent Dirichlet Allocation



Presentation Transcript


  1. A Collapsed Variational Bayesian Inference Algorithm for Latent Dirichlet Allocation Yee W. Teh, David Newman and Max Welling Published at NIPS 2006 Discussion led by Iulian Pruteanu

  2. Outline • Introduction • Approximate inferences for LDA • Collapsed VB inference for LDA • Experimental results • Conclusions

  3. Introduction (1/2) • Latent Dirichlet Allocation is suitable for many applications, from document modeling to computer vision. • Collapsed Gibbs sampling seems to be the preferred choice for large-scale problems; however, collapsed Gibbs sampling has its own problems. • The CVB algorithm, making use of some approximations, is easy to implement and more accurate than standard VB.

  4. Introduction (2/2) • This paper • proposes an improved VB algorithm based on integrating out the model parameters - assumption: the latent variables are mutually independent • uses a Gaussian approximation for computational efficiency

  5. Approximate inferences for LDA (1/3) • D - number of documents • K - number of topics • x_ij - observed words • z_ij - latent variables (topic indices) • θ_j - mixing proportions • φ_k - topic parameters
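For reference, the generative model that this notation describes can be written out explicitly (a standard statement of LDA; the symbols follow the conventional notation assumed above rather than text recovered from the slide):

\theta_j \sim \mathrm{Dirichlet}(\alpha), \qquad \phi_k \sim \mathrm{Dirichlet}(\beta), \qquad
z_{ij} \mid \theta_j \sim \mathrm{Mult}(\theta_j), \qquad x_{ij} \mid z_{ij}, \boldsymbol{\phi} \sim \mathrm{Mult}(\phi_{z_{ij}}),

where j indexes documents, i indexes word positions within document j, and k indexes topics.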

  6. Approximate inferences for LDA (2/3) Given the observed words x, the task of Bayesian inference is to compute the posterior distribution over the latent variables z, the mixing proportions θ and the topic parameters φ. 1. Variational Bayes
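As a brief reminder of what standard VB does here (the usual mean-field treatment of LDA, stated for completeness rather than transcribed from the slide): the posterior is approximated by a fully factorized distribution over parameters and topic assignments, and the resulting lower bound on the marginal likelihood is maximized,

q(\mathbf{z}, \boldsymbol{\theta}, \boldsymbol{\phi}) = \prod_j q(\theta_j) \, \prod_k q(\phi_k) \, \prod_{ij} q(z_{ij}), \qquad
\log p(\mathbf{x} \mid \alpha, \beta) \;\ge\; \mathbb{E}_q\!\left[\log p(\mathbf{x}, \mathbf{z}, \boldsymbol{\theta}, \boldsymbol{\phi} \mid \alpha, \beta)\right] + \mathcal{H}[q].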

  7. Approximate inferences for LDA (3/3) 2. Collapsed Gibbs sampling
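For context, collapsed Gibbs sampling integrates out θ and φ analytically and resamples each topic assignment from its conditional given all the others. The standard form of this conditional (a well-known result, included here for completeness) is

p(z_{ij} = k \mid \mathbf{z}^{\neg ij}, \mathbf{x}) \;\propto\; \bigl(\alpha + n^{\neg ij}_{jk}\bigr) \, \frac{\beta + n^{\neg ij}_{k x_{ij}}}{W\beta + n^{\neg ij}_{k}},

where n_{jk} is the number of words in document j assigned to topic k, n_{kw} the number of times vocabulary word w is assigned to topic k, n_k = \sum_w n_{kw}, W the vocabulary size, and the superscript \neg ij means that word ij is excluded from the counts.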

  8. Collapsed VB inference for LDA and marginalization of model parameters In the variational Bayesian approximation, we assume a factorized form for the posterior approximating distribution. However, this is not a good assumption, since changes in the model parameters (θ, φ) have a considerable impact on the latent variables (z). CVB is equivalent to marginalizing out the model parameters before approximating the posterior over the latent variables z. The exact implementation of CVB has a closed form but is computationally too expensive to be practical. Therefore, the authors propose a simple Gaussian approximation which seems to work very accurately.
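The following is a minimal sketch (in Python/NumPy, using the count notation above; function and variable names are illustrative, not taken from the paper) of one CVB sweep with the Gaussian approximation. Under a factorized q(z), every topic count is a sum of independent Bernoulli indicators, so it suffices to track its mean and variance; the update multiplies the ratio of expected leave-one-out counts by an exponential second-order correction.

import numpy as np

def cvb_sweep(docs, gamma, alpha, beta, W):
    """One pass of collapsed variational updates for LDA (a sketch, not reference code).
    docs[j]  : list of word ids for document j
    gamma[j] : (n_j, K) array with gamma[j][i, k] approximating q(z_ij = k)
    """
    K = gamma[0].shape[1]
    # Expected counts and variances of the topic counts under q(z):
    # each count is a sum of independent Bernoulli indicators.
    E_njk = np.stack([g.sum(axis=0) for g in gamma])               # (D, K)
    V_njk = np.stack([(g * (1 - g)).sum(axis=0) for g in gamma])   # (D, K)
    E_nkw = np.zeros((K, W)); V_nkw = np.zeros((K, W))
    for j, doc in enumerate(docs):
        for i, w in enumerate(doc):
            E_nkw[:, w] += gamma[j][i]
            V_nkw[:, w] += gamma[j][i] * (1 - gamma[j][i])
    E_nk, V_nk = E_nkw.sum(axis=1), V_nkw.sum(axis=1)

    for j, doc in enumerate(docs):
        for i, w in enumerate(doc):
            g = gamma[j][i]
            # leave word ij out of the counts
            a = alpha + E_njk[j] - g
            b = beta + E_nkw[:, w] - g
            c = W * beta + E_nk - g
            va = V_njk[j] - g * (1 - g)
            vb = V_nkw[:, w] - g * (1 - g)
            vc = V_nk - g * (1 - g)
            # zeroth-order ratio times the second-order (Gaussian) correction
            new = (a * b / c) * np.exp(-va / (2 * a**2)
                                       - vb / (2 * b**2)
                                       + vc / (2 * c**2))
            new /= new.sum()
            # fold the updated q(z_ij) back into the running statistics
            d_mean, d_var = new - g, new * (1 - new) - g * (1 - g)
            E_njk[j] += d_mean; V_njk[j] += d_var
            E_nkw[:, w] += d_mean; V_nkw[:, w] += d_var
            E_nk += d_mean; V_nk += d_var
            gamma[j][i] = new
    return gamma

After a few sweeps (until gamma stops changing), point estimates of the mixing proportions and topic parameters can be read off the expected counts, e.g. θ_jk ≈ (α + E[n_jk]) / (Kα + n_j).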

  9. Experimental results Left: results for KOS (D = 3,430 documents; W = 6,909 vocabulary words; N = 467,714 word tokens). Right: results for NIPS (D = 1,675 documents; W = 12,419; N = 2,166,029). 10% of the words were held out for testing; results are averaged over 50 random runs. [Plots show variational bounds and per-word log probabilities as a function of the number of iterations.]

  10. Conclusions • Variational approximations are computationally much more efficient than Gibbs sampling, with almost no loss in accuracy. • The CVB inference algorithm is easy to implement, computationally efficient (thanks to the Gaussian approximation) and more accurate than standard VB.
