Parallel Gibbs Sampling for Hierarchical Dirichlet Processes via Gamma Processes Equivalence

被引:8
作者
Cheng, Dehua [1 ]
Liu, Yan [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90089 USA
来源
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14) | 2014年
关键词
Parallel Inference; Hierarchical Dirichlet Process; Topic Model; MIXTURE MODEL;
D O I
10.1145/2623330.2623708
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The hierarchical Dirichlet process (HDP) is an intuitive and elegant technique to model data with latent groups. However, it has not been widely used for practical applications due to the high computational costs associated with inference. In this paper, we propose an effective parallel Gibbs sampling algorithm for HDP by exploring its connections with the gamma-gamma-Poisson process. Specifically, we develop a novel framework that combines bootstrap and Reversible Jump MCMC algorithm to enable parallel variable updates. We also provide theoretical convergence analysis based on Gibbs sampling with asynchronous variable updates. Experiment results on both synthetic datasets and two large-scale text collections show that our algorithm can achieve considerable speedup as well as better inference accuracy for HDP compared with existing parallel sampling algorithms.
引用
收藏
页码:562 / 571
页数:10
相关论文
empty
未找到相关数据