Negative Binomial Process Count and Mixture Modeling

被引:121
作者
Zhou, Mingyuan [1 ]
Carin, Lawrence [2 ]
机构
[1] Univ Texas Austin, McCombs Sch Business, Dept Informat Risk & Operat Management, Austin, TX 78712 USA
[2] Duke Univ, Dept Elect & Comp Engn, Durham, NC 27708 USA
关键词
Beta process; Chinese restaurant process; completely random measures; count modeling; Dirichlet process; gamma process; hierarchical Dirichlet process; mixed-membership modeling; mixture modeling; negative binomial process; normalized random measures; Poisson factor analysis; Poisson process; topic modeling; MAXIMUM-LIKELIHOOD-ESTIMATION; CHAIN MONTE-CARLO; DIRICHLET; DISPERSION; ESTIMATORS; PARAMETER; INFERENCE;
D O I
10.1109/TPAMI.2013.211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The seemingly disjoint problems of count and mixture modeling are united under the negative binomial (NB) process. A gamma process is employed to model the rate measure of a Poisson process, whose normalization provides a random probability measure for mixture modeling and whose marginalization leads to an NB process for count modeling. A draw from the NB process consists of a Poisson distributed finite number of distinct atoms, each of which is associated with a logarithmic distributed number of data samples. We reveal relationships between various count-and mixture-modeling distributions and construct a Poisson-logarithmic bivariate distribution that connects the NB and Chinese restaurant table distributions. Fundamental properties of the models are developed, and we derive efficient Bayesian inference. It is shown that with augmentation and normalization, the NB process and gamma-NB process can be reduced to the Dirichlet process and hierarchical Dirichlet process, respectively. These relationships highlight theoretical, structural, and computational advantages of the NB process. A variety of NB processes, including the beta-geometric, beta-NB, marked-beta-NB, marked-gamma-NB and zero-inflated-NB processes, with distinct sharing mechanisms, are also constructed. These models are applied to topic modeling, with connections made to existing algorithms under Poisson factor analysis. Example results show the importance of inferring both the NB dispersion and probability parameters.
引用
收藏
页码:307 / 320
页数:14
相关论文
共 71 条
[1]  
Aldous D., 1983, ECOLE DETE PROBABILI, P1
[2]  
[Anonymous], 2013, Regression Analysis of Count Data
[3]  
[Anonymous], 2001, P NIPS
[4]  
[Anonymous], 2010, ENCY MACHINE LEARNIN
[5]  
[Anonymous], 2006, Combinatorial Stochastic Processes
[6]  
[Anonymous], 2008, Econometric analysis of count data, DOI DOI 10.1007/978-3-540-24728-9_5
[7]   MIXTURES OF DIRICHLET PROCESSES WITH APPLICATIONS TO BAYESIAN NONPARAMETRIC PROBLEMS [J].
ANTONIAK, CE .
ANNALS OF STATISTICS, 1974, 2 (06) :1152-1174
[8]  
Asuncion A., 2009, P 25 UAI MONTR QC CA
[9]   FERGUSON DISTRIBUTIONS VIA POLYA URN SCHEMES [J].
BLACKWELL, D ;
MACQUEEN, JB .
ANNALS OF STATISTICS, 1973, 1 (02) :353-355
[10]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022