Combinatorial Clustering and the Beta Negative Binomial Process

被引:31
作者
Broderick, Tamara [1 ,2 ]
Mackey, Lester [3 ]
Paisley, John [4 ]
Jordan, Michael I. [1 ,2 ]
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94705 USA
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94705 USA
[3] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[4] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
基金
美国国家科学基金会;
关键词
Beta process; admixture; mixed membership; Bayesian; nonparametric; integer latent feature model; DIRICHLET; MIXTURE; MODELS; DISTRIBUTIONS; ESTIMATORS;
D O I
10.1109/TPAMI.2014.2318721
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a Bayesian nonparametric approach to a general family of latent class problems in which individuals can belong simultaneously to multiple classes and where each class can be exhibited multiple times by an individual. We introduce a combinatorial stochastic process known as the negative binomial process (NBP) as an infinite-dimensional prior appropriate for such problems. We show that the NBP is conjugate to the beta process, and we characterize the posterior distribution under the beta-negative binomial process (BNBP) and hierarchical models based on the BNBP (the HBNBP). We study the asymptotic properties of the BNBP and develop a three-parameter extension of the BNBP that exhibits power-law behavior. We derive MCMC algorithms for posterior inference under the HBNBP, and we present experiments using these algorithms in the domains of image segmentation, object recognition, and document analysis.
引用
收藏
页码:290 / 306
页数:17
相关论文
共 53 条
[1]  
[Anonymous], INT C MACH LEARN HAI
[2]  
[Anonymous], P INT C ART INT STAT
[3]  
[Anonymous], 2009, Advances in Neural Information Processing Systems
[4]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[5]   Beta Processes, Stick-Breaking and Power Laws [J].
Broderick, Tamara ;
Jordan, Michael I. ;
Pitman, Jim .
BAYESIAN ANALYSIS, 2012, 7 (02) :439-475
[6]   Estimation of Parent Specific DNA Copy Number in Tumors using High-Density Genotyping Arrays [J].
Chen, Hao ;
Xing, Haipeng ;
Zhang, Nancy R. .
PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (01)
[7]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[8]  
Damien P, 1999, J ROY STAT SOC B, V61, P331
[9]  
Erdelyi A., 1951, Pacific J. Math, V1, P133, DOI [10.2140/pjm.1951.1.133, DOI 10.2140/PJM.1951.1.133]
[10]   Bayesian mixed membership models for soft clustering and classification [J].
Erosheva, EA ;
Fienberg, SE .
CLASSIFICATION - THE UBIQUITOUS CHALLENGE, 2005, :11-26