Bayesian variable selection for latent class analysis using a collapsed Gibbs sampler

被引:19
|
作者
White, Arthur [1 ]
Wyse, Jason [1 ]
Murphy, Thomas Brendan [2 ,3 ]
机构
[1] Trinity Coll Dublin, OReilly Inst, Sch Comp Sci & Stat, Dublin 2, Ireland
[2] Univ Coll Dublin, Sch Math Sci, Complex & Adapt Syst Lab, Dublin 4, Ireland
[3] Univ Coll Dublin, Insight Res Ctr, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
Latent class analysis; Variable selection; Model selection; Collapsed sampler; Finite mixture model; Trans-dimensional MCMC; UNKNOWN NUMBER; REVERSIBLE JUMP; MODEL; INFERENCE; MIXTURES; LIKELIHOOD; COMPONENTS;
D O I
10.1007/s11222-014-9542-5
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Latent class analysis is used to perform model based clustering for multivariate categorical responses. Selection of the variables most relevant for clustering is an important task which can affect the quality of clustering considerably. This work considers a Bayesian approach for selecting the number of clusters and the best clustering variables. The main idea is to reformulate the problem of group and variable selection as a probabilistically driven search over a large discrete space using Markov chain Monte Carlo (MCMC) methods. Both selection tasks are carried out simultaneously using an MCMC approach based on a collapsed Gibbs sampling method, whereby several model parameters are integrated from the model, substantially improving computational performance. Post-hoc procedures for parameter and uncertainty estimation are outlined. The approach is tested on simulated and real data
引用
收藏
页码:511 / 527
页数:17
相关论文
共 50 条