Stochastic matching pursuit for Bayesian variable selection

被引:26
作者
Chen, Ray-Bing [1 ]
Chu, Chi-Hsiang [1 ]
Lai, Te-You [1 ]
Wu, Ying Nian [2 ]
机构
[1] Natl Univ Kaohsiung, Inst Stat, Kaohsiung, Taiwan
[2] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA USA
基金
美国国家科学基金会;
关键词
Gibbs sampler; Metropolis algorithm; Stochastic search variable selection; MODEL SELECTION; REGRESSION;
D O I
10.1007/s11222-009-9165-4
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This article proposes a stochastic version of the matching pursuit algorithm for Bayesian variable selection in linear regression. In the Bayesian formulation, the prior distribution of each regression coefficient is assumed to be a mixture of a point mass at 0 and a normal distribution with zero mean and a large variance. The proposed stochastic matching pursuit algorithm is designed for sampling from the posterior distribution of the coefficients for the purpose of variable selection. The proposed algorithm can be considered a modification of the componentwise Gibbs sampler. In the componentwise Gibbs sampler, the variables are visited by a random or a systematic scan. In the stochastic matching pursuit algorithm, the variables that better align with the current residual vector are given higher probabilities of being visited. The proposed algorithm combines the efficiency of the matching pursuit algorithm and the Bayesian formulation with well defined prior distributions on coefficients. Several simulated examples of small n and large p are used to illustrate the algorithm. These examples show that the algorithm is efficient for screening and selecting variables.
引用
收藏
页码:247 / 259
页数:13
相关论文
共 17 条
[11]   MATCHING PURSUITS WITH TIME-FREQUENCY DICTIONARIES [J].
MALLAT, SG ;
ZHANG, ZF .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (12) :3397-3415
[12]   Variable screening in predicting clinical outcome with high-dimensional microarrays [J].
Shao, Jun ;
Chow, Shein-Chung .
JOURNAL OF MULTIVARIATE ANALYSIS, 2007, 98 (08) :1529-1538
[13]   Nonparametric regression using Bayesian variable selection [J].
Smith, M ;
Kohn, R .
JOURNAL OF ECONOMETRICS, 1996, 75 (02) :317-343
[15]   Bayesian variable selection and regularization for time-frequency surface estimation [J].
Wolfe, PJ ;
Godsill, SJ ;
Ng, WJ .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2004, 66 :575-589
[16]  
WU YN, 2002, P EUR C COMP VIS, P240
[17]  
Yi NJ, 2003, GENETICS, V164, P1129