Subsampled Information Criteria for Bayesian Model Selection in the Big Data Setting

Cited by: 0
Authors
Geng, Lijiang [1]
Xue, Yishu [1]
Hu, Guanyu [1]
Affiliation
[1] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
Source
2019 IEEE International Conference on Big Data (Big Data) | 2019
Keywords
DIC; IC; MCMC; Nonuniform Subsample
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Bayesian methods face unprecedented challenges in the era of big data, as the evaluation of the likelihood in each iteration is computationally intensive. To deal with this bottleneck, the recent literature focuses mostly on speeding up Markov chain Monte Carlo (MCMC). Model selection, which is an important topic, has not received much attention. In the Bayesian context, deviance-based criteria, such as the deviance information criterion (DIC), are well known for model selection purposes. In this article, we introduce the subsampled DIC and the subsampled information criterion IC in the big data context. Extensive simulation studies are conducted to evaluate the empirical performance of the proposed criterion. The usage of our proposed criterion is further illustrated with an analysis of the Covertype dataset.
Pages: 194-199
Page count: 6