Double-Parallel Monte Carlo for Bayesian analysis of big data

被引:0
|
作者
Jingnan Xue
Faming Liang
机构
[1] Texas A&M University,Department of Statistics
[2] Purdue University,Department of Statistics
来源
Statistics and Computing | 2019年 / 29卷
关键词
Embarrassingly parallel; Divide-and-combine; MCMC; Pop-SAMC; Subset posterior aggregation;
D O I
暂无
中图分类号
学科分类号
摘要
This paper proposes a simple, practical, and efficient MCMC algorithm for Bayesian analysis of big data. The proposed algorithm suggests to divide the big dataset into some smaller subsets and provides a simple method to aggregate the subset posteriors to approximate the full data posterior. To further speed up computation, the proposed algorithm employs the population stochastic approximation Monte Carlo algorithm, a parallel MCMC algorithm, to simulate from each subset posterior. Since this algorithm consists of two levels of parallel, data parallel and simulation parallel, it is coined as “Double-Parallel Monte Carlo.” The validity of the proposed algorithm is justified mathematically and numerically.
引用
收藏
页码:23 / 32
页数:9
相关论文
共 50 条
  • [31] Variational Markov chain Monte Carlo for Bayesian smoothing of non-linear diffusions
    Shen, Yuan
    Cornford, Dan
    Opper, Manfred
    Archambeau, Cedric
    COMPUTATIONAL STATISTICS, 2012, 27 (01) : 149 - 176
  • [32] Bayesian models and Markov chain Monte Carlo methods for protein motifs with the secondary characteristics
    Xie, J
    Kim, NK
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2005, 12 (07) : 952 - 970
  • [33] Improved Discretization Analysis for Underdamped Langevin Monte Carlo
    Zhang, Matthew Shunshi
    Chewi, Sinho
    Li, Mufan Bill
    Balasubramanian, Krishnakumar
    Erdogdu, Murat A.
    THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195 : 36 - 71
  • [34] USING PARALLEL MARKOV CHAIN MONTE CARLO TO QUANTIFY UNCERTAINTIES IN GEOTHERMAL RESERVOIR CALIBRATION
    Cui, T.
    Fox, C.
    Nicholls, G. K.
    O'Sullivan, M. J.
    INTERNATIONAL JOURNAL FOR UNCERTAINTY QUANTIFICATION, 2019, 9 (03) : 295 - 310
  • [35] Bayesian inversion of seismic and electromagnetic data for marine gas reservoir characterization using multi-chain Markov chain Monte Carlo sampling
    Ren, Huiying
    Ray, Jaideep
    Hou, Zhangshuan
    Huang, Maoyi
    Bao, Jie
    Swiler, Laura
    JOURNAL OF APPLIED GEOPHYSICS, 2017, 147 : 68 - 80
  • [36] Importance Sampling-Based Transport Map Hamiltonian Monte Carlo for Bayesian Hierarchical Models
    Osmundsen, Kjartan Kloster
    Kleppe, Tore Selland
    Liesenfeld, Roman
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2021, 30 (04) : 906 - 919
  • [37] Uncertainty quantification for chromatography model parameters by Bayesian inference using sequential Monte Carlo method
    Yamamoto, Yota
    Yajima, Tomoyuki
    Kawajiri, Yoshiaki
    CHEMICAL ENGINEERING RESEARCH & DESIGN, 2021, 175 : 223 - 237
  • [38] Improving Operational Intensity in Data Bound Markov Chain Monte Carlo
    Nemeth, Balazs
    Haber, Tom
    Ashby, Thomas J.
    Lamotte, Wim
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 2348 - 2352
  • [39] Bayesian analysis of soil water characteristic curve using Markov chain Monte Carlo simulation and its application on soil water infiltration
    Liu, Weiping
    Luo, Xiaoyan
    Fu, Mingfu
    Ouyang, Guoquan
    DESALINATION AND WATER TREATMENT, 2018, 121 : 172 - 179
  • [40] Subsampled Information Criteria for Bayesian Model Selection in the Big Data Setting
    Geng, Lijiang
    Xue, Yishu
    Hu, Guanyu
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 194 - 199