On Markov chain Monte Carlo algorithms for computing conditional expectations based on sufficient statistics

被引:3
|
作者
Jones, LK [1 ]
O'Neil, PJ
机构
[1] Univ Massachusetts, Dept Math Sci, Lowell, MA 01854 USA
[2] AnVil Inc, Burlington, MA 01803 USA
基金
美国国家科学基金会;
关键词
contingency table; logistic regression; MCMC; Markov basis;
D O I
10.1198/106186002510
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Much work has focused on developing exact tests for the analysis of discrete data using log linear or logistic regression models. A parametric model is tested for a dataset by conditioning on the value of a sufficient statistic and determining the probability of obtaining another dataset as extreme or more extreme relative to the general model, where extremeness is determined by the value of a test statistic such as the chi-square or the log-likelihood ratio. Exact determination of these probabilities can be infeasible for high dimensional problems, and asymptotic approximations to them are often inaccurate when there are small data entries and/or there are many nuisance parameters. In these cases Monte Carlo methods can be used to estimate exact probabilities by randomly generating datasets (tables) that match the sufficient statistic of the original table. However, naive Monte Carlo methods produce tables that are usually far from matching the sufficient statistic. The Markov chain Monte Carlo method used in this work (the regression/attraction approach) uses attraction to concentrate the distribution around the set of tables that match the sufficient statistic, and uses regression to take advantage of information in tables that "almost" match. It is also more general than others in that it does not require the sufficient statistic to be linear, and it can be adapted to problems involving continuous variables. The method is applied to several high dimensional settings including four-way tables with a model of no four-way interaction, and a table of continuous data based on beta distributions. It is powerful enough to deal with the difficult problem of four-way tables and flexible enough to handle continuous data with a nonlinear sufficient statistic.
引用
收藏
页码:660 / 677
页数:18
相关论文
共 50 条
  • [1] Fast Markov chain Monte Carlo algorithms via Lie groups
    Huntsman, Steve
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 2841 - 2850
  • [2] Transdimensional transformation based Markov chain Monte Carlo
    Das, Moumita
    Bhattacharya, Sourabh
    BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2019, 33 (01) : 87 - 138
  • [3] Markov Chain Monte Carlo - Based Approaches for Modeling the Spatial Survival with Conditional Autoregressive (CAR) Frailty
    Iriawan, Nur
    Astutik, Suci
    Prastyo, Dedy Dwi
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2010, 10 (12): : 211 - 216
  • [4] Galaxy Decomposition in Multispectral Images Using Markov Chain Monte Carlo Algorithms
    Perret, Benjamin
    Mazet, Vincent
    Collet, Christophe
    Slezak, Eric
    IMAGE ANALYSIS, PROCEEDINGS, 2009, 5575 : 209 - +
  • [5] COMPLEXITY BOUNDS FOR MARKOV CHAIN MONTE CARLO ALGORITHMS VIA DIFFUSION LIMITS
    Roberts, Gareth O.
    Rosenthal, Jeffrey S.
    JOURNAL OF APPLIED PROBABILITY, 2016, 53 (02) : 410 - 420
  • [6] An Auxiliary Variable Method for Markov Chain Monte Carlo Algorithms in High Dimension
    Marnissi, Yosra
    Chouzenoux, Emilie
    Benazza-Benyahia, Amel
    Pesquet, Jean-Christophe
    ENTROPY, 2018, 20 (02)
  • [7] Convergence Diagnostics for Markov Chain Monte Carlo
    Roy, Vivekananda
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 7, 2020, 2020, 7 : 387 - 412
  • [8] THE BOOTSTRAP AND MARKOV-CHAIN MONTE CARLO
    Efron, Bradley
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2011, 21 (06) : 1052 - 1062
  • [9] Hypothesis testing for Markov chain Monte Carlo
    Benjamin M. Gyori
    Daniel Paulin
    Statistics and Computing, 2016, 26 : 1281 - 1292
  • [10] Markov Chain Monte Carlo confidence intervals
    Atchade, Yves F.
    BERNOULLI, 2016, 22 (03) : 1808 - 1838