Scalable Computation of Predictive Probabilities in Probit Models with Gaussian Process Priors

被引:2
|
作者
Cao, Jian [1 ]
Durante, Daniele [2 ,3 ]
Genton, Marc G. [1 ]
机构
[1] King Abdullah Univ Sci & Technol, Stat Program, Thuwal, Saudi Arabia
[2] Bocconi Univ, Dept Decis Sci, Milan, Italy
[3] Bocconi Univ, Bocconi Inst Data Sci & Analyt, Milan, Italy
关键词
Binary data; Gaussian process; Multivariate truncated normal; Probit model; Unified skew-normal; Variational Bayes; CONDITIONING APPROXIMATIONS; BAYESIAN-INFERENCE; REGRESSION; BINARY; CLASSIFICATION; SIMULATION;
D O I
10.1080/10618600.2022.2036614
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Predictive models for binary data are fundamental in various fields, and the growing complexity of modern applications has motivated several flexible specifications for modeling the relationship between the observed predictors and the binary responses. A widely-implemented solution is to express the probability parameter via a probit mapping of a Gaussian process indexed by predictors. However, unlike for continuous settings, there is a lack of closed-form results for predictive distributions in binary models with Gaussian process priors. Markov chain Monte Carlo methods and approximation strategies provide common solutions to this problem, but state-of-the-art algorithms are either computationally intractable or inaccurate in moderate-to-high dimensions. In this article, we aim to cover this gap by deriving closed-form expressions for the predictive probabilities in probit Gaussian processes that rely either on cumulative distribution functions of multivariate Gaussians or on functionals of multivariate truncated normals. To evaluate these quantities we develop novel scalable solutions based on tile-low-rank Monte Carlo methods for computing multivariate Gaussian probabilities, and on mean-field variational approximations of multivariate truncated normals. Closed-form expressions for the marginal likelihood and for the posterior distribution of the Gaussian process are also discussed. As shown in simulated and real-world empirical studies, the proposed methods scale to dimensions where state-of-the-art solutions are impractical.
引用
收藏
页码:709 / 720
页数:12
相关论文
共 50 条
  • [21] gptools: Scalable Gaussian Process Inference with Stan
    Hoffmann, Till
    Onnela, Jukka-Pekka
    JOURNAL OF STATISTICAL SOFTWARE, 2025, 112 (02): : 1 - 31
  • [22] Nested Expectation Propagation for Gaussian Process Classification with a Multinomial Probit Likelihood
    Riihimaki, Jaakko
    Jylanki, Pasi
    Vehtari, Aki
    JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 75 - 109
  • [23] GoGP: scalable geometric-based Gaussian process for online regression
    Trung Le
    Khanh Nguyen
    Vu Nguyen
    Tu Dinh Nguyen
    Dinh Phung
    Knowledge and Information Systems, 2019, 60 : 197 - 226
  • [24] GoGP: scalable geometric-based Gaussian process for online regression
    Trung Le
    Khanh Nguyen
    Vu Nguyen
    Tu Dinh Nguyen
    Dinh Phung
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (01) : 197 - 226
  • [25] Gaussian process latent class choice models
    Sfeir, Georges
    Rodrigues, Filipe
    Abou-Zeid, Maya
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 136
  • [26] A Gaussian Process Approach for Predictive Maintenance
    Zeng, Junqi
    Liang, Zhenglin
    Guo, Chunhui
    Song, Minyuan
    Xue, Zongqi
    2021 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C 2021), 2021, : 745 - 750
  • [27] Chebyshev Polynomials for Efficient Gaussian Process Computation
    Dudek, Adrian
    Baranowski, Jerzy
    2023 27TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS, MMAR, 2023, : 240 - 245
  • [28] Variational Bayesian multinomial probit model with Gaussian process classification on mice protein expression level data
    Son, Donghyun
    Hwang, Beom Seuk
    KOREAN JOURNAL OF APPLIED STATISTICS, 2023, 36 (02)
  • [29] Priors and Posterior Computation in Linear Endogenous Variable Models with Imperfect Instruments
    Chan, Joshua C. C.
    Tobias, Justin L.
    JOURNAL OF APPLIED ECONOMETRICS, 2015, 30 (04) : 650 - 674
  • [30] A comparison of centring parameterisations of Gaussian process-based models for Bayesian computation using MCMC
    Bass, Mark R.
    Sahu, Sujit K.
    STATISTICS AND COMPUTING, 2017, 27 (06) : 1491 - 1512