Scalable Bayesian modelling for smoothing disease risks in large spatial data sets using INLA

被引:18
|
作者
Orozco-Acosta, Erick [1 ,2 ,3 ]
Adin, Aritz [1 ,2 ,3 ]
Dolores Ugarte, Maria [1 ,2 ,3 ,4 ]
机构
[1] Univ Publ Navarra, Campus Arrosadia, Pamplona 31006, Spain
[2] Univ Publ Navarra, Dept Stat Comp Sci & Math, Navarra, Spain
[3] Univ Publ Navarra, Inst Adv Mat & Math InaMat, Navarra, Spain
[4] UNED, Ctr Asociado Pamplona, Madrid, Spain
关键词
High-dimensional data; Hierarchical models; Mixture models; Spatial epidemiology; MARKOV RANDOM-FIELDS; INFERENCE;
D O I
10.1016/j.spasta.2021.100496
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Several methods have been proposed in the spatial statistics literature to analyse big data sets in continuous domains. However, new methods for analysing high-dimensional areal data are still scarce. Here, we propose a scalable Bayesian modelling approach for smoothing mortality (or incidence) risks in high-dimensional data, that is, when the number of small areas is very large. The method is implemented in the R add-on package bigDM and it is based on the idea of "divide and conquer". Although this proposal could possibly be implemented using any Bayesian fitting technique, we use INLA here (integrated nested Laplace approximations) as it is now a well-known technique, computationally efficient, and easy for practitioners to handle. We analyse the proposal's empirical performance in a comprehensive simulation study that considers two model-free settings. Finally, the methodology is applied to analyse male colorectal cancer mortality in Spanish municipalities showing its benefits with regard to the standard approach in terms of goodness of fit and computational time. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:15
相关论文
共 44 条
  • [21] Bayesian calibration of a flood inundation model using spatial data
    Hall, Jim W.
    Manning, Lucy J.
    Hankin, Robin K. S.
    WATER RESOURCES RESEARCH, 2011, 47
  • [22] A comparison between Markov approximations and other methods for large spatial data sets
    Bolin, David
    Lindgren, Finn
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 61 : 7 - 21
  • [23] Evaluating Bayesian spatial methods for modelling species distributions with clumped and restricted occurrence data
    Redding, David W.
    Lucas, Tim C. D.
    Blackburn, Tim M.
    Jones, Kate E.
    PLOS ONE, 2017, 12 (11):
  • [24] Bayesian modelling of Dupuytren disease by using Gaussian copula graphical models
    Mohammadi, Abdolreza
    Abegaz, Fentaw
    van den Heuvel, Edwin
    Wit, Ernst C.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2017, 66 (03) : 629 - 645
  • [25] Bayesian spatial modelling of childhood cancer incidence in Switzerland using exact point data: a nationwide study during 1985-2015
    Konstantinoudis, Garyfallos
    Schuhmacher, Dominic
    Ammann, Roland A.
    Diesch, Tamara
    Kuehni, Claudia E.
    Spycher, Ben D.
    INTERNATIONAL JOURNAL OF HEALTH GEOGRAPHICS, 2020, 19 (01)
  • [26] Analysis of recurrent event data with spatial random effects using a Bayesian approach
    Jin, Jin
    Sun, Liuquan
    Ou, Huang-Tz
    Su, Pei-Fang
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2024, 33 (11-12) : 1993 - 2006
  • [27] A scalable approach for short-term disease forecasting in high spatial resolution areal data
    Orozco-Acosta, Erick
    Riebler, Andrea
    Adin, Aritz
    Ugarte, Maria D.
    BIOMETRICAL JOURNAL, 2023, 65 (08)
  • [28] Bayesian Life Test Acceptance Criteria for Progressively Censored Competing Risks Data Using Copulas
    Salem, Maram Magdy
    INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2022, 29 (06)
  • [29] Model Choice Problems Using Approximate Bayesian Computation with Applications to Pathogen Transmission Data Sets
    Lee, Xing Ju
    Drovandi, Christopher C.
    Pettitt, Anthony N.
    BIOMETRICS, 2015, 71 (01) : 198 - 207
  • [30] SMOOTHED FULL-SCALE APPROXIMATION OF GAUSSIAN PROCESS MODELS FOR COMPUTATION OF LARGE SPATIAL DATA SETS
    Zhang, Bohai
    Sang, Huiyan
    Huang, Jianhua Z.
    STATISTICA SINICA, 2019, 29 (04) : 1711 - 1737