Several methods have been proposed in the spatial statistics literature to analyse big data sets in continuous domains. However, new methods for analysing high-dimensional areal data are still scarce. Here, we propose a scalable Bayesian modelling approach for smoothing mortality (or incidence) risks in high-dimensional data, that is, when the number of small areas is very large. The method is implemented in the R add-on package bigDM and it is based on the idea of "divide and conquer". Although this proposal could possibly be implemented using any Bayesian fitting technique, we use INLA here (integrated nested Laplace approximations) as it is now a well-known technique, computationally efficient, and easy for practitioners to handle. We analyse the proposal's empirical performance in a comprehensive simulation study that considers two model-free settings. Finally, the methodology is applied to analyse male colorectal cancer mortality in Spanish municipalities showing its benefits with regard to the standard approach in terms of goodness of fit and computational time. (C) 2021 Elsevier B.V. All rights reserved.
机构:
Univ York, Bioctr, York Neuroimaging Ctr, York YO10 5DG, N Yorkshire, EnglandUniv York, Bioctr, York Neuroimaging Ctr, York YO10 5DG, N Yorkshire, England
Harrison, L. M.
Green, G. G. R.
论文数: 0引用数: 0
h-index: 0
机构:
Univ York, Bioctr, York Neuroimaging Ctr, York YO10 5DG, N Yorkshire, EnglandUniv York, Bioctr, York Neuroimaging Ctr, York YO10 5DG, N Yorkshire, England
机构:
Univ Costa Rica, Programa Invest Desarrollo Urbano Sostenible, San Jose, Costa RicaUniv Politecn Madrid, Dept Ingn Transporte Terr & Urbanismo, Madrid, Spain