PREDICTING PALEOCLIMATE FROM COMPOSITIONAL DATA USING MULTIVARIATE GAUSSIAN PROCESS INVERSE PREDICTION

Times cited: 2
Authors
Tipton, John R. [1]
Hooten, Mevin B. [2, 3]
Nolan, Connor [4]
Booth, Robert K. [5]
McLachlan, Jason [6]
Affiliations
[1] Univ Arkansas, Dept Math Sci, Fayetteville, AR 72701 USA
[2] Colorado State Univ, Dept Stat, Ft Collins, CO 80523 USA
[3] US Geol Survey, Colorado Cooperat Fish & Wildlife Res Unit, Dept Fish Wildlife & Conservat Biol, Ft Collins, CO 80523 USA
[4] Univ Arizona, Dept Geosci, Tucson, AZ 85721 USA
[5] Lehigh Univ, Earth & Environm Sci Dept, Bethlehem, PA 18015 USA
[6] Univ Notre Dame, Dept Biol, Notre Dame, IN 46556 USA
Funding
National Science Foundation (USA)
Keywords
Bayesian hierarchical models; predictive validation; model comparison; ecological functional response model; BAYESIAN-INFERENCE; FOREST COMPOSITION; MODEL; PH; DIATOMS; RECONSTRUCTION; CALIBRATION; LIKELIHOOD; REGRESSION; ANALOGS;
DOI
10.1214/19-AOAS1281
Chinese Library Classification number
O21 [Probability and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
Multivariate compositional count data arise in many applications including ecology, microbiology, genetics and paleoclimate. A frequent question in the analysis of multivariate compositional count data is what underlying values of a covariate(s) give rise to the observed composition. Learning the relationship between covariates and the compositional count allows for inverse prediction of unobserved covariates given compositional count observations. Gaussian processes provide a flexible framework for modeling functional responses with respect to a covariate without assuming a functional form. Many scientific disciplines use Gaussian process approximations to improve prediction and make inference on latent processes and parameters. When prediction is desired on unobserved covariates given realizations of the response variable, this is called inverse prediction. Because inverse prediction is often mathematically and computationally challenging, predicting unobserved covariates often requires fitting models that are different from the hypothesized generative model. We present a novel computational framework that allows for efficient inverse prediction using a Gaussian process approximation to generative models. Our framework enables scientific learning about how the latent processes co-vary with respect to covariates while simultaneously providing predictions of missing covariates. The proposed framework is capable of efficiently exploring the high dimensional, multi-modal latent spaces that arise in the inverse problem. To demonstrate flexibility, we apply our method in a generalized linear model framework to predict latent climate states given multivariate count data. Based on cross-validation, our model has predictive skill competitive with current methods while simultaneously providing formal, statistical inference on the underlying community dynamics of the biological system previously not available.
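As a rough illustration of what inverse prediction means in this setting, the sketch below recovers an unobserved covariate from a single vector of compositional counts. It is not the authors' framework: the Gaussian process is replaced by fixed, known unimodal response curves, and the posterior is evaluated on a grid under a flat prior rather than sampled. The function names, the taxon optima, the curve width, and the grid are all hypothetical choices made for the example.

```python
import numpy as np


def composition(x, optima, width=1.0):
    """Map covariate values to taxon probabilities.

    Assumption: each taxon has a unimodal latent response centred at its
    optimum; a softmax turns the latent responses into a composition.
    """
    x = np.atleast_1d(np.asarray(x, dtype=float))[:, None]
    latent = -0.5 * ((x - optima[None, :]) / width) ** 2
    latent -= latent.max(axis=1, keepdims=True)  # numerical stability
    p = np.exp(latent)
    return p / p.sum(axis=1, keepdims=True)


def inverse_predict(counts, grid, optima, width=1.0):
    """Posterior over the covariate on a grid, given one count vector.

    Uses a multinomial likelihood and a flat prior over the grid points.
    """
    probs = composition(grid, optima, width)   # shape (G, K)
    loglik = np.log(probs) @ counts            # shape (G,)
    weights = np.exp(loglik - loglik.max())
    return weights / weights.sum()


# Hypothetical setup: four taxa, one latent climate covariate.
optima = np.array([-2.0, -0.5, 1.0, 2.5])
grid = np.linspace(-3.0, 3.0, 601)

rng = np.random.default_rng(0)
x_true = 0.7
counts = rng.multinomial(500, composition(x_true, optima)[0])

posterior = inverse_predict(counts, grid, optima)
x_mean = float(np.sum(grid * posterior))
print(f"true covariate: {x_true}, posterior mean: {x_mean:.2f}")
```

In this toy version the response curves are fixed, so the posterior over the covariate is unimodal. The multi-modal latent spaces the abstract mentions arise when the functional responses themselves are unknown and must be learned alongside the missing covariates.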
Pages: 2363-2388
Page count: 26