SPATIAL JOINT SPECIES DISTRIBUTION MODELING USING DIRICHLET PROCESSES

被引:13
作者
Shirota, Shinichiro [1 ]
Gelfand, Alan E. [2 ]
Banerjee, Sudipto [1 ]
机构
[1] Univ Calif Los Angeles, Dept Biostat, 650 Charles E Young Dr, South Los Angeles, CA 90095 USA
[2] Duke Univ, Dept Stat, Durham, NC 27708 USA
关键词
Dimension reduction; Gaussian processes; high-dimensional covariance matrix; spatial factor model; species dependence; CLIMATE-CHANGE; DIMENSION REDUCTION; BAYESIAN MODEL; MULTIVARIATE; INFERENCE; COOCCURRENCE; PATTERNS; HABITAT; TREE;
D O I
10.5705/ss.202017.0482
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Species distribution models usually attempt to explain the presence-absence or abundance of a species at a site in terms of the environmental features (so-called abiotic features) present at the site. Historically, such models have considered species individually. However, it is well established that species interact to influence the presence-absence and abundance (envisioned as biotic factors). As a result, recently joint species distribution models with various types of responses, such as presence-absence, continuous, and ordinal data have attracted a significant amount of interest. Such models incorporate the dependence between species' responses as a proxy for interaction. We address the accommodation of such modeling in the context of a large number of species (e.g., order 10(2)) across sites numbering in the order of 10(2) or 10(3) when, in practice, only a few species are found at any observed site. To do so, we adopt a dimension-reduction approach. The novelty of our approach is that we add spatial dependence. That is, we consider a collection of sites over a relatively small spatial region. As such, we anticipate that the species distribution at a given site will be similar to that at a nearby site. Specifically, we handle dimension reduction using Dirichlet processes, which enables the clustering of species, and add spatial dependence across sites using Gaussian processes. We use simulated data and a plant communities data set for the Cape Floristic Region (CFR) of South Africa to demonstrate our approach. The latter consists of presence-absence measurements for 639 tree species at 662 locations. These two examples demonstrate the improved predictive performance of our method using the aforementioned specification.
引用
收藏
页码:1127 / 1154
页数:28
相关论文
共 55 条
  • [1] [Anonymous], 2014, Hierarchical Modelling and Analysis for Spatial Data
  • [2] [Anonymous], 1986, Floristic regions of the world
  • [3] Current approaches to modelling the environmental niche of eucalypts: Implication for management of forest biodiversity
    Austin, MP
    Meyers, JA
    [J]. FOREST ECOLOGY AND MANAGEMENT, 1996, 85 (1-3) : 95 - 106
  • [4] Stationary process approximation for the analysis of large spatial datasets
    Banerjee, Sudipto
    Gelfand, Alan E.
    Finley, Andrew O.
    Sang, Huiyan
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 825 - 848
  • [5] Sparse Bayesian infinite factor models
    Bhattacharya, A.
    Dunson, D. B.
    [J]. BIOMETRIKA, 2011, 98 (02) : 291 - 306
  • [6] Forecasting the effects of global warming on biodiversity
    Botkin, Daniel B.
    Saxe, Henrik
    Araujo, Miguel B.
    Betts, Richard
    Bradshaw, Richard H. W.
    Cedhagen, Tomas
    Chesson, Peter
    Dawson, Terry P.
    Etterson, Julie R.
    Faith, Daniel P.
    Ferrier, Simon
    Guisan, Antoine
    Hansen, Anja Skjoldborg
    Hilbert, David W.
    Loehle, Craig
    Margules, Chris
    New, Mark
    Sobel, Matthew J.
    Stockwell, David R. B.
    [J]. BIOSCIENCE, 2007, 57 (03) : 227 - 236
  • [7] A semiparametric Bayesian model for randomised block designs
    Bush, CA
    MacEachern, SN
    [J]. BIOMETRIKA, 1996, 83 (02) : 275 - 285
  • [8] Stacking species distribution models and adjusting bias by linking them to macroecological models
    Calabrese, Justin M.
    Certain, Gregoire
    Kraan, Casper
    Dormann, Carsten F.
    [J]. GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2014, 23 (01): : 99 - 112
  • [9] Point pattern modelling for degraded presence-only data over large regions
    Chakraborty, Avishek
    Gelfand, Alan E.
    Wilson, Adam M.
    Latimer, Andrew M.
    Silander, John A.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2011, 60 : 757 - 776
  • [10] Analysis of multivariate probit models
    Chib, S
    Greenberg, E
    [J]. BIOMETRIKA, 1998, 85 (02) : 347 - 361