Point pattern modelling for degraded presence-only data over large regions

被引:67
作者
Chakraborty, Avishek [1 ]
Gelfand, Alan E. [2 ]
Wilson, Adam M. [3 ]
Latimer, Andrew M. [4 ]
Silander, John A. [3 ]
机构
[1] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
[2] Duke Univ, Durham, NC 27706 USA
[3] Univ Connecticut, Storrs, CT USA
[4] Univ Calif Davis, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
Gaussian process; Intensity surface; Land use transformation; Non-homogeneous Poisson process; Predictive spatial process; Sampling bias; Spatial point pattern; SPECIES DISTRIBUTIONS; SPATIAL-DISTRIBUTION; BIODIVERSITY; ABUNDANCE; BIAS;
D O I
10.1111/j.1467-9876.2011.01023.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Explaining the distribution of a species by using local environmental features is a long-standing ecological problem. Often, available data are collected as a set of presence locations only, thus precluding the possibility of a desired presence-absence analysis. We propose that it is natural to view presence-only data as a point pattern over a region and to use local environmental features to explain the intensity driving this point pattern. We use a hierarchical model to treat the presence data as a realization of a spatial point process, whose intensity is governed by the set of environmental covariates. Spatial dependence in the intensity levels is modelled with random effects involving a zero-mean Gaussian process. We augment the model to capture highly variable and typically sparse sampling effort as well as land transformation, both of which degrade the point pattern. The Cape Floristic Region in South Africa provides an extensive class of such species data. The potential (i.e. non-degraded) presence surfaces over the entire area are of interest from a conservation and policy perspective. The region is divided into about 37000 grid cells. To work with a Gaussian process over a very large number of cells we use a predictive spatial process approximation. Bias correction by adding a heteroscedastic error component has also been implemented. We illustrate with modelling for six different species. Also, a comparison is made with the now popular Maxent approach though it is limited with regard to inference. The resultant patterns are important on their own but also enable a comparative view, for example, to investigate whether a pair of species are potentially competing in the same area. An additional feature of our modelling is the opportunity to infer about biodiversity through species richness, i.e. the number of distinct species in an areal unit. Such an investigation immediately follows within our modelling framework.
引用
收藏
页码:757 / 776
页数:20
相关论文
共 48 条
  • [1] [Anonymous], 2003, Statistical Inference and Simulation for Spatial Point Processes
  • [2] An autologistic model for the spatial distribution of wildlife
    Augustin, NH
    Mugglestone, MA
    Buckland, ST
    [J]. JOURNAL OF APPLIED ECOLOGY, 1996, 33 (02) : 339 - 347
  • [3] Banerjee S., 2003, Hierarchical modeling and analysis for spatial data
  • [4] Stationary process approximation for the analysis of large spatial datasets
    Banerjee, Sudipto
    Gelfand, Alan E.
    Finley, Andrew O.
    Sang, Huiyan
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 825 - 848
  • [5] BENES V, 2003, P INT C ENV STAT HLT, P95
  • [6] BESAG J, 1974, J ROY STAT SOC B MET, V36, P192
  • [7] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [8] CHAKRABORTY A, 2010, THESIS DUKE U DURHAM
  • [9] MODELING LARGE SCALE SPECIES ABUNDANCE WITH LATENT SPATIAL PROCESSES
    Chakraborty, Avishek
    Gelfand, Alan E.
    Wilson, Adam M.
    Latimer, Andrew M.
    Silander, John A., Jr.
    [J]. ANNALS OF APPLIED STATISTICS, 2010, 4 (03) : 1403 - 1429
  • [10] Fixed rank kriging for very large spatial data sets
    Cressie, Noel
    Johannesson, Gardar
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 209 - 226