Comparative interpretation of count, presence-absence and point methods for species distribution models

被引:204
作者
Aarts, Geert [1 ,2 ]
Fieberg, John [3 ]
Matthiopoulos, Jason [4 ,5 ]
机构
[1] Inst Marine Resources & Ecosyst Studies, IMARES Wageningen UR, NL-1790 AD Den Burg, Netherlands
[2] Royal Netherlands Inst Sea Res NIOZ, NL-1790 AD Den Burg, Netherlands
[3] Minnesota Dept Nat Resources MN DNR, Biometr Unit, Forest Lake, MN 55025 USA
[4] Univ St Andrews, Sch Biol, Scottish Oceans Inst, St Andrews KY16 8LB, Fife, Scotland
[5] Univ St Andrews, Ctr Res Environm & Ecol Modelling, St Andrews KY16 9LZ, Fife, Scotland
来源
METHODS IN ECOLOGY AND EVOLUTION | 2012年 / 3卷 / 01期
关键词
accessibility; availability; case-control; line-transect; occupancy; presence-only; resource selection; telemetry; usage; utilization distribution; RESOURCE SELECTION; SPATIAL AUTOCORRELATION; LOGISTIC-REGRESSION; ABUNDANCE; PREDICTION; SUPPORT; ACCOUNT; SPACE; SCALE; BIAS;
D O I
10.1111/j.2041-210X.2011.00141.x
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
1. The need to understand the processes shaping population distributions has resulted in a vast increase in the diversity of spatial wildlife data, leading to the development of many novel analytical techniques that are fit-for-purpose. One may aggregate location data into spatial units (e.g. grid cells) and model the resulting counts or presenceabsences as a function of environmental covariates. Alternatively, the point data may be modelled directly, by combining the individual observations with a set of random or regular points reflecting habitat availability, a method known as a use-availability design (or, alternatively a presence pseudo-absence or casecontrol design). 2. Although these spatial point, count and presence-absence methods are widely used, the ecological literature is not explicit about their connections and how their parameter estimates and predictions should be interpreted. The objective of this study is to recapitulate some recent statistical results and illustrate that under certain assumptions, each method can be motivated by the same underlying spatial inhomogeneous Poisson point process (IPP) model in which the intensity function is modelled as a log-linear function of covariates. 3. The Poisson likelihood used for count data is a discrete approximation of the IPP likelihood. Similarly, the presence-absence design will approximate the IPP likelihood, but only when spatial units (i.e. pixels) are extremely small (Electric Journal of Statistics, 2010, 4, 1151-1201). For larger pixel sizes, presence-absence designs do not differentiate between one or multiple observations within each pixel, hence leading to information loss. 4. Logistic regression is often used to estimate the parameters of the IPP model using point data. Although the response variable is defined as 0 for the availability points, these zeros do not serve as true absences as is often assumed; rather, their role is to approximate the integral of the denominator in the IPP likelihood (The Annals of Applied Statistics, 2010, 4, 1383-1402). Because of this common misconception, the estimated exponential function of the linear predictor (i.e. the resource selection function) is often assumed to be proportional to occupancy. Like IPP and count models, this function is proportional to the expected density of observations. 5. Understanding these (dis-)similarities between different species distribution modelling techniques should improve biological interpretation of spatial models and therefore advance ecological and methodological cross-fertilization.
引用
收藏
页码:177 / 187
页数:11
相关论文
共 62 条
  • [1] Estimating space-use and habitat preference from wildlife telemetry data
    Aarts, Geert
    MacKenzie, Monique
    McConnell, Bernie
    Fedak, Mike
    Matthiopoulos, Jason
    [J]. ECOGRAPHY, 2008, 31 (01) : 140 - 160
  • [2] Spatial prediction of species distribution: an interface between ecological theory and statistical modelling
    Austin, MP
    [J]. ECOLOGICAL MODELLING, 2002, 157 (2-3) : 101 - 118
  • [3] Spatial logistic regression and change-of-support in Poisson point processes
    Baddeley, A.
    Berman, M.
    Fisher, N. I.
    Hardegen, A.
    Milne, R. K.
    Schuhmacher, D.
    Shah, R.
    Turner, R.
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2010, 4 : 1151 - 1201
  • [4] Practical maximum pseudolikelihood for spatial point patterns
    Baddeley, A
    Turner, R
    [J]. AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2000, 42 (03) : 283 - 315
  • [5] Relating populations to habitats using resource selection functions
    Boyce, MS
    McDonald, LL
    [J]. TRENDS IN ECOLOGY & EVOLUTION, 1999, 14 (07) : 268 - 272
  • [6] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [7] Cramer EM, 1985, ENCY STAT SCI, P639
  • [8] Cressie N.A.C., 1993, SPATIAL STAT SPATIAL
  • [9] Partial-Likelihood Analysis of Spatio-Temporal Point-Process Data
    Diggle, Peter J.
    Kaimi, Irene
    Abellana, Rosa
    [J]. BIOMETRICS, 2010, 66 (02) : 347 - 354
  • [10] Geostatistical inference under preferential sampling
    Diggle, Peter J.
    Menezes, Raquel
    Su, Ting-li
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2010, 59 : 191 - 232