Should data be partitioned spatially before building large-scale distribution models?

Cited by: 121
Authors
Osborne, PE [1]
Suárez-Seoane, S [1]
Affiliations
[1] Univ Stirling, Dept Environm Sci, Stirling FK9 4LA, Scotland
Keywords
distribution models; spatial non-stationarity; logistic regression; spatial heterogeneity; birds; Spain
DOI
10.1016/S0304-3800(02)00198-9
Chinese Library Classification (CLC) number
Q14 [Ecology (bioecology)]
Discipline classification code
071012; 0713
Abstract
There is growing interest in building predictive models of species distributions over large geographic areas. As larger areas are modelled, however, it is highly likely that heterogeneity in the predictor variables increases and that areas are included where animals respond to habitats in different ways, for example, due to social status. These effects (spatial non-stationarity) may weaken model performance. This paper explores whether data partitioning prior to analysis can improve the fit of models and provide ecological insight into distribution patterns. Data on three bird species were modelled for the whole of Spain at 1 km² resolution using logistic regression analysis. Data were partitioned into geographic quarters, concentric rings around the centroid of the distribution, and into random samples for comparison. In all cases, data partitioning produced better models, as assessed by Receiver Operating Characteristic curve (AUC) statistics, than analysis of the global data set. Inclusion of latitude and longitude improved the global models only when added as smoothed splines but produced different probabilities to the partitioned data. Geographic partitioning is a very crude local modelling approach and we suggest that some form of geographically-weighted regression could offer the best solution to large-scale modelling but is computationally intensive on Geographical Information Systems (GIS) data. It is concluded that simple partitioning by geographic quarters may detect spatial non-stationarity and alert the modeller to possible problems; that partitioning into more novel arrangements may be used to test ecological hypotheses; and that data should not be partitioned spatially to build and test models if non-stationarity is suspected. © 2002 Elsevier Science B.V. All rights reserved.
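The three partitioning schemes named in the abstract (geographic quarters, concentric rings around a centroid, and random samples) and the AUC statistic can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the ring width, the coordinate units, and the toy cells are all hypothetical, and the logistic regression step is omitted, so the scores passed to `auc` stand in for the fitted probabilities of whatever model is applied to each partition.

```python
import math
import random


def quarter_label(x, y, cx, cy):
    """Assign a point to a geographic quarter (NE, NW, SE, SW) around (cx, cy)."""
    ns = "N" if y >= cy else "S"
    ew = "E" if x >= cx else "W"
    return ns + ew


def ring_label(x, y, cx, cy, ring_width):
    """Index of the concentric ring around (cx, cy); 0 is the innermost ring."""
    return int(math.hypot(x - cx, y - cy) // ring_width)


def partition(cells, key):
    """Group cells into a dict keyed by key(cell)."""
    groups = {}
    for cell in cells:
        groups.setdefault(key(cell), []).append(cell)
    return groups


def auc(scores, labels):
    """AUC as the probability that a presence outscores an absence (ties count half)."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one presence and one absence")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the random partition is repeatable
    # Hypothetical cells: (x, y, presence) on an arbitrary 100 x 100 grid.
    cells = [(rng.uniform(0, 100), rng.uniform(0, 100), rng.randrange(2))
             for _ in range(200)]
    cx = sum(c[0] for c in cells) / len(cells)  # centroid of the sampled cells
    cy = sum(c[1] for c in cells) / len(cells)

    quarters = partition(cells, lambda c: quarter_label(c[0], c[1], cx, cy))
    rings = partition(cells, lambda c: ring_label(c[0], c[1], cx, cy, 20.0))
    randoms = partition(cells, lambda c: rng.randrange(4))  # random comparison groups

    for name, groups in (("quarters", quarters), ("rings", rings), ("random", randoms)):
        print(name, {k: len(v) for k, v in sorted(groups.items())})
```

In a workflow of this kind, one model would be fitted per group and its AUC compared with that of a single model fitted to all cells; the random groups serve as a baseline showing whether any gain comes from geography rather than from the smaller sample size alone.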
Pages: 249-259
Number of pages: 11