Bias correction in species distribution models: pooling survey and collection data for multiple species

被引:307
作者
Fithian, William [1 ]
Elith, Jane [2 ]
Hastie, Trevor [1 ]
Keith, David A. [3 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Univ Melbourne, Sch Bot, Parkville, Vic 3010, Australia
[3] Univ New S Wales, Ctr Ecosyst Sci, Sydney, NSW 2052, Australia
来源
METHODS IN ECOLOGY AND EVOLUTION | 2015年 / 6卷 / 04期
基金
美国国家卫生研究院; 美国国家科学基金会; 澳大利亚研究理事会;
关键词
presence-absence; presence-only; sampling bias; spatial point processes; species distribution models; PRESENCE-ONLY DATA; POINT PROCESS MODELS; PRESENCE-ABSENCE; ABUNDANCE; EQUIVALENCE; OCCUPANCY; BOOTSTRAP;
D O I
10.1111/2041-210X.12242
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Presence-only records may provide data on the distributions of rare species, but commonly suffer from large, unknown biases due to their typically haphazard collection schemes. Presence-absence or count data collected in systematic, planned surveys are more reliable but typically less abundant. We proposed a probabilistic model to allow for joint analysis of presence-only and survey data to exploit their complementary strengths. Our method pools presence-only and presence-absence data for many species and maximizes a joint likelihood, simultaneously estimating and adjusting for the sampling bias affecting the presence-only data. By assuming that the sampling bias is the same for all species, we can borrow strength across species to efficiently estimate the bias and improve our inference from presence-only data. We evaluate our model's performance on data for 36 eucalypt species in south-eastern Australia. We find that presence-only records exhibit a strong sampling bias towards the coast and towards Sydney, the largest city. Our data-pooling technique substantially improves the out-of-sample predictive performance of our model when the amount of available presence-absence data for a given species is scarce If we have only presence-only data and no presence-absence data for a given species, but both types of data for several other species that suffer from the same spatial sampling bias, then our method can obtain an unbiased estimate of the first species' geographic range.
引用
收藏
页码:424 / 438
页数:15
相关论文
共 41 条
[1]   Comparative interpretation of count, presence-absence and point methods for species distribution models [J].
Aarts, Geert ;
Fieberg, John ;
Matthiopoulos, Jason .
METHODS IN ECOLOGY AND EVOLUTION, 2012, 3 (01) :177-187
[2]  
[Anonymous], 2005, P 18 INT C NEUR INF
[3]  
[Anonymous], 1989, GEN LINEAR MODELS
[4]  
[Anonymous], 1996, J ECON LIT
[5]   Spatial logistic regression and change-of-support in Poisson point processes [J].
Baddeley, A. ;
Berman, M. ;
Fisher, N. I. ;
Hardegen, A. ;
Milne, R. K. ;
Schuhmacher, D. ;
Shah, R. ;
Turner, R. .
ELECTRONIC JOURNAL OF STATISTICS, 2010, 4 :1151-1201
[6]   Point pattern modelling for degraded presence-only data over large regions [J].
Chakraborty, Avishek ;
Gelfand, Alan E. ;
Wilson, Adam M. ;
Latimer, Andrew M. ;
Silander, John A. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2011, 60 :757-776
[7]  
Cressie N.A., 1993, Statistics for Spatial Data, V928
[8]   Accounting for imperfect detection and survey bias in statistical analysis of presence-only data [J].
Dorazio, Robert M. .
GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2014, 23 (12) :1472-1484
[9]   Predicting the Geographic Distribution of a Species from Presence-Only Data Subject to Detection Errors [J].
Dorazio, Robert M. .
BIOMETRICS, 2012, 68 (04) :1303-1312
[10]   A statistical explanation of MaxEnt for ecologists [J].
Elith, Jane ;
Phillips, Steven J. ;
Hastie, Trevor ;
Dudik, Miroslav ;
Chee, Yung En ;
Yates, Colin J. .
DIVERSITY AND DISTRIBUTIONS, 2011, 17 (01) :43-57