A perceptron for detecting the preferential sampling of locations and times chosen to monitor a spatio-temporal process

被引:9
作者
Watson, Joe [1 ]
机构
[1] Univ British Columbia, Dept Stat, Vancouver, BC, Canada
关键词
Preferential sampling; Spatio-temporal statistics; Point processes; Spatial statistics; Environmental monitoring; Geostatistics; POSTERIOR CONSISTENCY; AIR-POLLUTION; INFERENCE; TESTS; BIAS; MODELS; MARKS;
D O I
10.1016/j.spasta.2021.100500
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
The preferential sampling of locations chosen to observe a spatio-temporal process has been identified as a major problem across multiple fields. Predictions of the process can be severely biased when standard statistical methodologies are applied to preferentially sampled data without adjustment. Detecting preferential sampling is currently a technically demanding task. As a result, the problem is often ignored in data analyses. This paper offers a general, intuitive, and computationally-fast solution. A novel approach for testing if a spatio-temporal dataset was preferentially sampled is presented. We refer to the test as a perceptron as it attempts to capture the numerous factors behind the human decision-making that selected the sampled locations and times. Importantly, the method can also help with the discovery of a set of informative covariates that can sufficiently control for the preferential sampling. The discovery of these covariates can justify the continued use of standard methodologies. A thorough simulation study is presented to demonstrate both the power and validity of the test in various data settings. The test is shown to attain high power for non-Gaussian data with sample sizes as low as 50. Finally, two previously-published case studies are revisited and new insights into the nature of the informative sampling are gained. The test can be implemented with the R package PStestR. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:22
相关论文
共 50 条
[1]   Effective sample size for spatial regression models [J].
Acosta, Jonathan ;
Vallejos, Ronny .
ELECTRONIC JOURNAL OF STATISTICS, 2018, 12 (02) :3147-3180
[2]  
[Anonymous], 2015, STAT SPATIO TEMPORAL
[3]  
[Anonymous], 2008, STAT ANAL MODELLING
[4]   Residual analysis for spatial point processes [J].
Baddeley, A ;
Turner, R ;
Moller, J ;
Hazelton, M .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2005, 67 :617-651
[5]  
Baddeley A. J., 2015, CHAP HALL CRC INTERD, P611
[6]   On two-stage Monte Carlo tests of composite hypotheses [J].
Baddeley, Adrian ;
Hardegen, Andrew ;
Lawrence, Thomas ;
Milne, Robin K. ;
Nair, Gopalan ;
Rakshit, Suman .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 114 :75-87
[7]  
BESAG J, 1974, J ROY STAT SOC B MET, V36, P192
[8]  
Bivand R. S., 2013, APPL SPATIAL DATA AN, DOI 10.1007/978-1-4614-7618-4
[9]  
Blangiardo M, 2015, SPATIAL AND SPATIO-TEMPORAL BAYESIAN MODELS WITH R-INLA, P1, DOI 10.1002/9781118950203
[10]   On posterior consistency in nonparametric regression problems [J].
Choi, Taeryon ;
Schervish, Mark J. .
JOURNAL OF MULTIVARIATE ANALYSIS, 2007, 98 (10) :1969-1987