A method to combine non-probability sample data with probability sample data in estimating spatial means of environmental variables

被引:16
作者
Brus, DJ [1 ]
De Gruijter, JJ [1 ]
机构
[1] Alterra, Green World Res, Dept Soil & Land Use, Wageningen, Netherlands
关键词
bias; declustering; difference estimator; found data; kriging; preferential sampling; regression estimator; spatial mean;
D O I
10.1023/A:1022618406507
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In estimating spatial means of environmental variables of a region from data collected by convenience or purposive sampling, validity of the results can be ensured by collecting additional data through probability sampling. The precision of the pi estimator that uses the probability sample can be increased by interpolating the values at the nonprobability sample points to the probability sample points, and using these interpolated values as an auxiliary variable in the difference or regression estimator. These estimators are (approximately) unbiased, even when the nonprobability sample is severely biased such as in preferential samples. The gain in precision compared to the pi estimator in combination with Simple Random Sampling is controlled by the correlation between the target variable and interpolated variable. This correlation is determined by the size (density) and spatial coverage of the nonprobability sample, and the spatial continuity of the target variable. In a case study the average ratio of the variances of the simple regression estimator and pi estimator was 0.68 for preferential samples of size 150 with moderate spatial clustering, and 0.80 for preferential samples of similar size with strong spatial clustering. In the latter case the simple regression estimator was substantially more precise than the simple difference estimator.
引用
收藏
页码:303 / 317
页数:15
相关论文
共 13 条
[1]  
[Anonymous], 2003, Model Assisted SurveySampling
[2]   Random sampling or geostatistical modelling? Choosing between design-based and model-based sampling strategies for soil (with discussion) [J].
Brus, DJ ;
deGruijter, JJ .
GEODERMA, 1997, 80 (1-2) :1-44
[3]  
Cox LH, 1996, ENVIRONMETRICS, V7, P299, DOI 10.1002/(SICI)1099-095X(199605)7:3&lt
[4]  
299::AID-ENV214&gt
[5]  
3.0.CO
[6]  
2-O
[7]  
HANSEN MH, 1983, J AM STAT ASSOC, V78, P776, DOI 10.2307/2288182
[8]   Resampling from stochastic simulations [J].
Journel, A. G. .
ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 1994, 1 (01) :63-84
[9]   USING FOUND DATA TO AUGMENT A PROBABILITY SAMPLE - PROCEDURE AND CASE-STUDY [J].
OVERTON, JM ;
YOUNG, TC ;
OVERTON, WS .
ENVIRONMENTAL MONITORING AND ASSESSMENT, 1993, 26 (01) :65-83
[10]   SMALL DOMAIN ESTIMATION - A CONDITIONAL ANALYSIS [J].
SARNDAL, CE ;
HIDIROGLOU, MA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1989, 84 (405) :266-275