Combining data from two independent surveys: a model-assisted approach

Cited by: 52
Authors
Kim, Jae Kwang [1]
Rao, J. N. K. [2]
Affiliations
[1] Iowa State Univ, Dept Stat, Ames, IA 50011 USA
[2] Carleton Univ, Sch Math & Stat, Ottawa, ON K1S 5B6, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Double sampling; Mass imputation; Synthetic data; Two-phase sampling; MULTIPLE SURVEYS; INFORMATION; REGRESSION; ESTIMATORS; IMPUTATION; SUPERPOPULATION; ERROR;
DOI
10.1093/biomet/asr063
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Combining information from two or more independent surveys is a problem frequently encountered in survey sampling. We consider the case of two independent surveys, where a large sample from survey 1 collects only auxiliary information and a much smaller sample from survey 2 provides information on both the variables of interest and the auxiliary variables. We propose a model-assisted projection method of estimation based on a working model, but the reference distribution is design-based. We generate synthetic or proxy values of a variable of interest by first fitting the working model, relating the variable of interest to the auxiliary variables, to the data from survey 2 and then predicting the variable of interest associated with the auxiliary variables observed in survey 1. The projection estimator of a total is simply obtained from the survey 1 weights and associated synthetic values. We identify the conditions for the projection estimator to be asymptotically unbiased. Domain estimation using the projection method is also considered. Replication variance estimators are obtained by augmenting the synthetic data file for survey 1 with additional synthetic columns associated with the columns of replicate weights. Results from a simulation study are presented.
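The projection steps described in the abstract can be illustrated with a minimal sketch. It assumes a linear working model with an intercept, fitted by survey-weighted least squares; the abstract does not fix a particular model or fitting method, so this is one possible choice, not the authors' implementation. The function name `projection_total` and all variable names and data values are hypothetical.

```python
import numpy as np

def projection_total(x1, w1, x2, y2, w2):
    """Hypothetical sketch of a projection estimate of a population total.

    x1, w1: auxiliary variable and design weights from survey 1 (large sample)
    x2, y2, w2: auxiliary variable, study variable, and weights from survey 2
    """
    # Design matrices with an intercept column (assumed working model: linear).
    X1 = np.column_stack([np.ones(len(x1)), x1])
    X2 = np.column_stack([np.ones(len(x2)), x2])

    # Step 1: fit the working model to survey 2 by weighted least squares.
    # Scaling rows by sqrt(w2) turns the weighted problem into ordinary lstsq.
    sw = np.sqrt(w2)
    beta, *_ = np.linalg.lstsq(X2 * sw[:, None], y2 * sw, rcond=None)

    # Step 2: predict synthetic values for every unit in survey 1.
    y_synth = X1 @ beta

    # Step 3: combine the survey 1 weights with the synthetic values.
    return float(np.sum(w1 * y_synth)), y_synth

# Toy data: y is exactly linear in x, so the fit recovers y = 2 + 3x.
x2 = np.array([0.0, 1.0, 2.0, 3.0])
y2 = 2.0 + 3.0 * x2
w2 = np.full(4, 50.0)
x1 = np.array([1.0, 2.0, 3.0, 4.0])
w1 = np.full(4, 10.0)

total, y_synth = projection_total(x1, w1, x2, y2, w2)
print(total)  # approximately 380 for this noise-free toy data
```

Because the fit includes an intercept and uses the survey 2 weights, the weighted residuals in survey 2 sum to zero; this is a property of weighted least squares, shown here only as an illustration of the kind of condition under which a projection estimator can avoid bias.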
Pages: 85-100
Page count: 16