Approaches to Improving Survey-Weighted Estimates

被引:24
作者
Chen, Qixuan [1 ]
Elliott, Michael R. [2 ]
Haziza, David [3 ]
Yang, Ye [4 ]
Ghosh, Malay [5 ]
Little, Roderick J. A. [2 ]
Sedransk, Joseph [6 ]
Thompson, Mary [7 ]
机构
[1] Columbia Univ, Dept Biostat, New York, NY 10032 USA
[2] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[3] Univ Montreal, Dept Math & Stat, Montreal, PQ H3T 1J4, Canada
[4] Univ Michigan, Ann Arbor, MI 48105 USA
[5] Univ Florida, Dept Stat, Gainesville, FL 32611 USA
[6] Univ Maryland, Joint Program Survey Methodol, College Pk, MD 20742 USA
[7] Univ Waterloo, Dept Stat & Actuarial Sci, Waterloo, ON N2L 3G1, Canada
关键词
Design-based survey weights; finite population survey sampling; inclusion probability; weight modeling; weight trimming; MODEL AVERAGING METHODS; CALIBRATED BAYES; SAMPLING DESIGNS; INFERENCE; POPULATION; SUPERPOPULATION;
D O I
10.1214/17-STS609
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In sample surveys, the sample units are typically chosen using a complex design. This may lead to a selection effect and, if uncorrected in the analysis, may lead to biased inferences. To mitigate the effect on inferences of deviations from a simple random sample a common technique is to use survey weights in the analysis. This article reviews approaches to address possible inefficiency in estimation resulting from such weighting. To improve inferences we emphasize modifications of the basic design based weight, that is, the inverse of a unit's inclusion probability. These techniques include weight trimming, weight modelling and incorporating weights via models for survey variables. We start with an introduction to survey weighting, including methods derived from both the design and model based perspectives. Then we present the rationale and a taxonomy of methods for modifying the weights. We next describe an extensive numerical study to compare these methods. Using as the criteria relative bias, relative mean square error, confidence or credible interval width and coverage probability, we compare the alternative methods and summarize our findings. To supplement this numerical study we use Texas school data to compare the distributions of the weights for several methods. We also make general recommendations, describe limitations of our numerical study and make suggestions for further investigation.
引用
收藏
页码:227 / 248
页数:22
相关论文
共 83 条