Asymptotic efficiency of the calibration estimator in a high-dimensional data setting

Cited by: 9
Authors
Chauvet, Guillaume [1]
Goga, Camelia [2]
Affiliations
[1] Ensai Irmar, Campus Ker Lann, F-35170 Bruz, France
[2] Univ Bourgogne Franche Comte, Lab Mathemat Besancon, 16 Route Gray, F-25000 Besancon, France
Keywords
Calibrated weights; GREG estimator; Variable selection; Survey sampling; Parameters
DOI
10.1016/j.jspi.2021.07.011
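Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract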
In finite population survey sampling, auxiliary information is commonly used to improve on Horvitz-Thompson estimators, and calibration has been used extensively by national statistical agencies over the last decades for that purpose. The method makes estimators consistent with known totals of auxiliary variables and reduces variance when the calibration variables are explanatory for the variable of interest. High-dimensional auxiliary data sets are no longer unusual, and adding too many calibration variables may increase the variance of calibration estimators. In this paper we study the asymptotic efficiency of the calibration estimator with high-dimensional auxiliary data sets, and we prove that it may suffer from an additional variability that may be non-negligible under certain conditions. We suggest a bootstrap criterion for choosing the calibration variables. A short simulation study shows that the proposed method may lead to a more parsimonious set of calibration variables, with associated weights of smaller variation and no variance inflation. (C) 2021 Elsevier B.V. All rights reserved.
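To illustrate what "consistent with known totals" means, the following is a minimal sketch of linear (chi-square distance) calibration, the case that yields the GREG estimator. It is not the authors' code: the function names, the toy design weights, and the population totals are all assumptions chosen for illustration. The calibrated weights are w_k = d_k (1 + x_k' lambda), with lambda solving (sum_k d_k x_k x_k') lambda = t_x - sum_k d_k x_k.

```python
from typing import List, Sequence


def solve_linear_system(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the inputs are left untouched.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular calibration system")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def linear_calibration_weights(
    d: Sequence[float],
    x: Sequence[Sequence[float]],
    totals: Sequence[float],
) -> List[float]:
    """Return calibrated weights w_k = d_k * (1 + x_k' lambda).

    d      : design weights of the sampled units (hypothetical values below)
    x      : auxiliary vectors x_k of the sampled units
    totals : known population totals t_x of the auxiliary variables
    """
    p = len(totals)
    # Weighted cross-product matrix sum_k d_k x_k x_k'.
    t_mat = [
        [sum(dk * xk[i] * xk[j] for dk, xk in zip(d, x)) for j in range(p)]
        for i in range(p)
    ]
    # Horvitz-Thompson estimates of the auxiliary totals.
    ht = [sum(dk * xk[i] for dk, xk in zip(d, x)) for i in range(p)]
    gap = [totals[i] - ht[i] for i in range(p)]
    lam = solve_linear_system(t_mat, gap)
    return [
        dk * (1.0 + sum(l * xi for l, xi in zip(lam, xk)))
        for dk, xk in zip(d, x)
    ]


# Toy data (assumed): 4 sampled units, an intercept and one auxiliary variable.
design_weights = [10.0, 10.0, 10.0, 10.0]
aux = [[1.0, 2.0], [1.0, 4.0], [1.0, 5.0], [1.0, 9.0]]
known_totals = [40.0, 210.0]  # population size and total of the auxiliary variable

weights = linear_calibration_weights(design_weights, aux, known_totals)
# The calibration estimator of a total of y is then sum_k w_k * y_k.
```

With p auxiliary variables the system solved above is p x p, which gives an informal sense of why a large p relative to the sample size can make the weights, and hence the estimator, more variable.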
Pages: 177-187
Number of pages: 11