An Approach to Integrating a Non-Probability Sample in the Population Census

被引：4

作者：

Burakauskaite, Ieva ^{[1
]}

Ciginas, Andrius ^{[1
]}

机构：

[1] Vilnius Univ, Inst Data Sci & Digital Technol, Akad Str 4, LT-08412 Vilnius, Lithuania

来源：

MATHEMATICS | 2023年 / 11卷 / 08期

关键词：

population census; auxiliary information; missing at random; propensity score adjustment; inverse probability weighting; semiparametric estimation; doubly robust estimation; variance estimation; composite estimation; INFERENCE;

D O I：

10.3390/math11081782

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Population censuses are increasingly using administrative information and sampling as alternatives to collecting detailed data from individuals. Non-probability samples can also be an additional, relatively inexpensive data source, although they require special treatment. In this paper, we consider methods for integrating a non-representative volunteer sample into a population census survey, where the complementary probability sample is drawn from the rest of the population. We investigate two approaches to correcting non-probability sample selection bias: adjustment using propensity scores, which models participation in the voluntary sample, and doubly robust estimation, which has the property of persisting possible misspecification of the latter model. We combine the estimators of population parameters that correct the selection bias with the estimators based on a representative union of both samples. Our analysis shows that the availability of detailed auxiliary information simplifies the applied estimation procedures, which are efficient in the Lithuanian census survey. Our findings also reveal the biased nature of the non-probability sample. For instance, when estimating the proportions of professed religions, smaller religious communities exhibit a higher participation rate than other groups. The combination of estimators corrects such selection bias. Our methodology for combining the voluntary and probability samples can be applied to other sample surveys.

引用

页数：14

共 29 条

[1] Integrating probability and big non-probability samples data to produce Official Statistics
Golini, Natalia
Righi, Paolo
STATISTICAL METHODS AND APPLICATIONS, 2024, 33 (02) : 555 - 580
[2] Bayesian Integration for Small Areas by Supplementing a Probability Sample with a Non-probability Sample
Nandram, Balgobin
Rao, J. N. K.
STATISTICS AND APPLICATIONS, 2024, 22 (01): : 343 - 374
[3] Dealing with undercoverage for non-probability survey samples
Chen, Yilin
Li, Pengfei
Wu, Changbao
SURVEY METHODOLOGY, 2023, 49 (02)
[4] Doubly robust estimation for non-probability samples with heterogeneity
Liu, Zhan
Sun, Yi
Li, Yong
Li, Yuanmeng
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2025, 465
[5] Pretest estimation in combining probability and non-probability samples
Gao, Chenyin
Yang, Shu
ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (01): : 1492 - 1546
[6] The R package NonProbEst for estimation in non-probability surveys
Rueda, M.
Ferri-Garcia, R.
Castro, L.
R JOURNAL, 2020, 12 (01): : 406 - 418
[7] Kernel Weighting for blending probability and non-probability survey samples
del Mar Rueda, Maria
Cobo, Beatriz
Rueda-Sanchez, Jorge Luis
Ferri-Garcia, Ramon
Castro-Martin, Luis
SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2024, 48 (01) : 93 - 124
[8] Combining non-probability and probability survey samples through mass imputation
Kim, Jae Kwang
Park, Seho
Chen, Yilin
Wu, Changbao
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2021, 184 (03) : 941 - 963
[9] Estimating General Parameters from Non-Probability Surveys Using Propensity Score Adjustment
Castro-Martin, Luis
Rueda, Maria del Mar
Ferri-Garcia, Ramon
MATHEMATICS, 2020, 8 (11) : 1 - 14
[10] Doubly robust estimation for non-probability samples with modified intertwined probabilistic factors decoupling
Liu, Zhan
Zheng, Junbo
Pan, Yingli
STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (03) : 224 - 236

← 1 2 3 →