An Approach to Integrating a Non-Probability Sample in the Population Census

被引:4
作者
Burakauskaite, Ieva [1 ]
Ciginas, Andrius [1 ]
机构
[1] Vilnius Univ, Inst Data Sci & Digital Technol, Akad Str 4, LT-08412 Vilnius, Lithuania
关键词
population census; auxiliary information; missing at random; propensity score adjustment; inverse probability weighting; semiparametric estimation; doubly robust estimation; variance estimation; composite estimation; INFERENCE;
D O I
10.3390/math11081782
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Population censuses are increasingly using administrative information and sampling as alternatives to collecting detailed data from individuals. Non-probability samples can also be an additional, relatively inexpensive data source, although they require special treatment. In this paper, we consider methods for integrating a non-representative volunteer sample into a population census survey, where the complementary probability sample is drawn from the rest of the population. We investigate two approaches to correcting non-probability sample selection bias: adjustment using propensity scores, which models participation in the voluntary sample, and doubly robust estimation, which has the property of persisting possible misspecification of the latter model. We combine the estimators of population parameters that correct the selection bias with the estimators based on a representative union of both samples. Our analysis shows that the availability of detailed auxiliary information simplifies the applied estimation procedures, which are efficient in the Lithuanian census survey. Our findings also reveal the biased nature of the non-probability sample. For instance, when estimating the proportions of professed religions, smaller religious communities exhibit a higher participation rate than other groups. The combination of estimators corrects such selection bias. Our methodology for combining the voluntary and probability samples can be applied to other sample surveys.
引用
收藏
页数:14
相关论文
共 29 条
  • [1] Integrating probability and big non-probability samples data to produce Official Statistics
    Golini, Natalia
    Righi, Paolo
    STATISTICAL METHODS AND APPLICATIONS, 2024, 33 (02) : 555 - 580
  • [2] Bayesian Integration for Small Areas by Supplementing a Probability Sample with a Non-probability Sample
    Nandram, Balgobin
    Rao, J. N. K.
    STATISTICS AND APPLICATIONS, 2024, 22 (01): : 343 - 374
  • [3] Dealing with undercoverage for non-probability survey samples
    Chen, Yilin
    Li, Pengfei
    Wu, Changbao
    SURVEY METHODOLOGY, 2023, 49 (02)
  • [4] Doubly robust estimation for non-probability samples with heterogeneity
    Liu, Zhan
    Sun, Yi
    Li, Yong
    Li, Yuanmeng
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2025, 465
  • [5] Pretest estimation in combining probability and non-probability samples
    Gao, Chenyin
    Yang, Shu
    ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (01): : 1492 - 1546
  • [6] The R package NonProbEst for estimation in non-probability surveys
    Rueda, M.
    Ferri-Garcia, R.
    Castro, L.
    R JOURNAL, 2020, 12 (01): : 406 - 418
  • [7] Kernel Weighting for blending probability and non-probability survey samples
    del Mar Rueda, Maria
    Cobo, Beatriz
    Rueda-Sanchez, Jorge Luis
    Ferri-Garcia, Ramon
    Castro-Martin, Luis
    SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2024, 48 (01) : 93 - 124
  • [8] Combining non-probability and probability survey samples through mass imputation
    Kim, Jae Kwang
    Park, Seho
    Chen, Yilin
    Wu, Changbao
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2021, 184 (03) : 941 - 963
  • [9] Estimating General Parameters from Non-Probability Surveys Using Propensity Score Adjustment
    Castro-Martin, Luis
    Rueda, Maria del Mar
    Ferri-Garcia, Ramon
    MATHEMATICS, 2020, 8 (11) : 1 - 14
  • [10] Doubly robust estimation for non-probability samples with modified intertwined probabilistic factors decoupling
    Liu, Zhan
    Zheng, Junbo
    Pan, Yingli
    STATISTICAL ANALYSIS AND DATA MINING, 2023, 16 (03) : 224 - 236