Pretest estimation in combining probability and non-probability samples

被引:1
作者
Gao, Chenyin [1 ]
Yang, Shu [1 ]
机构
[1] North Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2023年 / 17卷 / 01期
关键词
Data integration; dynamic borrowing; non-regularity; Pretest estimator; PSEUDO-LIKELIHOOD; REGRESSION; INFERENCE; NONRESPONSE; MODELS; INTEGRATION; ISSUES; WEAK;
D O I
10.1214/23-EJS2137
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multiple heterogeneous data sources are becoming increasingly available for statistical analyses in the era of big data. As an important example in finite-population inference, we develop a unified framework of the test-and-pool approach to general parameter estimation by combining gold-standard probability and non-probability samples. We focus on the case when the study variable is observed in both datasets for estimating the target parameters, and each contains other auxiliary variables. Utilizing the probability design, we conduct a pretest procedure to determine the comparability of the non-probability data with the probability data and decide whether or not to leverage the non-probability data in a pooled analysis. When the probability and non-probability data are comparable, our approach combines both data for efficient estimation. Otherwise, we retain only the probability data for estimation. We also characterize the asymptotic distribution of the proposed test-and-pool estimator under a local alternative and provide a data-adaptive procedure to select the critical tuning parameters that target the smallest mean square error of the test -and-pool estimator. Lastly, to deal with the non-regularity of the test-and -pool estimator, we construct a robust confidence interval that has a good finite-sample coverage property.
引用
收藏
页码:1492 / 1546
页数:55
相关论文
共 50 条
  • [41] BLENDING PROBABILITY AND NONPROBABILITY SAMPLES WITH APPLICATIONS TO A SURVEY OF MILITARY CAREGIVERS
    Robbins, Michael W.
    Ghosh-Dastidar, Bonnie
    Ramchand, Rajeev
    JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2021, 9 (05) : 1114 - 1145
  • [42] On estimation of optimal treatment regimes for maximizing t-year survival probability
    Jiang, Runchao
    Lu, Wenbin
    Song, Rui
    Davidian, Marie
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2017, 79 (04) : 1165 - 1185
  • [43] Design Effects of Multilevel Estimates From National Probability Samples
    Stapleton, Laura M.
    Kang, Yoonjeong
    SOCIOLOGICAL METHODS & RESEARCH, 2018, 47 (03) : 430 - 457
  • [44] Dominant transition probability: combining CA-Markov model to simulate land use change
    Wang, Shuqing
    Zheng, Xinqi
    ENVIRONMENT DEVELOPMENT AND SUSTAINABILITY, 2023, 25 (07) : 6829 - 6847
  • [45] Transfer posterior error probability estimation for peptide identification
    Yi, Xinpei
    Gong, Fuzhou
    Fu, Yan
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [46] Clinical model to estimate the pretest probability of malignancy in patients with pulmonary focal Ground-glass Opacity
    Jiang, Long
    Situ, Dongrong
    Lin, Yongbin
    Su, Xiaodong
    Zheng, Yan
    Zhang, Yigong
    Long, Hao
    THORACIC CANCER, 2013, 4 (04) : 380 - 384
  • [47] Collision probability estimation for small unmanned aircraft systems
    Zou, Yiyuan
    Zhang, Honghai
    Zhong, Gang
    Liu, Hao
    Feng, Dikun
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2021, 213
  • [48] Probability Estimation in the Framework of Intuitionistic Fuzzy Evidence Theory
    Song, Yafei
    Wang, Xiaodan
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [49] Default probability estimation via pair copula constructions
    Dalla Valle, Luciana
    De Giuli, Maria Elena
    Tarantola, Claudia
    Manelli, Claudio
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2016, 249 (01) : 298 - 311
  • [50] Inverse Probability Weighted Estimation of Risk Under Representative Interventions in Observational Studies
    Young, Jessica G.
    Logan, Roger W.
    Robins, James M.
    Hernan, Miguel A.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) : 938 - 947