Adjustment for biased sampling using NHANES derived propensity weights

被引:2
作者
Bernstein, Olivia M. [1 ]
Vegetabile, Brian G. [2 ]
Salazar, Christian R. [3 ]
Grill, Joshua D. [3 ,4 ,5 ]
Gillen, Daniel L. [1 ,3 ]
机构
[1] Univ Calif Irvine, Dept Stat, Irvine, CA 92697 USA
[2] RAND Corp, Santa Monica, CA USA
[3] Univ Calif Irvine, Inst Memory Impairments & Neurol Disorders, Irvine, CA USA
[4] Univ Calif Irvine, Dept Psychiat & Human Behav, Irvine, CA 92717 USA
[5] Univ Calif Irvine, Dept Neurobiol & Behav, Irvine, CA USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Biased sample; Convenience sample; Propensity weight; NHANES; COGNITIVE FUNCTION; SCORE; INFERENCE;
D O I
10.1007/s10742-022-00283-x
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The Consent-to-Contact (C2C) registry at the University of California, Irvine collects data from community participants to aid in the recruitment to clinical research studies. Self-selection into the C2C likely leads to bias due in part to enrollees having more years of education relative to the US general population. Salazar et al. (Alzheimer's Dementia Transl Res Clin Interv 6(1):e120023, 2020, https://doi.org/10.1002/trc2.12023) recently used the C2C to examine associations of race/ethnicity with participant willingness to be contacted about research studies. To obtain representative estimates from C2C we use weighted estimation of associations of interest where the weights are related to the probability of self-selection into the convenience sample. The selection probabilities are estimated using data from the National Health and Nutrition Examination Survey (NHANES). We create a combined dataset of C2C and NHANES subjects and evaluate the trade-offs of different approaches (logistic regression, covariate balancing propensity score, entropy balancing, and random forest) for estimating the probability of membership in C2C relative to NHANES. We further propose methods to estimate the variance of parameter estimates that account for uncertainty that arises from estimating propensity weights. Simulation studies explore the impact of propensity weight estimation on uncertainty. We demonstrate the approach by repeating the analysis by Salazar et al. (Alzheimer's Dementia Transl Res Clin Interv 6(1):e120023, 2020, https://doi.org/10.1002/trc2.12023) with the deduced propensity weights for the C2C subjects and contrast the results of the two analyses. This method can be implemented using our estweight package in R available on GitHub.
引用
收藏
页码:21 / 44
页数:24
相关论文
共 49 条
[1]   Generalizing randomized trial findings to a target population using complex survey population data [J].
Ackerman, Benjamin ;
Lesko, Catherine R. ;
Siddique, Juned ;
Susukida, Ryoko ;
Stuart, Elizabeth A. .
STATISTICS IN MEDICINE, 2021, 40 (05) :1101-1120
[2]   Tracking Early Decline in Cognitive Function in Older Individuals at Risk for Alzheimer Disease Dementia The Alzheimer's Disease Cooperative Study Cognitive Function Instrument [J].
Amariglio, Rebecca E. ;
Donohue, Michael C. ;
Marshall, Gad A. ;
Rentz, Dorene M. ;
Salmon, David P. ;
Ferris, Steven H. ;
Karantzoulis, Stella ;
Aisen, Paul S. ;
Sperling, Reisa A. .
JAMA NEUROLOGY, 2015, 72 (04) :446-454
[3]  
[Anonymous], 1997, Survey methodology, DOI 10.1.1.44.5270
[4]  
[Anonymous], 2013, National Health and Nutrition Examination Survey: Analytic Guidelines, 2011-2012
[5]   Estimating the effect of treatment on binary outcomes using full matching on the propensity score [J].
Austin, Peter C. ;
Stuart, Elizabeth A. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (06) :2505-2525
[6]   Multi-Institutional Sharing of Electronic Health Record Data to Assess Childhood Obesity [J].
Bailey, L. Charles ;
Milov, David E. ;
Kelleher, Kelly ;
Kahn, Michael G. ;
Del Beccaro, Mark ;
Yu, Feliciano ;
Richards, Thomas ;
Forrest, Christopher B. .
PLOS ONE, 2013, 8 (06)
[7]  
Bernstein D. S., 2009, Matrix Mathematics: Theory, Facts, and Formulas
[8]  
Bishop Y. M., 2007, Discrete Multivariate Analysis: Theory and Practice
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   MULTIPLE IMPUTATION FOR NONRESPONSE IN SURVEYS - RUBIN,DB [J].
CAMPION, WM .
JOURNAL OF MARKETING RESEARCH, 1989, 26 (04) :485-486