Real-World Data Versus Probability Surveys for Estimating Health Conditions at the State Level

被引:0
作者
Marker, David A. [1 ]
Hilton, Charity [2 ]
Zelko, Jacob [3 ]
Duke, Jon [2 ]
Rolka, Deborah
Kaufmann, Rachel [4 ]
Boyd, Richard [2 ]
机构
[1] Marker Consulting, Fulton, MD USA
[2] Georgia Tech Res Inst, Atlanta, GA USA
[3] Northeastern Univ, Roux Inst, Observat Hlth Data Sci & Informat Ctr, Portland, ME USA
[4] CDCP, Natl Ctr Chron Dis Prevent & Hlth Promot, Atlanta, GA USA
关键词
All of Us; Bias; Data defect correlation; Diabetes; Electronic health records (EHRs); Nonprobability surveys; US;
D O I
10.1093/jssam/smae036
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Government statistical offices worldwide are under pressure to produce statistics rapidly and for more detailed geographies, to compete with unofficial estimates available from web-based big data sources or from private companies. Commonly suggested sources of improved health information are electronic health records and medical claims data. These data sources are collectively known as real-world data (RWD) because they are generated from routine health care processes, and they are available for millions of patients. It is clear that RWD can provide estimates that are more timely and less expensive to produce-but a key question is whether or not they are very accurate. To test this, we took advantage of a unique health data source that includes a full range of sociodemographic variables and compared estimates using all of those potential weighting variables versus estimates derived when only age and sex are available for weighting (as is common with most RWD sources). We show that not accounting for other variables can produce misleading and quite inaccurate health estimates.
引用
收藏
页码:1515 / 1530
页数:16
相关论文
共 18 条
[1]  
[Anonymous], 2015, Capturing social and behavioral domains and measures in electronic health records, V1st
[2]  
Arora A., 2022, AMSTAT NEWS
[3]   Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification [J].
Beesley, Lauren J. ;
Mukherjee, Bhramar .
BIOMETRICS, 2022, 78 (01) :214-226
[4]   Unrepresentative big surveys significantly overestimated US vaccine uptake [J].
Bradley, Valerie C. ;
Kuriwaki, Shiro ;
Isakov, Michael ;
Sejdinovic, Dino ;
Meng, Xiao-Li ;
Flaxman, Seth .
NATURE, 2021, 600 (7890) :695-+
[5]  
Centers for Disease Control and Prevention, 2024, CDC DATA MODERNIZATI
[6]  
Centers for Disease Control and Prevention, 2022, DIABETES DATA STAT
[7]   Diabetes prevalence and diagnosis in US states: Analysis of health surveys [J].
Danaei G. ;
Friedman A.B. ;
Oza S. ;
Murray C.J.L. ;
Ezzati M. .
Population Health Metrics, 7 (1) :16
[8]   Diagnosed and Undiagnosed Diabetes Prevalence by County in the US, 1999-2012 [J].
Dwyer-Lindgren, Laura ;
Mackenbach, Johan P. ;
van Lenthe, Frank J. ;
Flaxman, Abraham D. ;
Mokdad, Ali H. .
DIABETES CARE, 2016, 39 (09) :1556-1562
[9]  
Eurostat, 2019, REG EU 2019 1700 EUR
[10]   Trends in Diabetes Treatment and Control in US Adults, 1999-2018 [J].
Fang, Michael ;
Wang, Dan ;
Coresh, Josef ;
Selvin, Elizabeth .
NEW ENGLAND JOURNAL OF MEDICINE, 2021, 384 (23) :2219-2228