Robustness of Multiple Imputation Methods for Missing Risk Factor Data from Electronic Medical Records for Observational Studies

被引:0
作者
Sanjoy K. Paul
Joanna Ling
Mayukh Samanta
Olga Montvida
机构
[1] University of Melbourne and Melbourne Health,Melbourne EpiCentre
[2] Royal Melbourne Institute of Technology,undefined
来源
Journal of Healthcare Informatics Research | 2022年 / 6卷
关键词
Missing data; Multiple imputation; Electronic medical records; Comparative effectiveness studies; Real-world evidence;
D O I
暂无
中图分类号
学科分类号
摘要
Evaluating appropriate methodologies for imputation of missing outcome data from electronic medical records (EMRs) is crucial but lacking for observational studies. Using US EMR in people with type 2 diabetes treated over 12 and 24 months with dipeptidyl peptidase 4 inhibitors (DPP-4i, n = 38,483) and glucagon-like peptide 1 receptor agonists (GLP-1RA, n = 8,977), predictors of missingness of disease biomarker (HbA1c) were explored. Robustness of multiple imputation (MI) by chained equations, two-fold MI (MI-2F) and MI with Monte Carlo Markov Chain were compared to complete case analyses for drawing inferences. Compared to younger people (age quartile Q1), those in age quartile Q3 and Q4 were less likely to have missing HbA1c by 25–32% (range of OR CI: 0.55–0.88) at 6-month follow-up and by 26–39% (range of OR CI: 0.50–0.80) at 12-month follow-up. People with HbA1c ≥ 7.5% at baseline were 12% (OR CI: 0.83, 0.93) and 14% (OR CI: 0.77, 0.97) less likely to have missing data at 6-month follow-up in the DPP-4i and GLP-1RA groups, respectively. All imputation methods provided similar HbA1c distributions during follow-up as observed with complete case analyses. The clinical inferences based on absolute change in HbA1c and by proportion of people reducing HbA1c to a clinically acceptable level (≤ 7%) were also similar between imputed data and complete case analyses. MI-2F method provided marginally smaller mean difference between observed and imputed data with relatively smaller standard error of difference, compared to other methods, while evaluating for consistency through artificial within-sample analyses. The established MI techniques can be reliably employed for missing outcome data imputations in large EMR-based relational databases, leading to efficiently designing and drawing robust clinical inferences in pharmaco-epidemiological studies.
引用
收藏
页码:385 / 400
页数:15
相关论文
共 92 条
[1]  
ElZarrad MK(2019)The US Food and Drug Administration’s real-world evidence framework: a commitment for engagement and transparency on real-world evidence Clin Pharmacol Ther 106 33-35
[2]  
Corrigan-Curay J(2019)The future of electronic health records Nature 573 S114-s116
[3]  
Hecht J(2017)Addition of or switch to insulin therapy in people treated with glucagon-like peptide-1 receptor agonists: a real-world study in 66 583 patients Diabetes Obes Metab 19 108-117
[4]  
Montvida O(2018)Long-term sustainability of glycaemic achievements with second-line antidiabetic therapies in patients with type 2 diabetes: a real-world study Diabetes Obes Metab 20 1722-1731
[5]  
Klein K(2019)Learning from longitudinal data in electronic health record and genetic data to improve cardiovascular event prediction Sci Rep 9 717-717
[6]  
Kumar S(2020)Cardiometabolic risk factor control in black and white people in the United States initiating sodium-glucose co-transporter-2 inhibitors: a real-world study Diabetes Obes Metab 22 2384-2397
[7]  
Khunti K(2020)How are missing data in covariates handled in observational time-to-event studies in oncology? A systematic review BMC Med Res Methodol 20 134-106
[8]  
Paul SK(2015)Using multiple imputation to deal with missing data and attrition in longitudinal studies with repeated measures of patient-reported outcomes Clin Epidemiol 7 91-54
[9]  
Montvida O(2014)Statistical challenges in analysing large longitudinal patient-level data: the danger of misleading clinical inferences with imputed data J Indian Soc Agric Stat 68 39-160
[10]  
Shaw JE(2009)Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls BMJ 338 157-442