Examining the impact of data quality and completeness of electronic health records on predictions of patients' risks of cardiovascular disease

被引:15
|
作者
Li, Yan [1 ]
Sperrin, Matthew [1 ]
Martin, Glen P. [1 ]
Ashcroft, Darren M. [2 ,3 ]
van Staa, Tjeerd Pieter [1 ,4 ,5 ]
机构
[1] Univ Manchester, Manchester Acad Hlth Sci Ctr, Hlth E Res Ctr, Fac Biol Med & Hlth,Farr Inst,Sch Hlth Sci, Oxford Rd, Manchester M13 9PL, Lancs, England
[2] Univ Manchester, Fac Biol Med & Hlth, Sch Hlth Sci, Ctr Pharmacoepidemiol & Drug Safety, Oxford Rd, Manchester M13 9PL, Lancs, England
[3] Univ Manchester, Fac Biol Med & Hlth, Sch Hlth Sci, NIHR Greater Manchester Patient Safety Translat R, Oxford Rd, Manchester M13 9PL, Lancs, England
[4] Univ Utrecht, Utrecht Inst Pharmaceut Sci, Utrecht, Netherlands
[5] British Lib, Alan Turing Inst, London, England
关键词
Electronic health records; QRISK; Practice variability; Statistical frailty model; CVD risk prediction; Random slope model; 10-YEAR RISK; MODELS;
D O I
10.1016/j.ijmedinf.2019.104033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: To assess the extent of variation of data quality and completeness of electronic health records and impact on the robustness of risk predictions of incident cardiovascular disease (CVD) using a risk prediction tool that is based on routinely collected data (QRISK3). Design: Longitudinal cohort study. Settings: 392 general practices (including 3.6 million patients) linked to hospital admission data. Methods: Variation in data quality was assessed using Saez's stability metrics quantifying outlyingness of each practice. Statistical frailty models evaluated whether accuracy of QRISK3 predictions on individual predictions and effects of overall risk factors (linear predictor) varied between practices. Results: There was substantial heterogeneity between practices in CVD incidence unaccounted for by QRISK3. In the lowest quintile of statistical frailty, a QRISK3 predicted risk of 10 % for female was in a range between 7.1 % and 9.0 % when incorporating practice variability into the statistical frailty models; for the highest quintile, this was 10.9%-16.4%. Data quality (using Saez metrics) and completeness were comparable across different levels of statistical frailty. For example, recording of missing information on ethnicity was 55.7 %, 62.7 %, 57.8 %, 64.8 % and 62.1 % for practices from lowest to highest quintiles of statistical frailty respectively. The effects of risk factors did not vary between practices with little statistical variation of beta coefficients. Conclusions: The considerable unmeasured heterogeneity in CVD incidence between practices was not explained by variations in data quality or effects of risk factors. QRISK3 risk prediction should be supplemented with clinical judgement and evidence of additional risk factors.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Empirical study of Data Completeness in Electronic Health Records in China
    Liu, Caihua
    Zowghi, Didar
    Talaei-Khoei, Amir
    Jin, Zhi
    PACIFIC ASIA JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2020, 12 (02): : 103 - 128
  • [2] A Study Into the Impact of Data Breaches of Electronic Health Records
    Pilla, Ravi
    Oseni, Taiwo
    Stranieri, Andrew
    PROCEEDINGS OF 2023 AUSTRALIAN COMPUTER SCIENCE WEEK, ACSW 2023, 2023, : 252 - 254
  • [3] The Impact of Electronic Health Records and Meaningful Use on Inpatient Quality
    Trout, Kate E.
    Chen, Li-Wu
    Wilson, Fernando A.
    Tak, Hyo Jung
    Palm, David
    JOURNAL FOR HEALTHCARE QUALITY, 2022, 44 (02) : 15 - 23
  • [4] Detecting Systemic Data Quality Issues in Electronic Health Records
    Ta, Casey N.
    Weng, Chunhua
    MEDINFO 2019: HEALTH AND WELLBEING E-NETWORKS FOR ALL, 2019, 264 : 383 - 387
  • [5] Electronic Health Records vs Medicaid Claims: Completeness of Diabetes Preventive Care Data in Community Health Centers
    DeVoe, Jennifer E.
    Gold, Rachel
    McIntire, Patti
    Puro, Jon
    Chauvie, Susan
    Gallia, Charles A.
    ANNALS OF FAMILY MEDICINE, 2011, 9 (04) : 351 - 358
  • [6] Explore Data Quality Challenges Based on Data Structure of Electronic Health Records
    Liu, Caihua
    Peng, Guochao
    Lan, Chaowang
    Kong, Shufeng
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION, HIMI 2023, PT I, 2023, 14015 : 236 - 247
  • [7] Mining for equitable health: Assessing the impact of missing data in electronic health records
    Getzen, Emily
    Ungar, Lyle
    Mowery, Danielle
    Jiang, Xiaoqian
    Long, Qi
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 139
  • [8] Heterogeneous Graph Embeddings of Electronic Health Records Improve Critical Care Disease Predictions
    Wanyan, Tingyi
    Kang, Martin
    Badgeley, Marcus A.
    Johnson, Kipp W.
    De Freitas, Jessica K.
    Chaudhry, Fayzan F.
    Vaid, Akhil
    Zhao, Shan
    Miotto, Riccardo
    Nadkarni, Girish N.
    Wang, Fei
    Rousseau, Justin
    Azad, Ariful
    Ding, Ying
    Glicksberg, Benjamin S.
    ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2020), 2020, : 14 - 25
  • [9] Electronic health records model to improve the quality of patients' healthcare
    Motsi, Lovemore
    HEALTH SA GESONDHEID, 2024, 29
  • [10] Identification of Patient Safety Risks Associated with Electronic Health Records: A Software Quality Perspective
    Virginio, Luiz A., Jr.
    Marques Ricarte, Ivan Luiz
    MEDINFO 2015: EHEALTH-ENABLED HEALTH, 2015, 216 : 55 - 59