Targeted Development and Validation of Clinical Prediction Models in Secondary Care Settings: Opportunities and Challenges for Electronic Health Record Data

被引:0
作者
van Maurik, I. S. [1 ]
Doodeman, H. J. [1 ]
Veeger-Nuijens, B. W. [1 ]
Mohringer, R. P. M. [2 ]
Sudion, D. R. [2 ,3 ]
Jongbloed, W. [4 ]
van Soelen, E. [1 ]
机构
[1] Northwest Clin Alkmaar, Northwest Acad, Pr Julianalaan 14, NL-1815 JE Alkmaar, Netherlands
[2] Northwest Clin Alkmaar, Dept Informat & Commun Technol, Alkmaar, Netherlands
[3] Northwest Clin Alkmaar, Dept Radiol, Alkmaar, Netherlands
[4] Northwest Clin Alkmaar, Dept Clin Chem Hematol & Immunol, Alkmaar, Netherlands
关键词
clinical prediction model; electronic health record; targeted validation; EHR; EMR; prediction models; validation; CPM; secondary care; machine learning; artificial intelligence; AI; RISK;
D O I
10.2196/57035
中图分类号
R-058 [];
学科分类号
摘要
Before deploying a clinical prediction model (CPM) in clinical practice, its performance needs to be demonstrated in the population of intended use. This is also called "targeted validation." Many CPMs developed in tertiary settings may be most useful in secondary care, where the patient case mix is broad and practitioners need to triage patients efficiently. However, since structured or rich datasets of sufficient quality from secondary to assess the performance of a CPM are scarce, a validation gap exists that hampers the implementation of CPMs in secondary care settings. In this viewpoint, we highlight the importance of targeted validation and the use of CPMs in secondary care settings and discuss the potential and challenges of using electronic health record (EHR) data to overcome the existing validation gap. The introduction of software applications for text mining of EHRs allows the generation of structured "big" datasets, but the imperfection of EHRs as a research database requires careful validation of data quality. When using EHR data for the development and validation of CPMs, in addition to widely accepted checklists, we propose considering three additional practical steps: (1) involve a local EHR expert (clinician or nurse) in the data extraction process, (2) perform validity checks on the generated datasets, and (3) provide metadata on how variables were constructed from EHRs. These steps help to generate EHR datasets that are statistically powerful, of sufficient quality and replicable, and enable targeted development and validation of CPMs in secondary care settings. This approach can fill a major gap in prediction modeling research and appropriately advance CPMs into clinical practice.
引用
收藏
页数:6
相关论文
共 21 条
  • [1] Collins GS, 2015, ANN INTERN MED, V162, P55, DOI [10.1038/bjc.2014.639, 10.1186/s12916-014-0241-z, 10.1016/j.jclinepi.2014.11.010, 10.1136/bmj.g7594, 10.7326/M14-0697, 10.1002/bjs.9736, 10.7326/M14-0698, 10.1016/j.eururo.2014.11.025]
  • [2] Comparing health status between patients with COPD in primary, secondary and tertiary care
    de Klein, Mieke M.
    Peters, Jeannette B.
    van't Hul, Alex J.
    Akkermans, Reinier P.
    in't Veen, Johannes C.
    Vercoulen, Jan H.
    Bischoff, Erik W.
    Schermer, Tjard R.
    [J]. NPJ PRIMARY CARE RESPIRATORY MEDICINE, 2020, 30 (01)
  • [3] Ehrenstein V, 2019, Tools and Technologies for Registry Interoperability, Registries ForEvaluating Patient Outcomes: A User's Guide, Vthird
  • [4] Evans R S, 2016, Yearb Med Inform, VSuppl 1, pS48, DOI 10.15265/IYS-2016-s006
  • [5] CHARMS and PROBAST at your fingertips: a template for data extraction and risk of bias assessment in systematic reviews of predictive models
    Fernandez-Felix, Borja M.
    Lopez-Alcalde, Jesus
    Roque, Marta
    Muriel, Alfonso
    Zamora, Javier
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2023, 23 (01)
  • [6] Heins M., 2022, ZORG DOOR HUISARTS N
  • [7] Electronic Health Record-Triggered Research Infrastructure Combining Real-world Electronic Health Record Data and Patient-Reported Outcomes to Detect Benefits, Risks, and Impact of Medication: Development Study
    Hek, Karin
    Rolfes, Lean
    van Puijenbroek, Eugene P.
    Flinterman, Linda E.
    Vorstenbosch, Saskia
    van Dijk, Liset
    Verheij, Robert A.
    [J]. JMIR MEDICAL INFORMATICS, 2022, 10 (03)
  • [8] Natural Language Processing in Electronic Health Records in relation to healthcare decision-making: A systematic review
    Hossain, Elias
    Rana, Rajib
    Higgins, Niall
    Soar, Jeffrey
    Barua, Prabal Datta
    Pisani, Anthony R.
    Turner, Kathryn
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 155
  • [9] Cohort design and natural language processing to reduce bias in electronic health records research
    Khurshid, Shaan
    Reeder, Christopher
    Harrington, Lia X.
    Singh, Pulkit
    Sarma, Gopal
    Friedman, Samuel F.
    Di Achille, Paolo
    Diamant, Nathaniel
    Cunningham, Jonathan W.
    Turner, Ashby C.
    Lau, Emily S.
    Haimovich, Julian S.
    Al-Alusi, Mostafa A.
    Wang, Xin
    Klarqvist, Marcus D. R.
    Ashburner, Jeffrey M.
    Diedrich, Christian
    Ghadessi, Mercedeh
    Mielke, Johanna
    Eilken, Hanna M.
    McElhinney, Alice
    Derix, Andrea
    Atlas, Steven J.
    Ellinor, Patrick T.
    Philippakis, Anthony A.
    Anderson, Christopher D.
    Ho, Jennifer E.
    Batra, Puneet
    Lubitz, Steven A.
    [J]. NPJ DIGITAL MEDICINE, 2022, 5 (01)
  • [10] Numeric, verbal, and visual formats of conveying health risk: Suggested best practices and future recommendations
    Lipkus, Isaac M.
    [J]. MEDICAL DECISION MAKING, 2007, 27 (05) : 696 - 713