Importance of multi-modal approaches to effectively identify cataract cases from electronic health records

被引:93
|
作者
Peissig, Peggy L. [1 ]
Rasmussen, Luke V. [1 ,2 ]
Berg, Richard L. [1 ]
Linneman, James G. [1 ]
McCarty, Catherine A. [3 ,4 ]
Waudby, Carol [3 ]
Chen, Lin [5 ]
Denny, Joshua C. [6 ,7 ]
Wilke, Russell A.
Pathak, Jyotishman [8 ]
Carrell, David [9 ]
Kho, Abel N. [10 ]
Starren, Justin B. [2 ]
机构
[1] Marshfield Clin Res Fdn, Biomed Informat Res Ctr, Marshfield, WI 54449 USA
[2] Northwestern Univ, Dept Prevent Med, Feinberg Sch Med, Div Hlth & Biomed Informat, Chicago, IL 60611 USA
[3] Marshfield Clin Res Fdn, Ctr Human Genet, Marshfield, WI 54449 USA
[4] Essentia Inst Rural Hlth, Duluth, MN USA
[5] Marshfield Clin Fdn Med Res & Educ, Dept Ophthalmol, Marshfield, WI USA
[6] Vanderbilt Univ, Sch Med, Dept Biomed Informat, Nashville, TN 37212 USA
[7] Vanderbilt Univ, Sch Med, Dept Med, Nashville, TN 37212 USA
[8] Mayo Clin, Dept Hlth Sci Res, Rochester, MN USA
[9] Grp Hlth Res Inst, Seattle, WA USA
[10] Northwestern Univ, Dept Med, Feinberg Sch Med, Chicago, IL 60611 USA
关键词
GENOME-WIDE ASSOCIATION; MEDICAL-RECORDS; VISUAL IMPAIRMENT; POPULATION; PREVALENCE; ADULTS;
D O I
10.1136/amiajnl-2011-000456
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective There is increasing interest in using electronic health records (EHRs) to identify subjects for genomic association studies, due in part to the availability of large amounts of clinical data and the expected cost efficiencies of subject identification. We describe the construction and validation of an EHR-based algorithm to identify subjects with age-related cataracts. Materials and methods We used a multi-modal strategy consisting of structured database querying, natural language processing on free-text documents, and optical character recognition on scanned clinical images to identify cataract subjects and related cataract attributes. Extensive validation on 3657 subjects compared the multi-modal results to manual chart review. The algorithm was also implemented at participating electronic MEdical Records and GEnomics (eMERGE) institutions. Results An EHR-based cataract phenotyping algorithm was successfully developed and validated, resulting in positive predictive values (PPVs) >95%. The multi-modal approach increased the identification of cataract subject attributes by a factor of three compared to single-mode approaches while maintaining high PPV. Components of the cataract algorithm were successfully deployed at three other institutions with similar accuracy. Discussion A multi-modal strategy incorporating optical character recognition and natural language processing may increase the number of cases identified while maintaining similar PPVs. Such algorithms, however, require that the needed information be embedded within clinical documents. Conclusion We have demonstrated that algorithms to identify and characterize cataracts can be developed utilizing data collected via the EHR. These algorithms provide a high level of accuracy even when implemented across multiple EHRs and institutional boundaries.
引用
收藏
页码:225 / 234
页数:10
相关论文
共 50 条
  • [31] A scoring system derived from electronic health records to identify patients at high risk for noninvasive ventilation failure
    Mihaela S. Stefan
    Aruna Priya
    Penelope S. Pekow
    Jay S. Steingrub
    Nicholas S. Hill
    Tara Lagu
    Karthik Raghunathan
    Anusha G. Bhat
    Peter K. Lindenauer
    BMC Pulmonary Medicine, 21
  • [32] A scoring system derived from electronic health records to identify patients at high risk for noninvasive ventilation failure
    Stefan, Mihaela S.
    Priya, Aruna
    Pekow, Penelope S.
    Steingrub, Jay S.
    Hill, Nicholas S.
    Lagu, Tara
    Raghunathan, Karthik
    Bhat, Anusha G.
    Lindenauer, Peter K.
    BMC PULMONARY MEDICINE, 2021, 21 (01)
  • [33] Can antiepileptic efficacy and epilepsy variables be studied from electronic health records? A review of current approaches
    Decker, Barbara M.
    Hill, Chloe E.
    Baldassano, Steven N.
    Khankhanian, Pouya
    SEIZURE-EUROPEAN JOURNAL OF EPILEPSY, 2021, 85 : 138 - 144
  • [34] Validation of Pediatric Diabetes Case Identification Approaches for Diagnosed Cases by Using Information in the Electronic Health Records of a Large Integrated Managed Health Care Organization
    Lawrence, Jean M.
    Black, Mary Helen
    Zhang, Jian L.
    Slezak, Jeff M.
    Takhar, Harpreet S.
    Koebnick, Corinna
    Mayer-Davis, Elizabeth J.
    Zhong, Victor W.
    Dabelea, Dana
    Hamman, Richard F.
    Reynolds, Kristi
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2014, 179 (01) : 27 - 38
  • [35] MixEHR-Guided: A guided multi-modal topic modeling approach for large-scale automatic phenotyping using the electronic health record
    Ahuja, Yuri
    Zou, Yuesong
    Verma, Aman
    Buckeridge, David
    Li, Yue
    Journal of Biomedical Informatics, 2022, 134
  • [36] Using Multi-Modal Electronic Health Record Data for the Development and Validation of Risk Prediction Models for Long COVID Using the Super Learner Algorithm
    Jin, Weijia
    Hao, Wei
    Shi, Xu
    Fritsche, Lars G.
    Salvatore, Maxwell
    Admon, Andrew J.
    Friese, Christopher R.
    Mukherjee, Bhramar
    JOURNAL OF CLINICAL MEDICINE, 2023, 12 (23)
  • [37] MixEHR-Guided: A guided multi-modal topic modeling approach for large-scale automatic phenotyping using the electronic health record
    Ahuja, Yuri
    Zou, Yuesong
    Verma, Aman
    Buckeridge, David
    Li, Yue
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 134
  • [38] A surveillance method to identify patients with sepsis from electronic health records in Hong Kong: a single centre retrospective study
    Liu, Ying Zhi
    Chu, Raymond
    Lee, Anna
    Gomersall, Charles David
    Zhang, Lin
    Gin, Tony
    Chan, Matthew T. V.
    Wu, William K. K.
    Ling, Lowell
    BMC INFECTIOUS DISEASES, 2020, 20 (01)
  • [39] Development and Validation of an Algorithm to Accurately Identify Atopic Eczema Patients in Primary Care Electronic Health Records from the UK
    Abuabara, Katrina
    Magyari, Alexa M.
    Hoffstad, Ole
    Jabbar-Lopez, Zarif K.
    Smeeth, Liam
    Williams, Hywel C.
    Gelfand, Joel M.
    Margolis, David J.
    Langan, Sinead M.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2017, 137 (08) : 1655 - 1662
  • [40] A surveillance method to identify patients with sepsis from electronic health records in Hong Kong: a single centre retrospective study
    Ying Zhi Liu
    Raymond Chu
    Anna Lee
    Charles David Gomersall
    Lin Zhang
    Tony Gin
    Matthew T. V. Chan
    William K. K. Wu
    Lowell Ling
    BMC Infectious Diseases, 20