Automated Identification of Aspirin-Exacerbated Respiratory Disease Using Natural Language Processing and Machine Learning: Algorithm Development and Evaluation Study

被引:1
作者
Pongdee, Thanai [1 ]
Larson, Nicholas B. [2 ]
Divekar, Rohit [1 ]
Bielinski, Suzette J. [3 ]
Liu, Hongfang [4 ]
Moon, Sungrim [4 ]
机构
[1] Mayo Clin, Div Allerg Dis, 200 First St SW, Rochester, MN 55905 USA
[2] Mayo Clin, Dept Quantitat Hlth Sci, Div Clin Trials & Biostat, Rochester, MN USA
[3] Mayo Clin, Dept Quantitat Hlth Sci, Div Epidemiol, Rochester, MN USA
[4] Mayo Clin, Dept Artificial Intelligence & Informat, Rochester, MN USA
来源
JMIR AI | 2023年 / 2卷
关键词
aspirin exacerbated respiratory disease; natural language processing; electronic health record; identification; machine learning; aspirin; asthma; respiratory illness; artificial intelligence; natural language processing algorithm; RHINOSINUSITIS; RHINITIS; HISTORY; ASTHMA;
D O I
10.2196/44191
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Aspirin-exacerbated respiratory disease (AERD) is an acquired inflammatory condition characterized by the presence of asthma, chronic rhinosinusitis with nasal polyposis, and respiratory hypersensitivity reactions on ingestion of aspirin or other nonsteroidal anti-inflammatory drugs (NSAIDs). DespiteAERD having a classic constellation of symptoms, the diagnosis is often overlooked, with an average of greater than 10 years between the onset of symptoms and diagnosis of AERD. Without a diagnosis, individuals will lack opportunities to receive effective treatments, such as aspirin desensitization or biologic medications. Objective: Our aim was to develop a combined algorithm that integrates both natural language processing (NLP) and machine learning (ML) techniquesto identify patientswithAERD froman electronic health record (EHR). Methods: A rule-based decision tree algorithm incorporating NLP-based features was developed using clinical documents from the EHRat Mayo Clinic. From clinical notes, using NLP techniques, 7 features were extracted that included thefollowing: AERD, asthma, NSAID allergy, nasal polyps, chronic sinusitis, elevated urine leukotriene E4 level, and documented no-NSAID allergy. MedTagger was used to extract these 7 features from the unstructured clinical text given a set of keywords and patterns based on the chart review of 2 allergy and immunology experts for AERD. The status of each extracted feature was quantified by assigning the frequency of its occurrence in clinical documents per subject. We optimized the decision tree classifier's hyperparameters cutoff threshold on the training set to determine the representative feature combination to discriminate AERD. We then evaluated the resulting model on the test set. Results: The AERD algorithm, which combines NLP and ML techniques, achieved an area under the receiver operating characteristic curve score, sensitivity, and specificity of 0.86 (95% CI 0.78-0.94), 80.00 (95% CI 70.82-87.33), and 88.00 (95% CI 79.98-93.64) for the testset, respectively. Conclusions:We developed a promising AERD algorithm that needs further refinement to improveAERD diagnosis. Continued development of NLP and ML technologies has the potential to reduce diagnostic delays for AERD and improve the health of our patients.
引用
收藏
页数:7
相关论文
共 15 条
[1]   Natural language processing of clinical notes for identification of critical limb ischemia [J].
Afzal, Naveed ;
Mallipeddi, Vishnu Priya ;
Sohn, Sunghwan ;
Liu, Hongfang ;
Chaudhry, Rajeev ;
Scott, Christopher G. ;
Kullo, Iftikhar J. ;
Arruda-Olson, Adelaide M. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2018, 111 :83-89
[2]   The natural history and clinical characteristics of aspirin-exacerbated respiratory disease [J].
Berges-Gimeno, MP ;
Simon, RA ;
Stevenson, DD .
ANNALS OF ALLERGY ASTHMA & IMMUNOLOGY, 2002, 89 (05) :474-478
[3]  
Breiman L., 1984, CLASSIFICATION REGRE, DOI DOI 10.1201/9781315139470
[4]   Immunology-based recommendations for available and upcoming biologics in aspirin-exacerbated respiratory disease [J].
Buchheit, Kathleen M. ;
Laidlaw, Tanya M. ;
Levy, Joshua M. .
JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2021, 148 (02) :348-350
[5]   Automated identification of an aspirin-exacerbated respiratory disease cohort [J].
Cahill, Katherine N. ;
Johns, Christina B. ;
Cui, Jing ;
Wickner, Paige ;
Bates, David W. ;
Laidlaw, Tanya M. ;
Beeler, Patrick E. .
JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2017, 139 (03) :819-+
[6]   Diagnostic Utility of Urinary LTE4 in Asthma, Allergic Rhinitis, Chronic Rhinosinusitis, Nasal Polyps, and Aspirin Sensitivity [J].
Divekar, Rohit ;
Hagan, John ;
Rank, Matthew ;
Park, Miguel ;
Volcheck, Gerald ;
O'Brien, Erin ;
Meeusen, Jeffrey ;
Kita, Hirohito ;
Butterfield, Joseph .
JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY-IN PRACTICE, 2016, 4 (04) :665-670
[7]   Clinical evaluation and diagnosis of aspirin-exacerbated respiratory disease [J].
Haque, Rubaiyat ;
White, Andrew A. ;
Jackson, David J. ;
Hopkins, Claire .
JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2021, 148 (02) :283-291
[8]   A Framework for Augmented Intelligence in Allergy and Immunology Practice and Research-A Work Group Report of the AAAAI Health Informatics, Technology, and Education Committee [J].
Khoury, Paneez ;
Srinivasan, Renganathan ;
Kakumanu, Sujani ;
Ochoa, Sebastian ;
Keswani, Anjeni ;
Sparks, Rachel ;
Rider, Nicholas L. .
JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY-IN PRACTICE, 2022, 10 (05) :1178-1188
[9]   Tolerance of daily low-dose aspirin does not preclude aspirin-exacerbated respiratory disease [J].
Lee-Sarwar, Kathleen ;
Johns, Christina ;
Laidlaw, Tanya M. ;
Cahill, Katherine N. .
JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY-IN PRACTICE, 2015, 3 (03) :449-451
[10]   Identifying Information Gaps in Electronic Health Records by Using Natural Language Processing: Gynecologic Surgery History Identification [J].
Moon, Sungrim ;
Carlson, Luke A. ;
Moser, Ethan D. ;
Kshatriya, Bhavani Singh Agnikula ;
Smith, Carin Y. ;
Rocca, Walter A. ;
Rocca, Liliana Gazzuola ;
Bielinski, Suzette J. ;
Liu, Hongfang ;
Larson, Nicholas B. .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (01)