Comparison of machine learning algorithms for the identification of acute exacerbations in chronic obstructive pulmonary disease

被引:39
作者
Wang, Chenshuo [1 ,3 ]
Chen, Xianxiang [1 ,2 ]
Du, Lidong [1 ,2 ]
Zhan, Qingyuan [4 ]
Yang, Ting [4 ]
Fang, Zhen [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Elect, Beijing, Peoples R China
[2] Chinese Acad Sci, Personalized Management Chron Resp Dis, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] China Japan Friendship Hosp, Beijing, Peoples R China
关键词
Pulmonary disease; Chronic obstructive; Exacerbation; Machine learning; FORCED OSCILLATION MEASUREMENTS; COPD EXACERBATIONS; PREDICTION; DIAGNOSIS; SEVERITY; DYNAMICS;
D O I
10.1016/j.cmpb.2019.105267
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objectives: Identifying acute exacerbations in chronic obstructive pulmonary disease (AECOPDs) is of utmost importance for reducing the associated mortality and financial burden. In this research, the authors aimed to develop identification models for AECOPDs and to compare the relative performance of different modeling paradigms to find the best model for this task. Methods: Data were extracted from electronic medical records (EMRs) of patients with chronic obstructive pulmonary disease who admitted to the China-Japan Friendship Hospital between February 2011 and March 2017. Five machine learning algorithms (random forest, support vector machine, logistic regression, K-nearest neighbor and naive Bayes) were used to develop the AECOPDs identification models. Feature selection was performed to find an optimal feature subset. 10-folds cross-validation was used to find the best hyperparameters for each model. The following metrics: area under the receiver operating characteristic curve, sensitivity, specificity, positive predictive value, and negative predictive value were used to evaluate the performance of these models. Results: A total of 303 EMRs (AECOPDs patients:135; None AECOPDs patients: 168) were included in the study. The SVM model obtained the best performance (sensitivity: 0.80, specificity: 0.83, positive predictive valuFFe:0.81, negative predictive value:0.85 and area under the receiver operating characteristic curve: 0.90) after performing feature selection. Conclusions: Our research confirms that the proposed model based on the support vector machine is a powerful tool to identify AECOPDs patients, and it is promising to provide decision support for clinicians when they are struggling to give a confirmed clinical diagnosis. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:8
相关论文
共 52 条
[1]   A new machine learning technique for an accurate diagnosis of coronary artery disease [J].
Abdar, Moloud ;
Ksiazek, Wojciech ;
Acharya, U. Rajendra ;
Tan, Ru-San ;
Makarenkov, Vladimir ;
Plawiak, Pawel .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2019, 179
[2]   Machine learning algorithms and forced oscillation measurements to categorise the airway obstruction severity in chronic obstructive pulmonary disease [J].
Amaral, Jorge L. M. ;
Lopes, Agnaldo J. ;
Faria, Alvaro C. D. ;
Melo, Pedro L. .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2015, 118 (02) :186-197
[3]   Machine learning algorithms and forced oscillation measurements applied to the automatic identification of chronic obstructive pulmonary disease [J].
Amaral, Jorge L. M. ;
Lopes, Agnaldo J. ;
Jansen, Jose M. ;
Faria, Alvaro C. D. ;
Melo, Pedro L. .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2012, 105 (03) :183-193
[4]   An artificial intelligence approach to early predict symptom-based exacerbations of COPD [J].
Angel Fernandez-Granero, Miguel ;
Sanchez-Morillo, Daniel ;
Leon-Jinnenez, Antonio .
BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2018, 32 (03) :778-784
[5]  
[Anonymous], 2014, Combining Pattern Classifiers: Methods and Algorithms, DOI DOI 10.1002/0471660264
[6]  
[Anonymous], 2011, Global strategy for the diagnosis, management, and prevention of COPD, global initiative for chronic obstructive lung disease (GOLD) (Updated 2011)
[7]  
[Anonymous], BRAIN INFORM
[8]  
[Anonymous], P INT C MACH LEARN C
[9]  
[Anonymous], PULMONOLOGY
[10]  
[Anonymous], EC IMPACT COPD COST