Comparison of machine learning algorithms for the identification of acute exacerbations in chronic obstructive pulmonary disease

被引:39
作者
Wang, Chenshuo [1 ,3 ]
Chen, Xianxiang [1 ,2 ]
Du, Lidong [1 ,2 ]
Zhan, Qingyuan [4 ]
Yang, Ting [4 ]
Fang, Zhen [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Elect, Beijing, Peoples R China
[2] Chinese Acad Sci, Personalized Management Chron Resp Dis, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] China Japan Friendship Hosp, Beijing, Peoples R China
关键词
Pulmonary disease; Chronic obstructive; Exacerbation; Machine learning; FORCED OSCILLATION MEASUREMENTS; COPD EXACERBATIONS; PREDICTION; DIAGNOSIS; SEVERITY; DYNAMICS;
D O I
10.1016/j.cmpb.2019.105267
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objectives: Identifying acute exacerbations in chronic obstructive pulmonary disease (AECOPDs) is of utmost importance for reducing the associated mortality and financial burden. In this research, the authors aimed to develop identification models for AECOPDs and to compare the relative performance of different modeling paradigms to find the best model for this task. Methods: Data were extracted from electronic medical records (EMRs) of patients with chronic obstructive pulmonary disease who admitted to the China-Japan Friendship Hospital between February 2011 and March 2017. Five machine learning algorithms (random forest, support vector machine, logistic regression, K-nearest neighbor and naive Bayes) were used to develop the AECOPDs identification models. Feature selection was performed to find an optimal feature subset. 10-folds cross-validation was used to find the best hyperparameters for each model. The following metrics: area under the receiver operating characteristic curve, sensitivity, specificity, positive predictive value, and negative predictive value were used to evaluate the performance of these models. Results: A total of 303 EMRs (AECOPDs patients:135; None AECOPDs patients: 168) were included in the study. The SVM model obtained the best performance (sensitivity: 0.80, specificity: 0.83, positive predictive valuFFe:0.81, negative predictive value:0.85 and area under the receiver operating characteristic curve: 0.90) after performing feature selection. Conclusions: Our research confirms that the proposed model based on the support vector machine is a powerful tool to identify AECOPDs patients, and it is promising to provide decision support for clinicians when they are struggling to give a confirmed clinical diagnosis. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:8
相关论文
共 52 条
[11]  
[Anonymous], GLOB IN CHRON OBSTR
[12]   Impact of exacerbations on COPD [J].
Anzueto, A. .
EUROPEAN RESPIRATORY REVIEW, 2010, 19 (116) :113-118
[13]  
Breiman L., 2001, IEEE Trans. Broadcast., V45, P5
[14]   Integrated care prevents hospital isations for exacerbations in COPD patients [J].
Casas, A. ;
Troosters, T. ;
Garcia-Aymerich, J. ;
Roca, J. ;
Hernandez, C. ;
Alonso, A. ;
del Pozo, F. ;
de Toledo, P. ;
Anto, J. M. ;
Rodriguez-Roisin, R. ;
Decramer, M. .
EUROPEAN RESPIRATORY JOURNAL, 2006, 28 (01) :123-130
[15]   Standards for the diagnosis and treatment of patients with COPD: a summary of the ATS/ERS position paper [J].
Celli, BR ;
MacNee, W ;
Agusti, A ;
Anzueto, A ;
Berg, B ;
Buist, AS ;
Calverley, PMA ;
Chavannes, N ;
Dillard, T ;
Fahy, B ;
Fein, A ;
Heffner, J ;
Lareau, S ;
Meek, P ;
Martinez, F ;
McNicholas, W ;
Muris, J ;
Austegard, E ;
Pauwels, R ;
Rennard, S ;
Rossi, A ;
Siafakas, N ;
Tiep, B ;
Vestbo, J ;
Wouters, E ;
ZuWallack, R .
EUROPEAN RESPIRATORY JOURNAL, 2004, 23 (06) :932-946
[16]   Development of a personalized diagnostic model for kidney stone disease tailored to acute care by integrating large clinical, demographics and laboratory data: the diagnostic acute care algorithm - kidney stones (DACA-KS) [J].
Chen, Zhaoyi ;
Bird, Victoria Y. ;
Ruchi, Rupam ;
Segal, Mark S. ;
Bian, Jiang ;
Khan, Saeed R. ;
Elie, Marie-Carmelle ;
Prosperi, Mattia .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
[17]   An improved support vector machine-based diabetic readmission prediction [J].
Cui, Shaoze ;
Wang, Dujuan ;
Wang, Yanzhang ;
Yu, Pay-Wen ;
Jin, Yaochu .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2018, 166 :123-135
[18]   Predictors of in-hospital length of stay among cardiac patients: A machine learning approach [J].
Daghistani, Tahani A. ;
Elshawi, Radwa ;
Sakr, Sherif ;
Ahmed, Amjad M. ;
Al-Thwayee, Abdullah ;
Al-Mallah, Mouaz H. .
INTERNATIONAL JOURNAL OF CARDIOLOGY, 2019, 288 :140-147
[19]   Definitions of Exacerbations Does It Really Matter in Clinical Trials on COPD? [J].
Effing, Tanja W. ;
Kerstjens, Huib A. M. ;
Monninkhof, Evelyn M. ;
van der Valk, Paul D. L. P. M. ;
Wouters, Emiel F. M. ;
Postma, Dirkje S. ;
Zielhuis, Gerhard A. ;
van der Palen, Job .
CHEST, 2009, 136 (03) :918-923
[20]   Comparison between logistic regression and machine learning algorithms on survival prediction of traumatic brain injuries [J].
Feng, Jin-zhou ;
Wang, Yu ;
Peng, Jin ;
Sun, Ming-wei ;
Zeng, Jun ;
Jiang, Hua .
JOURNAL OF CRITICAL CARE, 2019, 54 :110-116