Performance comparison of multi-label learning algorithms on clinical data for chronic diseases

Cited by: 46

Authors:

Zufferey, Damien [1,2]

Hofer, Thomas [1]

Hennebert, Jean [2]

Schumacher, Michael [1]

Ingold, Rolf [2]

Bromuri, Stefano [1]

Affiliations:

[1] Univ Appl Sci & Arts Western Switzerland, Inst Informat Syst, AISLab, CH-3960 Sierre, Switzerland

[2] Univ Fribourg, DIVA Res Grp, Dept Informat, Bd Perolles 90, CH-1700 Fribourg, Switzerland

Source:

COMPUTERS IN BIOLOGY AND MEDICINE | 2015 / Vol. 65

Keywords:

Multi-label learning; Complex patient; Chronic disease; Clinical data; Summary statistics; MISSING DATA; SYSTEMATIC ANALYSIS; CLASSIFICATION; SCALE; PREDICTION; DESIGN; WORDS; BAG;

DOI:

10.1016/j.compbiomed.2015.07.017

Chinese Library Classification (CLC) number:

Q [Biological Sciences];

Discipline classification code:

07 ; 0710 ; 09 ;

Abstract:

We are motivated by the issue of classifying diseases of chronically ill patients to assist physicians in their everyday work. Our goal is to provide a performance comparison of state-of-the-art multi-label learning algorithms for the analysis of multivariate sequential clinical data from medical records of patients affected by chronic diseases. The multi-label learning approach appears to be a good candidate for modeling the overlapping medical conditions that are specific to chronically ill patients. The availability of such a comparison study should make it easier to evaluate new algorithms. Regarding the method, we choose a summary statistics approach for processing the sequential clinical data, so that the extracted features maintain an interpretable link to their corresponding medical records. The publicly available MIMIC-II dataset, which contains more than 19,000 patients with chronic diseases, is used in this study. For the comparison we selected the following multi-label algorithms: ML-kNN, AdaBoostMH, binary relevance, classifier chains, HOMER and RAkEL. Regarding the results, binary relevance approaches, despite their elementary design and their independence assumption concerning the chronic illnesses, perform optimally in most scenarios, in particular for the detection of relevant diseases. In addition, binary relevance approaches scale up to large datasets and are easy to learn. However, the RAkEL algorithm, despite its scalability problems when confronted with large datasets, performs well in the scenario that consists of ranking the labels according to the dominant disease of the patient. © 2015 Elsevier Ltd. All rights reserved.
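The two ideas the abstract combines, summary statistics over sequential measurements and binary relevance (one independent binary classifier per disease label), can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the measurement names (`glucose`, `sbp`), the label names, the four toy patients, the choice of statistics, and the nearest-centroid base learner are all hypothetical and are not taken from the paper or from MIMIC-II.

```python
from statistics import mean, pstdev

# Hypothetical label set for the toy example.
LABELS = ["diabetes", "hypertension"]


def summary_features(record):
    """Collapse variable-length sequences into fixed-size summary statistics.

    Each feature stays traceable to one measurement type in the record.
    """
    features = []
    for name in sorted(record):  # fixed order so vectors are comparable
        series = record[name]
        features += [min(series), max(series), mean(series), pstdev(series)]
    return features


class CentroidClassifier:
    """Toy binary base learner: predict the class with the nearest mean vector."""

    def fit(self, X, y):
        self.centroids = {}
        for cls in (0, 1):
            rows = [x for x, target in zip(X, y) if target == cls]
            if rows:
                self.centroids[cls] = [mean(col) for col in zip(*rows)]
        return self

    def predict_one(self, x):
        def sq_dist(centroid):
            return sum((a - b) ** 2 for a, b in zip(x, centroid))
        return min(self.centroids, key=lambda cls: sq_dist(self.centroids[cls]))


def fit_binary_relevance(X, Y):
    """Train one independent binary classifier per label column of Y."""
    n_labels = len(Y[0])
    return [CentroidClassifier().fit(X, [row[j] for row in Y])
            for j in range(n_labels)]


def predict_binary_relevance(models, x):
    """Query every per-label classifier; labels are assumed independent."""
    return [model.predict_one(x) for model in models]


# Invented toy records: two measurement sequences per patient.
records = [
    {"glucose": [5.0, 5.2, 5.1], "sbp": [118, 121, 120]},
    {"glucose": [9.5, 10.1, 9.8], "sbp": [118, 121, 120]},
    {"glucose": [5.0, 5.2, 5.1], "sbp": [155, 160, 158]},
    {"glucose": [9.5, 10.1, 9.8], "sbp": [155, 160, 158]},
]
Y = [[0, 0], [1, 0], [0, 1], [1, 1]]  # columns follow LABELS

X = [summary_features(r) for r in records]
models = fit_binary_relevance(X, Y)

new_patient = {"glucose": [9.7, 10.0], "sbp": [156, 161]}
prediction = predict_binary_relevance(models, summary_features(new_patient))
print(dict(zip(LABELS, prediction)))
```

Because each label gets its own classifier, training cost grows linearly with the number of labels, which is one plausible reading of why the abstract describes binary relevance as scaling to large datasets. The toy features are left unscaled for brevity; a real pipeline would normally standardize them before using a distance-based learner.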


Pages: 34-43

Page count: 10
