Performance comparison of multi-label learning algorithms on clinical data for chronic diseases

被引:46
作者
Zufferey, Damien [1 ,2 ]
Hofer, Thomas [1 ]
Hennebert, Jean [2 ]
Schumacher, Michael [1 ]
Ingold, Rolf [2 ]
Bromuri, Stefano [1 ]
机构
[1] Univ Appl Sci & Arts Western Switzerland, Inst Informat Syst, AISLab, CH-3960 Sierre, Switzerland
[2] Univ Fribourg, DIVA Res Grp, Dept Informat, Bd Perolles 90, CH-1700 Fribourg, Switzerland
关键词
Multi-label learning; Complex patient; Chronic disease; Clinical data; Summary statistics; MISSING DATA; SYSTEMATIC ANALYSIS; CLASSIFICATION; SCALE; PREDICTION; DESIGN; WORDS; BAG;
D O I
10.1016/j.compbiomed.2015.07.017
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We are motivated by the issue of classifying diseases of chronically ill patients to assist physicians in their everyday work. Our goal is to provide a performance comparison of state-of-the-art multi-label learning algorithms for the analysis of multivariate sequential clinical data from medical records of patients affected by chronic diseases. As a matter of fact, the multi-label learning approach appears to be a good candidate for modeling overlapped medical conditions, specific to chronically ill patients. With the availability of such comparison study, the evaluation of new algorithms should be enhanced. According to the method, we choose a summary statistics approach for the processing of the sequential clinical data, so that the extracted features maintain an interpretable link to their corresponding medical records. The publicly available MIMIC-II dataset, which contains more than 19,000 patients with chronic diseases, is used in this study. For the comparison we selected the following multi-label algorithms: ML-kNN, AdaBoostMH, binary relevance, classifier chains, HOMER and RAkEL. Regarding the results, binary relevance approaches, despite their elementary design and their independence assumption concerning the chronic illnesses, perform optimally in most scenarios, in particular for the detection of relevant diseases. In addition, binary relevance approaches scale up to large dataset and are easy to learn. However, the RAkEL algorithm, despite its scalability problems when it is confronted to large dataset, performs well in the scenario which consists of the ranking of the labels according to the dominant disease of the patient. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:34 / 43
页数:10
相关论文
共 50 条
  • [41] Lattice and Imbalance Informed Multi-Label Learning
    Sadhukhan, Payel
    Palit, Sarbani
    IEEE ACCESS, 2020, 8 : 7394 - 7407
  • [42] Evolutionary feature weighting to improve the performance of multi-label lazy algorithms
    Reyes, Oscar
    Morell, Carlos
    Ventura, Sebastian
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2014, 21 (04) : 339 - 354
  • [43] A Diagram Retrieval Method with Multi-label Learning
    Fu, Songping
    Lu, Xiaoqing
    Liu, Lu
    Qu, Jingwei
    Tang, Zhi
    DOCUMENT RECOGNITION AND RETRIEVAL XXII, 2015, 9402
  • [44] Improving Multi-Label Learning by Correlation Embedding
    Huang, Jun
    Xu, Qian
    Qu, Xiwen
    Lin, Yaojin
    Zheng, Xiao
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [45] Semi-Supervised Learning for Multi-Label Cardiovascular Diseases Prediction: A Multi-Dataset Study
    Zhou, Rushuang
    Lu, Lei
    Liu, Zijun
    Xiang, Ting
    Liang, Zhen
    Clifton, David A.
    Dong, Yining
    Zhang, Yuan-Ting
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (05) : 3305 - 3320
  • [46] Cost Sensitive Ranking Support Vector Machine for Multi-label Data Learning
    Cao, Peng
    Liu, Xiaoli
    Zhao, Dazhe
    Zaiane, Osmar
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS 2016), 2017, 552 : 244 - 255
  • [47] Local Rademacher Complexity for Multi-Label Learning
    Xu, Chang
    Liu, Tongliang
    Tao, Dacheng
    Xu, Chao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (03) : 1495 - 1507
  • [48] Two stage architecture for multi-label learning
    Madjarov, Gjorgji
    Gjorgjevikj, Dejan
    Dzeroski, Saso
    PATTERN RECOGNITION, 2012, 45 (03) : 1019 - 1034
  • [49] Learning from multi-label data with interactivity constraints: An extensive experimental study
    Nair-Benrekia, Noureddine-Yassine
    Kuntz, Pascale
    Meyer, Frank
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (13) : 5723 - 5736
  • [50] Prototype selection for multi-label data based on label correlation
    Li, Haikun
    Fang, Min
    Li, Hang
    Wang, Peng
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (05) : 2121 - 2130