Performance comparison of multi-label learning algorithms on clinical data for chronic diseases

Cited by: 46
Authors
Zufferey, Damien [1, 2]
Hofer, Thomas [1]
Hennebert, Jean [2]
Schumacher, Michael [1]
Ingold, Rolf [2]
Bromuri, Stefano [1]
Affiliations
[1] Univ Appl Sci & Arts Western Switzerland, Inst Informat Syst, AISLab, CH-3960 Sierre, Switzerland
[2] Univ Fribourg, DIVA Res Grp, Dept Informat, Bd Perolles 90, CH-1700 Fribourg, Switzerland
Keywords
Multi-label learning; Complex patient; Chronic disease; Clinical data; Summary statistics; MISSING DATA; SYSTEMATIC ANALYSIS; CLASSIFICATION; SCALE; PREDICTION; DESIGN; WORDS; BAG;
DOI
10.1016/j.compbiomed.2015.07.017
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
We are motivated by the problem of classifying the diseases of chronically ill patients in order to assist physicians in their everyday work. Our goal is to provide a performance comparison of state-of-the-art multi-label learning algorithms for the analysis of multivariate sequential clinical data from the medical records of patients affected by chronic diseases. The multi-label learning approach appears to be a good candidate for modeling the overlapping medical conditions typical of chronically ill patients. The availability of such a comparison study should make it easier to evaluate new algorithms. As for the method, we choose a summary statistics approach to process the sequential clinical data, so that the extracted features maintain an interpretable link to their corresponding medical records. The publicly available MIMIC-II dataset, which contains more than 19,000 patients with chronic diseases, is used in this study. For the comparison we selected the following multi-label algorithms: ML-kNN, AdaBoostMH, binary relevance, classifier chains, HOMER and RAkEL. Regarding the results, binary relevance approaches, despite their elementary design and their assumption that the chronic illnesses are independent, perform best in most scenarios, in particular for the detection of relevant diseases. In addition, binary relevance approaches scale up to large datasets and are easy to train. However, the RAkEL algorithm, despite its scalability problems when confronted with large datasets, performs well in the scenario that consists of ranking the labels according to the dominant disease of the patient. (C) 2015 Elsevier Ltd. All rights reserved.
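As a rough illustration of the two ideas named in the abstract (summary-statistics features and binary relevance), the following minimal sketch may help. It is not the authors' implementation: the choice of statistics (mean, standard deviation, min, max), the nearest-centroid base learner, the variable names `glucose` and `heart_rate`, the two labels, and all values are hypothetical toy assumptions, and nothing here comes from MIMIC-II.

```python
from statistics import mean, pstdev

def summarize(series):
    """Collapse one variable-length series into four interpretable features."""
    return [mean(series), pstdev(series), min(series), max(series)]

def featurize(record):
    """Concatenate the summary statistics of every variable, in sorted name order."""
    features = []
    for name in sorted(record):
        features.extend(summarize(record[name]))
    return features

class CentroidClassifier:
    """Toy binary base learner (an assumption): predict the class with the nearer centroid.
    Requires at least one training example of each class."""
    def fit(self, X, y):
        self.centroids = {}
        for cls in (0, 1):
            rows = [x for x, label in zip(X, y) if label == cls]
            self.centroids[cls] = [mean(col) for col in zip(*rows)]
        return self

    def predict_one(self, x):
        def sq_dist(c):
            return sum((a - b) ** 2 for a, b in zip(x, c))
        return min(self.centroids, key=lambda cls: sq_dist(self.centroids[cls]))

class BinaryRelevance:
    """Train one independent binary classifier per label (the independence assumption)."""
    def __init__(self, n_labels):
        self.models = [CentroidClassifier() for _ in range(n_labels)]

    def fit(self, X, Y):
        for j, model in enumerate(self.models):
            model.fit(X, [row[j] for row in Y])
        return self

    def predict_one(self, x):
        return [model.predict_one(x) for model in self.models]

# Hypothetical toy records: each variable is a series of unequal length.
patients = [
    {"glucose": [180, 200, 190], "heart_rate": [70, 72]},
    {"glucose": [95, 100], "heart_rate": [110, 115, 120]},
    {"glucose": [185, 195], "heart_rate": [112, 118]},
    {"glucose": [90, 92, 94], "heart_rate": [68, 70]},
]
# Hypothetical label columns: [high_glucose_condition, high_heart_rate_condition]
Y = [[1, 0], [0, 1], [1, 1], [0, 0]]

X = [featurize(p) for p in patients]
model = BinaryRelevance(n_labels=2).fit(X, Y)
prediction = model.predict_one(featurize({"glucose": [188, 192], "heart_rate": [114, 116]}))
```

Because each feature is a named statistic of a named variable, a prediction can be traced back to the record it came from, which is the interpretability property the abstract refers to. Each label is fitted separately, so overlapping conditions are handled without modeling any dependence between them.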
Pages: 34-43
Number of pages: 10