Multi-label feature ranking with ensemble methods

被引:0
作者
Matej Petković
Sašo Džeroski
Dragi Kocev
机构
[1] Jožef Stefan Institute,
[2] Jožef Stefan International Postgraduate School,undefined
来源
Machine Learning | 2020年 / 109卷
关键词
Feature ranking; Multi-label classification; Ensemble-based methods; Predictive clustering trees;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose three ensemble-based feature ranking scores for multi-label classification (MLC), which is a generalisation of multi-class classification where the classes are not mutually exclusive. Each of the scores (Symbolic, Genie3 and Random forest) can be computed from three different ensembles of predictive clustering trees: Bagging, Random forest and Extra trees. We extensively evaluate the proposed scores on 24 benchmark MLC problems, using 15 standard MLC evaluation measures. We determine the ranking quality saturation points in terms of the ensemble sizes, for each ranking-ensemble pair, and show that quality rankings can be computed really efficiently (typically 10 or 50 trees suffice). We also show that the proposed feature rankings are relevant and determine the most appropriate ensemble method for every feature ranking score. We empirically prove that the proposed feature ranking scores outperform current state-of-the-art methods in the quality of the rankings (for the majority of the evaluation measures), and in time efficiency. Finally, we determine the best performing feature ranking scores. Taking into account the quality of the rankings first and—in the case of ties—time efficiency, we identify the Genie3 feature ranking score as the optimal one.
引用
收藏
页码:2141 / 2159
页数:18
相关论文
共 50 条
  • [41] Multi-Label Feature Selection using Correlation Information
    Braytee, Ali
    Liu, Wei
    Catchpoole, Daniel R.
    Kennedy, Paul J.
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1649 - 1656
  • [42] Dynamic feature weighting for multi-label classification problems
    Dialameh, Maryam
    Hamzeh, Ali
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2021, 10 (03) : 283 - 295
  • [43] Multi-label feature selection via information gain
    Li, Ling
    Liu, Huawen
    Ma, Zongjie
    Mo, Yuchang
    Duan, Zhengjie
    Zhou, Jiaqing
    Zhao, Jianmin
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8933 : 345 - 355
  • [44] Multi-label methods for prediction with sequential data
    Read, Jesse
    Martino, Luca
    Hollmen, Jaakko
    PATTERN RECOGNITION, 2017, 63 : 45 - 55
  • [45] Using Credal C4.5 for Calibrated Label Ranking in Multi-Label Classification
    Moral-Garcia, Serafin
    Mantas, Carlos J.
    Castellano, Javier G.
    Abellan, Joaquin
    PROCEEDINGS OF THE TWELVETH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITY: THEORIES AND APPLICATIONS, 2021, 147 : 220 - 228
  • [46] Using Credal C4.5 for Calibrated Label Ranking in Multi-Label Classification
    Moral-Garcia, Serafin
    Mantas, Carlos J.
    Castellano, Javier G.
    Abellan, Joaquin
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2022, 147 : 60 - 77
  • [47] Multi-label classification with weighted classifier selection and stacked ensemble
    Xia, Yuelong
    Chen, Ke
    Yang, Yun
    INFORMATION SCIENCES, 2021, 557 : 421 - 442
  • [48] Calibrated k-labelsets for Ensemble Multi-label Classification
    Gharroudi, Ouadie
    Elghazel, Haytham
    Aussem, Alex
    NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 573 - 582
  • [49] Multi-label Classification of Small Samples Using an Ensemble Technique
    Mahdavi-Shahri, Amirreza
    Karimian, Jamil
    Javadi, Azadeh
    Houshmand, Mahboobeh
    26TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2018), 2018, : 1708 - 1713
  • [50] GENERALIZED K-LABELSET ENSEMBLE FOR MULTI-LABEL CLASSIFICATION
    Lo, Hung-Yi
    Lin, Shou-De
    Wang, Hsin-Min
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2061 - 2064