Multi-label feature ranking with ensemble methods

被引:0
|
作者
Matej Petković
Sašo Džeroski
Dragi Kocev
机构
[1] Jožef Stefan Institute,
[2] Jožef Stefan International Postgraduate School,undefined
来源
Machine Learning | 2020年 / 109卷
关键词
Feature ranking; Multi-label classification; Ensemble-based methods; Predictive clustering trees;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose three ensemble-based feature ranking scores for multi-label classification (MLC), which is a generalisation of multi-class classification where the classes are not mutually exclusive. Each of the scores (Symbolic, Genie3 and Random forest) can be computed from three different ensembles of predictive clustering trees: Bagging, Random forest and Extra trees. We extensively evaluate the proposed scores on 24 benchmark MLC problems, using 15 standard MLC evaluation measures. We determine the ranking quality saturation points in terms of the ensemble sizes, for each ranking-ensemble pair, and show that quality rankings can be computed really efficiently (typically 10 or 50 trees suffice). We also show that the proposed feature rankings are relevant and determine the most appropriate ensemble method for every feature ranking score. We empirically prove that the proposed feature ranking scores outperform current state-of-the-art methods in the quality of the rankings (for the majority of the evaluation measures), and in time efficiency. Finally, we determine the best performing feature ranking scores. Taking into account the quality of the rankings first and—in the case of ties—time efficiency, we identify the Genie3 feature ranking score as the optimal one.
引用
收藏
页码:2141 / 2159
页数:18
相关论文
共 50 条
  • [1] Multi-label feature ranking with ensemble methods
    Petkovic, Matej
    Dzeroski, Saso
    Kocev, Dragi
    MACHINE LEARNING, 2020, 109 (11) : 2141 - 2159
  • [2] Feature Ranking for Hierarchical Multi-Label Classification with Tree Ensemble Methods
    Petkovic, Matej
    Dzeroski, Saso
    Kocev, Dragi
    ACTA POLYTECHNICA HUNGARICA, 2020, 17 (10) : 129 - 148
  • [3] Ensemble methods for multi-label classification
    Rokach, Lior
    Schclar, Alon
    Itach, Ehud
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (16) : 7507 - 7523
  • [4] Structuring the Output Space in Multi-label Classification by Using Feature Ranking
    Nikoloski, Stevanche
    Kocev, Dragi
    Dzeroski, Saso
    NEW FRONTIERS IN MINING COMPLEX PATTERNS, NFMCP 2017, 2018, 10785 : 151 - 166
  • [5] Multi-label text classification with an ensemble feature space
    Tandon, Kushagri
    Chatterjee, Niladri
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4425 - 4436
  • [6] HMC-ReliefF: Feature Ranking for Hierarchical Multi-label Classification
    Slavkov, Ivica
    Karcheska, Jana
    Kocev, Dragi
    Dzeroski, Saso
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2018, 15 (01) : 187 - 209
  • [7] Feature Ranking for Multi-target Regression with Tree Ensemble Methods
    Petkovic, Matej
    Dzeroski, Sao
    Kocev, Dragi
    DISCOVERY SCIENCE, DS 2017, 2017, 10558 : 171 - 185
  • [8] Multi-label Selective Ensemble
    Li, Nan
    Jiang, Yuan
    Zhou, Zhi-Hua
    MULTIPLE CLASSIFIER SYSTEMS (MCS 2015), 2015, 9132 : 76 - 88
  • [9] MLCE: A Multi-Label Crotch Ensemble Method for Multi-Label Classification
    Yao, Yuan
    Li, Yan
    Ye, Yunming
    Li, Xutao
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (04)
  • [10] An Ensemble Multi-Label Feature Selection Algorithm Based on Information Entropy
    Li, Shining
    Zhang, Zhenhai
    Duan, Jiaqi
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2014, 11 (04) : 379 - 386