Multi-label feature ranking with ensemble methods

被引：0

作者：

Matej Petković

Sašo Džeroski

Dragi Kocev

机构：

[1] Jožef Stefan Institute,

[2] Jožef Stefan International Postgraduate School,undefined

来源：

Machine Learning | 2020年 / 109卷

关键词：

Feature ranking; Multi-label classification; Ensemble-based methods; Predictive clustering trees;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this paper, we propose three ensemble-based feature ranking scores for multi-label classification (MLC), which is a generalisation of multi-class classification where the classes are not mutually exclusive. Each of the scores (Symbolic, Genie3 and Random forest) can be computed from three different ensembles of predictive clustering trees: Bagging, Random forest and Extra trees. We extensively evaluate the proposed scores on 24 benchmark MLC problems, using 15 standard MLC evaluation measures. We determine the ranking quality saturation points in terms of the ensemble sizes, for each ranking-ensemble pair, and show that quality rankings can be computed really efficiently (typically 10 or 50 trees suffice). We also show that the proposed feature rankings are relevant and determine the most appropriate ensemble method for every feature ranking score. We empirically prove that the proposed feature ranking scores outperform current state-of-the-art methods in the quality of the rankings (for the majority of the evaluation measures), and in time efficiency. Finally, we determine the best performing feature ranking scores. Taking into account the quality of the rankings first and—in the case of ties—time efficiency, we identify the Genie3 feature ranking score as the optimal one.

引用

页码：2141 / 2159

页数：18

共 50 条

[41] Multi-Label Feature Selection using Correlation Information
Braytee, Ali
Liu, Wei
Catchpoole, Daniel R.
Kennedy, Paul J.
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1649 - 1656
[42] Dynamic feature weighting for multi-label classification problems
Dialameh, Maryam
Hamzeh, Ali
PROGRESS IN ARTIFICIAL INTELLIGENCE, 2021, 10 (03) : 283 - 295
[43] Multi-label feature selection via information gain
Li, Ling
Liu, Huawen
Ma, Zongjie
Mo, Yuchang
Duan, Zhengjie
Zhou, Jiaqing
Zhao, Jianmin
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8933 : 345 - 355
[44] Multi-label methods for prediction with sequential data
Read, Jesse
Martino, Luca
Hollmen, Jaakko
PATTERN RECOGNITION, 2017, 63 : 45 - 55
[45] Using Credal C4.5 for Calibrated Label Ranking in Multi-Label Classification
Moral-Garcia, Serafin
Mantas, Carlos J.
Castellano, Javier G.
Abellan, Joaquin
PROCEEDINGS OF THE TWELVETH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITY: THEORIES AND APPLICATIONS, 2021, 147 : 220 - 228
[46] Using Credal C4.5 for Calibrated Label Ranking in Multi-Label Classification
Moral-Garcia, Serafin
Mantas, Carlos J.
Castellano, Javier G.
Abellan, Joaquin
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2022, 147 : 60 - 77
[47] Multi-label classification with weighted classifier selection and stacked ensemble
Xia, Yuelong
Chen, Ke
Yang, Yun
INFORMATION SCIENCES, 2021, 557 : 421 - 442
[48] Calibrated k-labelsets for Ensemble Multi-label Classification
Gharroudi, Ouadie
Elghazel, Haytham
Aussem, Alex
NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 573 - 582
[49] Multi-label Classification of Small Samples Using an Ensemble Technique
Mahdavi-Shahri, Amirreza
Karimian, Jamil
Javadi, Azadeh
Houshmand, Mahboobeh
26TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2018), 2018, : 1708 - 1713
[50] GENERALIZED K-LABELSET ENSEMBLE FOR MULTI-LABEL CLASSIFICATION
Lo, Hung-Yi
Lin, Shou-De
Wang, Hsin-Min
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2061 - 2064

← 1 2 3 4 5 →