Feature Ranking for Hierarchical Multi-Label Classification with Tree Ensemble Methods

被引:7
作者
Petkovic, Matej [1 ]
Dzeroski, Saso
Kocev, Dragi
机构
[1] Jozef Stefan Inst, Jamova 39, Ljubljana 1000, Slovenia
关键词
hierarchical multi-label classification; feature ranking; ensemble methods; Relief; RELIEFF;
D O I
10.12700/APH.17.10.2020.10.8
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this work, we address the task of feature ranking for hierarchical multi-label classification (HMLC). The task of HMLC concerns problems with multiple binary variables, organized into a hierarchy of target attributes. The goal is to train a model to learn and accurately predict all of them, simultaneously. This task is receiving increasing attention from the research community, due to its wide application potential in text document classification and functional genomics. Here, we propose a group of feature ranking methods based on three established ensemble methods of predictive clustering trees: Bagging, Random Forests and Extra Trees. Predictive clustering trees are a generalization of decision trees, towards predicting structured outputs. Furthermore, we propose to use three scoring functions for calculating the feature importance values: Symbolic, Genie3 and Random Forest. We test the proposed methods on 30 benchmark HMLC datasets, show that Symbolic and Genie3 scores return relevant rankings, that all three scores outperform the HMLC-Relief ranking method and are computed in very time-efficient manner. For each scoring function, we find the most appropriate ensemble method and compare the scores to find the best one.
引用
收藏
页码:129 / 148
页数:20
相关论文
共 50 条
  • [21] HmcNet: A General Approach for Hierarchical Multi-Label Classification
    Huang, Wei
    Chen, Enhong
    Liu, Qi
    Xiong, Hui
    Huang, Zhenya
    Tong, Shiwei
    Zhang, Dan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 8713 - 8728
  • [22] The Use of the Label Hierarchy in Hierarchical Multi-label Classification Improves Performance
    Levatic, Jurica
    Kocev, Dragi
    Dzeroski, Saso
    NEW FRONTIERS IN MINING COMPLEX PATTERNS, NFMCP 2013, 2014, 8399 : 162 - 177
  • [23] Hierarchical Multi-Label Classification with Partial Labels and Unknown Hierarchy
    Jo, Suhyeon
    Shin, DongHyeok
    Na, Byeonghu
    Jang, JoonHo
    Moon, Il-Chul
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 1025 - 1034
  • [24] Hierarchical multi-label classification using local neural networks
    Cerri, Ricardo
    Barros, Rodrigo C.
    de Carvalho, Andre C. P. L. F.
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2014, 80 (01) : 39 - 56
  • [25] Inducing Hierarchical Multi-label Classification rules with Genetic Algorithms
    Cerri, Ricardo
    Basgalupp, Marcio P.
    Barros, Rodrigo C.
    de Carvalho, Andre C. P. L. F.
    APPLIED SOFT COMPUTING, 2019, 77 : 584 - 604
  • [26] iSOUP-SymRF: Symbolic feature ranking with random forests in online multi-target regression and multi-label classification
    Osojnik, Aljaz
    Panov, Pance
    Dzeroski, Saso
    MACHINE LEARNING, 2025, 114 (02)
  • [27] A Comparison of Multi-label Feature Selection Methods using the Problem Transformation Approach
    Spolaor, Newton
    Cherman, Everton Alvares
    Monard, Maria Carolina
    Lee, Huei Diana
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2013, 292 : 135 - 151
  • [28] Feature ranking for enhancing boosting-based multi-label text categorization
    Al-Salemi, Bassam
    Ayob, Masri
    Noah, Shahrul Azman Mohd
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 113 : 531 - 543
  • [29] Label Construction for Multi-label Feature Selection
    Spolaor, Newton
    Monard, Maria Carolina
    Tsoumakas, Grigorios
    Lee, Huei Diana
    2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 247 - 252
  • [30] Reduction strategies for hierarchical multi-label classification in protein function prediction
    Ricardo Cerri
    Rodrigo C. Barros
    André C. P. L. F. de Carvalho
    Yaochu Jin
    BMC Bioinformatics, 17