Feature Ranking for Hierarchical Multi-Label Classification with Tree Ensemble Methods

被引:7
作者
Petkovic, Matej [1 ]
Dzeroski, Saso
Kocev, Dragi
机构
[1] Jozef Stefan Inst, Jamova 39, Ljubljana 1000, Slovenia
关键词
hierarchical multi-label classification; feature ranking; ensemble methods; Relief; RELIEFF;
D O I
10.12700/APH.17.10.2020.10.8
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this work, we address the task of feature ranking for hierarchical multi-label classification (HMLC). The task of HMLC concerns problems with multiple binary variables, organized into a hierarchy of target attributes. The goal is to train a model to learn and accurately predict all of them, simultaneously. This task is receiving increasing attention from the research community, due to its wide application potential in text document classification and functional genomics. Here, we propose a group of feature ranking methods based on three established ensemble methods of predictive clustering trees: Bagging, Random Forests and Extra Trees. Predictive clustering trees are a generalization of decision trees, towards predicting structured outputs. Furthermore, we propose to use three scoring functions for calculating the feature importance values: Symbolic, Genie3 and Random Forest. We test the proposed methods on 30 benchmark HMLC datasets, show that Symbolic and Genie3 scores return relevant rankings, that all three scores outperform the HMLC-Relief ranking method and are computed in very time-efficient manner. For each scoring function, we find the most appropriate ensemble method and compare the scores to find the best one.
引用
收藏
页码:129 / 148
页数:20
相关论文
共 50 条
[41]   Leveraging class hierarchy for detecting missing annotations on hierarchical multi-label classification [J].
Romero, Miguel ;
Nakano, Felipe Kenji ;
Finke, Jorge ;
Rocha, Camilo ;
Vens, Celine .
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 152
[42]   CEHMR: Curriculum learning enhanced hierarchical multi-label classification for medication recommendation [J].
Sun, Mengxuan ;
Niu, Jinghao ;
Yang, Xuebing ;
Gu, Yifan ;
Zhang, Wensheng .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 143
[43]   Surj: Ontological Learning for Fast, Accurate, and Robust Hierarchical Multi-label Classification [J].
Yang, Sean T. ;
Howe, Bill .
COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, :1106-1114
[44]   A hierarchical multi-label classification ant colony algorithm for protein function prediction [J].
Otero F.E.B. ;
Freitas A.A. ;
Johnson C.G. .
Memetic Computing, 2010, 2 (3) :165-181
[45]   Deep Hierarchical Multi-label Classification of Chest X-ray Images [J].
Chen, Haomin ;
Miao, Shun ;
Xu, Daguang ;
Hager, Gregory D. ;
Harrison, Adam P. .
INTERNATIONAL CONFERENCE ON MEDICAL IMAGING WITH DEEP LEARNING, VOL 102, 2019, 102 :109-120
[46]   Hierarchical Multi-Label Classification over Ticket Data using Contextual Loss [J].
Zeng, Chunqiu ;
Li, Tao ;
Shwartz, Larisa ;
Grabarnik, Genady Ya. .
2014 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2014,
[47]   Hierarchical Multi-Label Classification With Gene-Environment Interactions in Disease Modeling [J].
Li, Jingmao ;
Zhang, Qingzhao ;
Ma, Shuangge ;
Fang, Kuangnan ;
Xu, Yaqing .
STATISTICS IN MEDICINE, 2025, 44 (3-4)
[48]   Hierarchical multi-label classification with SVMs: A case study in gene function prediction [J].
Vateekul, Peerapon ;
Kubat, Miroslav ;
Sarinnapakorn, Kanoksri .
INTELLIGENT DATA ANALYSIS, 2014, 18 (04) :717-738
[49]   Classifier chains for multi-label classification [J].
Jesse Read ;
Bernhard Pfahringer ;
Geoff Holmes ;
Eibe Frank .
Machine Learning, 2011, 85
[50]   Classifier chains for multi-label classification [J].
Read, Jesse ;
Pfahringer, Bernhard ;
Holmes, Geoff ;
Frank, Eibe .
MACHINE LEARNING, 2011, 85 (03) :333-359