Hyperbolic Embeddings for Hierarchical Multi-label Classification

Cited by: 5
Authors
Stepisnik, Tomaz [1, 2]
Kocev, Dragi [1, 2, 3]
Affiliations
[1] Jozef Stefan Inst, Ljubljana, Slovenia
[2] Jozef Stefan Int Postgrad Sch, Ljubljana, Slovenia
[3] Bias Variance Labs Doo, Ljubljana, Slovenia
Source
FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2020) | 2020 / Vol. 12117
Keywords
Hierarchical Multi-label Classification; Hyperbolic embeddings; Ensemble methods; Predictive Clustering Trees
DOI
10.1007/978-3-030-59491-6_7
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Hierarchical multi-label classification (HMC) is a practically relevant machine learning task with applications ranging from text categorization and image annotation to functional genomics. State-of-the-art results for HMC are obtained with ensembles of predictive models, especially ensembles of predictive clustering trees. Predictive clustering trees (PCTs) generalize decision trees to HMC and can be combined into ensembles using techniques such as bagging and random forests. Two major issues influence the performance of HMC methods: (1) the computational bottleneck imposed by the size of the label hierarchy, which can easily reach tens of thousands of labels, and (2) the sparsity of annotations in the label/output space. To address these limitations, we propose an approach that combines graph node embeddings with a specific property of PCTs: descriptive, clustering and target attributes can be specified arbitrarily. We adapt Poincaré hyperbolic node embeddings to obtain low-dimensional label set embeddings, which are then used in place of the original label space to guide PCT construction. Because the embeddings have far fewer dimensions than the label space, this greatly reduces the time needed to construct a tree. The input and output spaces remain the same: the tests in the tree use the original attributes, and the leaves predict the original labels directly. We empirically evaluate the proposed approach on 9 datasets. The results show that our approach dramatically reduces the computational cost of learning and can lead to improved predictive performance.
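To make the idea of a low-dimensional label set embedding concrete, the following is a minimal sketch and not the authors' implementation. It computes the standard Poincaré-ball distance between two points, and it builds a fixed-size vector for one example's label set. The abstract does not say how individual label embeddings are aggregated, so the coordinate-wise mean used here is purely an illustrative assumption. The function names and the toy 2-D coordinates are also hypothetical.

```python
import math


def poincare_distance(u, v):
    """Distance between two points strictly inside the unit ball
    (Poincare ball model of hyperbolic space)."""
    sq_u = sum(x * x for x in u)
    sq_v = sum(x * x for x in v)
    sq_diff = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.acosh(1.0 + 2.0 * sq_diff / ((1.0 - sq_u) * (1.0 - sq_v)))


def label_set_embedding(label_vectors):
    """Assumed aggregation (not taken from the paper): coordinate-wise
    mean of the embeddings of all labels assigned to one example."""
    n = len(label_vectors)
    return [sum(column) / n for column in zip(*label_vectors)]


# Made-up 2-D embeddings for a tiny hierarchy: root -> a -> a.1
toy_embeddings = {
    "root": [0.0, 0.0],
    "a": [0.5, 0.0],
    "a.1": [0.8, 0.1],
}

# One example annotated with a label and all of its ancestors.
example_labels = ["root", "a", "a.1"]
target = label_set_embedding([toy_embeddings[name] for name in example_labels])

# The clustering target has the embedding dimension (2 here),
# regardless of how many labels the hierarchy contains.
print(len(target), target)
print(poincare_distance(toy_embeddings["root"], toy_embeddings["a"]))
```

Under these assumptions, the tree learner would evaluate candidate splits on the short `target` vectors instead of on a binary indicator vector with one entry per label, which is where the reduction in dimensionality described in the abstract comes from.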
Pages: 66-76
Page count: 11