Local Hierarchy-Aware Text-Label Association for Hierarchical Text Classification

被引:0
|
作者
Kumar, Ashish [1 ]
Toshniwal, Durga [1 ]
机构
[1] Indian Inst Technol Roorkee, Dept Comp Sci & Engn, Roorkee, Uttar Pradesh, India
来源
2024 IEEE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, DSAA 2024 | 2024年
关键词
Multi-label classification; NLP; Representation Learning;
D O I
10.1109/DSAA61799.2024.10722840
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical Text Classification (HTC) aims to categorize text data based on a structured label hierarchy, generating predicted labels that form a local hierarchical structure. Previous approaches have employed various methods to integrate text and label semantics, but they often overlooked the importance of local hierarchical context. By considering the local hierarchy, which encapsulates relationships between labels within the context of individual samples, we can enhance the association between text and its related labels. To address this, we propose a Margin Separation Loss (MSL), which explicitly models text-label semantic associations in a local hierarchy-aware manner. We obtain positive labels for each sample using the local hierarchy and employ the global hierarchy to identify corresponding negative labels. Positive and negative pairs are created by pairing the text sample with its positive and negative labels. MSL enforces a margin between positive and negative pairs at each hierarchical level, which ensures that similarity within positive pairs is maximized while similarity within negative pairs is minimized in the embedding space, thereby aligning text representations with their related labels. Building upon this, we introduce the Hierarchical Text-Label Association (HTLA(n)) model, utilizing BERT for text encoding and a customized Graphormer to encode label hierarchy and fusion of text-label embeddings to generate composite representations. Experimental results on benchmark datasets and comparison with existing baselines demonstrate the effectiveness of HTLAn for HTC.
引用
收藏
页码:68 / 77
页数:10
相关论文
共 50 条
  • [1] Hierarchy-Aware and Label Balanced Model for Hierarchical Text Classification
    Zhang, Jun
    Li, Yubin
    Shen, Fanfan
    Xia, Chenxi
    Tan, Hai
    He, Yanxiang
    KNOWLEDGE-BASED SYSTEMS, 2024, 300
  • [2] Hierarchy-aware Label Semantics Matching Network for Hierarchical Text Classification
    Chen, Haibin
    Ma, Qianli
    Lin, Zhenxi
    Yan, Jiangyue
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4370 - 4379
  • [3] Hierarchy-Aware Global Model for Hierarchical Text Classification
    Zhou, Jie
    Ma, Chunping
    Long, Dingkun
    Xu, Guangwei
    Ding, Ning
    Zhang, Haoyu
    Xie, Pengjun
    Liu, Gongshen
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1106 - 1117
  • [4] HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification
    Jain, Vidit
    Rungta, Mukund
    Zhuang, Yuchen
    Yu, Yue
    Wang, Zeyu
    Gao, Mu
    Skolnick, Jeffrey
    Zhang, Chao
    EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, 2024, 1 : 1354 - 1368
  • [5] Modeling Text-Label Alignment for Hierarchical Text Classification
    Kumar, Ashish
    Toshniwal, Durga
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-RESEARCH TRACK, PT VI, ECML PKDD 2024, 2024, 14946 : 163 - 179
  • [6] HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification
    Jain, Vidit
    Rungta, Mukund
    Zhuang, Yuchen
    Yu, Yue
    Wang, Zeyu
    Gao, Mu
    Skolnick, Jeffrey
    Zhang, Chao
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 1354 - 1368
  • [7] HiTIN: Hierarchy-aware Tree Isomorphism Network for Hierarchical Text Classification
    Zhu, He
    Zhang, Chong
    Huang, Junjie
    Wu, Junran
    Xu, Ke
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 7809 - 7821
  • [8] Hierarchy-Aware Bilateral-Branch Network for Imbalanced Hierarchical Text Classification
    Zhao, Jiangjiang
    Lie, Jiyi
    Fukumoto, Fumiyo
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2023, PT II, 2023, 14147 : 143 - 157
  • [9] Instances and Labels: Hierarchy-aware Joint Supervised Contrastive Learning for Hierarchical Multi-Label Text Classification
    Lok, Simon Chi U.
    He, Jie
    Gutierrez-Basulto, Victor
    Pan, Jeff Z.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8858 - 8875
  • [10] A Hierarchy-Aware Approach to the Multiaspect Text Categorization Problem
    Zadrozny, Slawomir
    Kacprzyk, Janusz
    Gajewski, Marek
    RECENT DEVELOPMENTS AND THE NEW DIRECTION IN SOFT-COMPUTING FOUNDATIONS AND APPLICATIONS, 2018, 361 : 49 - 62