HeteroHTC: Enhancing Hierarchical Text Classification via Heterogeneity Encoding of Label Hierarchy

被引:0
作者
Song, Junru [1 ]
Chen, Tianlei [3 ]
Yang, Yang [4 ]
Wang, Feifei [2 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] Renmin Univ China, Ctr Appl Stat, Beijing 100872, Peoples R China
[3] Renmin Univ China, Sch Stat, Beijing 100872, Peoples R China
[4] Peking Univ, Sch Comp Sci, Beijing 100072, Peoples R China
关键词
Hierarchical Text Classification; Heterogeneous Graph Transformer; Large Language Models;
D O I
10.1016/j.eswa.2025.126558
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical Text Classification (HTC) is a challenging subtask of multi-label text classification, where labels are organized into a pre-defined hierarchy. Recent works primarily encode documents and labels separately before cross attention-based feature extraction, and in the process, they collectively overlook a crucial characteristic of label hierarchies: "heterogeneity". Specifically, labels on different levels hold different granularities, and they should be projected onto distinct feature spaces; The relationships among labels are various, dictating that the message transmission among them should occur in unique feature spaces. We term these properties ''granularity heterogeneity"and "relationship heterogeneity", respectively. To fully exploit these ubiquitous yet overlooked properties, we propose HeteroHTC, which features a heterogeneous label hierarchy encoder. Additionally, we leverage pre-trained Large Language Models (LLMs) to generate high-quality label descriptions with strategically designed prompts. HeteroHTC outperforms almost all baselines in our extensive experiments on three datasets, proving its effectiveness and the necessity to take "granularity and relationship heterogeneity"into consideration.
引用
收藏
页数:11
相关论文
共 44 条
[1]  
Aly R, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, P323
[2]  
[Anonymous], 2010, JMLR WORKSHOP C P
[3]  
Banerjee S, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P6295
[4]  
Cao PF, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P3105
[5]   Edge-relational window-attentional graph neural network for gene expression prediction in spatial transcriptomics analysis [J].
Chen C. ;
Zhang Z. ;
Tang P. ;
Liu X. ;
Huang B. .
Computers in Biology and Medicine, 2024, 174
[6]   Retrieval-style In-context Learning for Few-shot Hierarchical Text Classification [J].
Chen, Huiyao ;
Zhao, Yu ;
Chen, Zulong ;
Wang, Mengjia ;
Li, Liangyue ;
Zhang, Meishan ;
Zhang, Min .
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 :1214-1231
[7]  
Deng ZF, 2021, Arxiv, DOI arXiv:2104.05220
[8]  
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[9]  
Fang H., 2023, IEEE Transactions on Mobile Computing
[10]   A survey of automated hierarchical classification of patents [J].
Gomez, Juan Carlos ;
Moens, Marie-Francine .
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8830 :215-249