Hierarchy-Aware Bilateral-Branch Network for Imbalanced Hierarchical Text Classification

被引:0
作者
Zhao, Jiangjiang [1 ,2 ]
Lie, Jiyi [2 ]
Fukumoto, Fumiyo [2 ]
机构
[1] Hangzhou Dianzi Univ, Hangzhou, Peoples R China
[2] Univ Yamanashi, Kofu, Japan
来源
DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2023, PT II | 2023年 / 14147卷
关键词
Hierarchical Text Classification; Imbalanced Data;
D O I
10.1007/978-3-031-39821-6_12
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Hierarchical text classification is an essential task in natural language processing. Existing studies focus only on label hierarchy structure, such as building classifiers for each level of labels or employing the label taxonomic hierarchy to improve the hierarchy classification performance. However, these methods ignore issues with imbalanced datasets, which present tremendous challenges to text classification performance, especially for the tail categories. To this end, we propose Hierarchyaware-Bilateral-Branch-Network (HiBBN) to address this problem, where we introduce the bilateral-branch network and apply a hierarchy-aware encoder to model text representation with label dependencies. In addition, HiBBN has two network branches that cooperate with the uniform sampler and reversed sampler, which can deal with the data imbalance problem sufficiently. Therefore, our-model handles both hierarchical structural information and modeling of tail data simultaneously, and extensive experiments on benchmark datasets indicate that our model achieves better performance, especially for fine-grained categories.
引用
收藏
页码:143 / 157
页数:15
相关论文
empty
未找到相关数据