Hierarchical Text Classification as Sub-hierarchy Sequence Generation

被引:0
作者
Im, SangHun [1 ]
Kim, GiBaeg [1 ]
Oh, Heung-Seon [1 ]
Jo, Seongung [1 ]
Kim, Dong Hwan [1 ]
机构
[1] Korea Univ Technol & Educ KOREATECH, Sch Comp Sci & Engn, Cheonan, South Korea
来源
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11 | 2023年
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical text classification (HTC) is essential for various real applications. However, HTC models are challenging to develop because they often require processing a large volume of documents and labels with hierarchical taxonomy. Recent HTC models based on deep learning have attempted to incorporate hierarchy information into a model structure. Consequently, these models are challenging to implement when the model parameters increase for a large-scale hierarchy because the model structure depends on the hierarchy size. To solve this problem, we formulate HTC as a sub-hierarchy sequence generation to incorporate hierarchy information into a target label sequence instead of the model structure. Subsequently, we propose the Hierarchy DECoder (HiDEC), which decodes a text sequence into a sub-hierarchy sequence using recursive hierarchy decoding, classifying all parents at the same level into children at once. In addition, HiDEC is trained to use hierarchical path information from a root to each leaf in a sub-hierarchy composed of the labels of a target document via an attention mechanism and hierarchy-aware masking. HiDEC achieved state-of-the-art performance with significantly fewer model parameters than existing models on benchmark datasets, such as RCV1-v2, NYT, and EURLEX57K.
引用
收藏
页码:12933 / 12941
页数:9
相关论文
共 50 条
[21]   Multi-granularity sequence generation for hierarchical image classification [J].
Xinda Liu ;
Lili Wang .
Computational Visual Media, 2024, 10 :243-260
[22]   Does the Order Matter? A Random Generative Way to Learn Label Hierarchy for Hierarchical Text Classification [J].
Yan, Jingsong ;
Li, Piji ;
Chen, Haibin ;
Zheng, Junhao ;
Ma, Qianli .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 :276-285
[23]   On Dataless Hierarchical Text Classification [J].
Song, Yangqiu ;
Roth, Dan .
PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, :1579-1585
[24]   Experiments with hierarchical text classification [J].
Granitzer, M ;
Auer, P .
PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2005, :177-182
[25]   Hierarchical text classification and evaluation [J].
Sun, AX ;
Lim, EP .
2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, :521-528
[26]   Text Generation for Imbalanced Text Classification [J].
Akkaradamrongrat, Suphamongkol ;
Kachamas, Pornpimon ;
Sinthupinyo, Sukree .
2019 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2019), 2019, :181-186
[27]   An approach of multi-hierarchy text classification [J].
Liu, SH ;
Dong, MK ;
Zhang, HJ ;
Li, R ;
Shi, ZZ .
2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, :C95-C100
[28]   Hierarchical text classification methods and their specification [J].
Sun, AX ;
Lim, EP ;
Ng, WK .
COOPERATIVE INTERNET COMPUTING, 2003, 729 :236-256
[29]   Naive approach for hierarchical text classification [J].
Wang, Mingwen ;
Lu, Xu ;
Zhang, Huawei ;
Luo, Yuansheng .
Journal of Computational Information Systems, 2007, 3 (04) :1591-1598
[30]   Hierarchical Interpretation of Neural Text Classification [J].
Yan, Hanqi ;
Gui, Lin ;
He, Yulan .
COMPUTATIONAL LINGUISTICS, 2022, 48 (04) :987-1020