Hierarchical Text Classification as Sub-hierarchy Sequence Generation

被引:0
作者
Im, SangHun [1 ]
Kim, GiBaeg [1 ]
Oh, Heung-Seon [1 ]
Jo, Seongung [1 ]
Kim, Dong Hwan [1 ]
机构
[1] Korea Univ Technol & Educ KOREATECH, Sch Comp Sci & Engn, Cheonan, South Korea
来源
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11 | 2023年
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical text classification (HTC) is essential for various real applications. However, HTC models are challenging to develop because they often require processing a large volume of documents and labels with hierarchical taxonomy. Recent HTC models based on deep learning have attempted to incorporate hierarchy information into a model structure. Consequently, these models are challenging to implement when the model parameters increase for a large-scale hierarchy because the model structure depends on the hierarchy size. To solve this problem, we formulate HTC as a sub-hierarchy sequence generation to incorporate hierarchy information into a target label sequence instead of the model structure. Subsequently, we propose the Hierarchy DECoder (HiDEC), which decodes a text sequence into a sub-hierarchy sequence using recursive hierarchy decoding, classifying all parents at the same level into children at once. In addition, HiDEC is trained to use hierarchical path information from a root to each leaf in a sub-hierarchy composed of the labels of a target document via an attention mechanism and hierarchy-aware masking. HiDEC achieved state-of-the-art performance with significantly fewer model parameters than existing models on benchmark datasets, such as RCV1-v2, NYT, and EURLEX57K.
引用
收藏
页码:12933 / 12941
页数:9
相关论文
共 50 条
  • [21] On Dataless Hierarchical Text Classification
    Song, Yangqiu
    Roth, Dan
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 1579 - 1585
  • [22] Experiments with hierarchical text classification
    Granitzer, M
    Auer, P
    [J]. PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2005, : 177 - 182
  • [23] Hierarchical text classification and evaluation
    Sun, AX
    Lim, EP
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 521 - 528
  • [24] Text Generation for Imbalanced Text Classification
    Akkaradamrongrat, Suphamongkol
    Kachamas, Pornpimon
    Sinthupinyo, Sukree
    [J]. 2019 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2019), 2019, : 181 - 186
  • [25] An approach of multi-hierarchy text classification
    Liu, SH
    Dong, MK
    Zhang, HJ
    Li, R
    Shi, ZZ
    [J]. 2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C95 - C100
  • [26] Naive approach for hierarchical text classification
    Wang, Mingwen
    Lu, Xu
    Zhang, Huawei
    Luo, Yuansheng
    [J]. Journal of Computational Information Systems, 2007, 3 (04): : 1591 - 1598
  • [27] Hierarchical text classification methods and their specification
    Sun, AX
    Lim, EP
    Ng, WK
    [J]. COOPERATIVE INTERNET COMPUTING, 2003, 729 : 236 - 256
  • [28] Context Recognition for Hierarchical Text Classification
    Liu, Rey-Long
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (04): : 803 - 813
  • [29] Hierarchical Interpretation of Neural Text Classification
    Yan, Hanqi
    Gui, Lin
    He, Yulan
    [J]. COMPUTATIONAL LINGUISTICS, 2022, 48 (04) : 987 - 1020
  • [30] Hierarchical Text Classification Incremental Learning
    Song, Shengli
    Qiao, Xiaofei
    Chen, Ping
    [J]. NEURAL INFORMATION PROCESSING, PT 1, PROCEEDINGS, 2009, 5863 : 247 - 258