A Study on Hierarchical Text Classification as a Seq2seq Task

Authors
Torba, Fatos [1, 2]
Gravier, Christophe [2]
Laclau, Charlotte [3]
Kammoun, Abderrhammen [1]
Subercaze, Julien [1]
Affiliations
[1] AItenders, St Etienne, France
[2] CNRS, Lab Hubert Curien, UMR 5516, St Etienne, France
[3] Inst Polytech Paris, Telecom Paris, Paris, France
Source
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT III | 2024 / Vol. 14610
Keywords
Hierarchical text classification; generative model; reproducibility;
DOI
10.1007/978-3-031-56063-7_20
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the progress of generative neural models, Hierarchical Text Classification (HTC) can be cast as a generative task. In this case, given an input text, the model generates the sequence of predicted class labels taken from a label tree of arbitrary width and depth. Treating HTC as a generative task introduces multiple modeling choices. These choices range from the order in which the class tree is visited, which defines the order in which tokens are generated, to whether decoding is constrained to labels consistent with the previous level's predictions, to the choice of the pre-trained Language Model itself. Each HTC model therefore differs from the others not only in architecture but also in the modeling choices that were made. Prior contributions lack transparent modeling choices and open implementations, which makes it hard to assess whether model performance stems from architectural or modeling decisions. For these reasons, this paper analyzes the impact of different modeling choices, along with common model errors and successes for this task. The analysis is based on an open framework accompanying this paper, which can facilitate the development of future contributions in the field by providing datasets, metrics, an error analysis toolkit, and the capability to readily test various modeling choices for a given model.
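Two of the modeling choices named in the abstract, the tree visiting order and level-constrained decoding, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy label tree, the function names `linearize` and `constrained_pick`, and the score dictionary are hypothetical and are not taken from the paper or its framework.

```python
from collections import deque

# Hypothetical toy label tree (parent -> children); not from the paper.
TREE = {
    "root": ["science", "sports"],
    "science": ["physics", "biology"],
    "sports": ["tennis"],
    "physics": [],
    "biology": [],
    "tennis": [],
}


def linearize(gold, order="dfs"):
    """Return the gold labels as a target sequence, in tree visiting order."""
    out = []
    frontier = deque(["root"])
    while frontier:
        # DFS uses the deque as a stack, BFS uses it as a queue.
        node = frontier.pop() if order == "dfs" else frontier.popleft()
        if node != "root":
            out.append(node)
        children = [c for c in TREE[node] if c in gold]
        # Reverse for DFS so that children are visited left to right.
        frontier.extend(reversed(children) if order == "dfs" else children)
    return out


def constrained_pick(scores, parent):
    """Pick the best-scoring label among the children of the previous prediction."""
    allowed = TREE[parent]
    if not allowed:
        return None  # the previous prediction is a leaf
    return max(allowed, key=lambda label: scores.get(label, float("-inf")))


gold = {"science", "physics", "sports", "tennis"}
dfs_target = linearize(gold, "dfs")
bfs_target = linearize(gold, "bfs")

# Made-up scores: unconstrained argmax would pick "tennis",
# which is not a child of "science".
scores = {"tennis": 0.9, "physics": 0.6, "biology": 0.3}
picked = constrained_pick(scores, "science")
```

With this toy tree, the depth-first target lists each path before moving to the next branch, while the breadth-first target lists all first-level labels before any second-level label. The constrained pick ignores the higher-scoring label that would violate the hierarchy.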
Pages: 287-296
Page count: 10