Hierarchical Verbalizer for Few-Shot Hierarchical Text Classification

被引:0
|
作者
Ji, Ke [1 ,2 ]
Lian, Yixin [2 ]
Gao, Jingsheng [2 ]
Wang, Baoyuan [2 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Xiaobing AI, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1 | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the complex label hierarchy and intensive labeling cost in practice, the hierarchical text classification (HTC) suffers a poor performance especially when low-resource or fewshot settings are considered. Recently, there is a growing trend of applying prompts on pretrained language models (PLMs), which has exhibited effectiveness in the few-shot flat text classification tasks. However, limited work has studied the paradigm of prompt-based learning in the HTC problem when the training data is extremely scarce. In this work, we define a path-based few-shot setting and establish a strict path-based evaluation metric to further explore few-shot HTC tasks. To address the issue, we propose the hierarchical verbalizer ("HierVerb"), a multi-verbalizer framework treating HTC as a single- or multi-label classification problem at multiple layers and learning vectors as verbalizers constrained by hierarchical structure and hierarchical contrastive learning. In this manner, HierVerb fuses label hierarchy knowledge into verbalizers and remarkably outperforms those who inject hierarchy through graph encoders, maximizing the benefits of PLMs. Extensive experiments on three popular HTC datasets under the few-shot settings demonstrate that prompt with HierVerb significantly boosts the HTC performance, meanwhile indicating an elegant way to bridge the gap between the large pre-trained model and downstream hierarchical classification tasks. 1
引用
收藏
页码:2918 / 2933
页数:16
相关论文
共 50 条
  • [1] Hierarchical Attention Prototypical Networks for Few-Shot Text Classification
    Sun, Shengli
    Sun, Qingfeng
    Zhou, Kevin
    Lv, Tengchao
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 476 - 485
  • [2] Few-shot Hierarchical Text Classification with Bidirectional Path Constraint by label weighting
    Zhang, Mingbao
    Song, Rui
    Li, Xiang
    Tavares, Adriano
    Xu, Hao
    PATTERN RECOGNITION LETTERS, 2025, 190 : 81 - 88
  • [3] Retrieval-style In-context Learning for Few-shot Hierarchical Text Classification
    Chen, Huiyao
    Zhao, Yu
    Chen, Zulong
    Wang, Mengjia
    Li, Liangyue
    Zhang, Meishan
    Zhang, Min
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 : 1214 - 1231
  • [4] Learning Hierarchical Task Structures for Few-shot Graph Classification
    Wang, Song
    Dong, Yushun
    Huang, Xiao
    Chen, Chen
    Li, Jundong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (03)
  • [5] SELF-ADAPTIVE EMBEDDING FOR FEW-SHOT CLASSIFICATION BY HIERARCHICAL ATTENTION
    Wang, Xueliang
    Wu, Feng
    Wang, Jie
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [6] Exploring Hierarchical Prototypes for Few-Shot Segmentation
    Chen, Yaozong
    Cao, Wenming
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 42 - 53
  • [7] Label-Aware Automatic Verbalizer for Few-Shot Text Classification in Mid-To-Low Resource Languages
    Thaminkaew, Thanakorn
    Lertvittayakumjorn, Piyawat
    Vateekul, Peerapon
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 4: STUDENT RESEARCH WORKSHOP, 2024, : 119 - 127
  • [8] Causal representation for few-shot text classification
    Yang, Maoqin
    Zhang, Xuejie
    Wang, Jin
    Zhou, Xiaobing
    APPLIED INTELLIGENCE, 2023, 53 (18) : 21422 - 21432
  • [9] Few-shot learning for short text classification
    Yan, Leiming
    Zheng, Yuhui
    Cao, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) : 29799 - 29810
  • [10] Adversarial training for few-shot text classification
    Croce, Danilo
    Castellucci, Giuseppe
    Basili, Roberto
    INTELLIGENZA ARTIFICIALE, 2020, 14 (02) : 201 - 214