Hierarchical Verbalizer for Few-Shot Hierarchical Text Classification

被引：0

作者：

Ji, Ke ^{[1
,2
]}

Lian, Yixin ^{[2
]}

Gao, Jingsheng ^{[2
]}

Wang, Baoyuan ^{[2
]}

机构：

[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Peoples R China

[2] Xiaobing AI, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Due to the complex label hierarchy and intensive labeling cost in practice, the hierarchical text classification (HTC) suffers a poor performance especially when low-resource or fewshot settings are considered. Recently, there is a growing trend of applying prompts on pretrained language models (PLMs), which has exhibited effectiveness in the few-shot flat text classification tasks. However, limited work has studied the paradigm of prompt-based learning in the HTC problem when the training data is extremely scarce. In this work, we define a path-based few-shot setting and establish a strict path-based evaluation metric to further explore few-shot HTC tasks. To address the issue, we propose the hierarchical verbalizer ("HierVerb"), a multi-verbalizer framework treating HTC as a single- or multi-label classification problem at multiple layers and learning vectors as verbalizers constrained by hierarchical structure and hierarchical contrastive learning. In this manner, HierVerb fuses label hierarchy knowledge into verbalizers and remarkably outperforms those who inject hierarchy through graph encoders, maximizing the benefits of PLMs. Extensive experiments on three popular HTC datasets under the few-shot settings demonstrate that prompt with HierVerb significantly boosts the HTC performance, meanwhile indicating an elegant way to bridge the gap between the large pre-trained model and downstream hierarchical classification tasks. 1

引用

页码：2918 / 2933

页数：16

共 50 条

[1] Hierarchical Attention Prototypical Networks for Few-Shot Text Classification
Sun, Shengli
Sun, Qingfeng
Zhou, Kevin
Lv, Tengchao
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 476 - 485
[2] Few-shot Hierarchical Text Classification with Bidirectional Path Constraint by label weighting
Zhang, Mingbao
Song, Rui
Li, Xiang
Tavares, Adriano
Xu, Hao
PATTERN RECOGNITION LETTERS, 2025, 190 : 81 - 88
[3] Retrieval-style In-context Learning for Few-shot Hierarchical Text Classification
Chen, Huiyao
Zhao, Yu
Chen, Zulong
Wang, Mengjia
Li, Liangyue
Zhang, Meishan
Zhang, Min
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 : 1214 - 1231
[4] Learning Hierarchical Task Structures for Few-shot Graph Classification
Wang, Song
Dong, Yushun
Huang, Xiao
Chen, Chen
Li, Jundong
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (03)
[5] SELF-ADAPTIVE EMBEDDING FOR FEW-SHOT CLASSIFICATION BY HIERARCHICAL ATTENTION
Wang, Xueliang
Wu, Feng
Wang, Jie
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[6] Exploring Hierarchical Prototypes for Few-Shot Segmentation
Chen, Yaozong
Cao, Wenming
ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 42 - 53
[7] Label-Aware Automatic Verbalizer for Few-Shot Text Classification in Mid-To-Low Resource Languages
Thaminkaew, Thanakorn
Lertvittayakumjorn, Piyawat
Vateekul, Peerapon
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 4: STUDENT RESEARCH WORKSHOP, 2024, : 119 - 127
[8] Causal representation for few-shot text classification
Yang, Maoqin
Zhang, Xuejie
Wang, Jin
Zhou, Xiaobing
APPLIED INTELLIGENCE, 2023, 53 (18) : 21422 - 21432
[9] Few-shot learning for short text classification
Yan, Leiming
Zheng, Yuhui
Cao, Jie
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) : 29799 - 29810
[10] Adversarial training for few-shot text classification
Croce, Danilo
Castellucci, Giuseppe
Basili, Roberto
INTELLIGENZA ARTIFICIALE, 2020, 14 (02) : 201 - 214

← 1 2 3 4 5 →