Few-shot Hierarchical Text Classification with Bidirectional Path Constraint by label weighting

被引：0

作者：

Zhang, Mingbao ^{[1
,2
]}

Song, Rui ^{[4
]}

Li, Xiang ^{[1
,3
]}

Tavares, Adriano ^{[1
]}

Xu, Hao ^{[4
]}

机构：

[1] Univ Minho, Braga, Portugal

[2] Neusoft Educ Technol Co Ltd, Shenyang, Peoples R China

[3] Dalian Neusoft Univ Informat, Dalian, Peoples R China

[4] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China

来源：

PATTERN RECOGNITION LETTERS | 2025年 / 190卷

关键词：

Text analysis; Multi-label classification; Few-shot learning; Weakly-supervised learning;

D O I：

10.1016/j.patrec.2025.01.025

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Hierarchical Text Classification (HTC) organizes candidate labels into a hierarchical structure and uses one or more paths within the hierarchy as the ground-truth labels, which has been applied to various downstream tasks, e.g., sentiment analysis and harmful text detection. Existing works often involve data-driven models that are trained on large-scale datasets. However, creating annotated datasets is labor-intensive and timeconsuming. To address this issue, recent work has focused on the few-shot HTC task, where each class has only a few samples, e.g., 5. These approaches perform classification at each layer separately and leverage the prompt learning capability of pre-trained models like BERT. However, we find that these methods always neglect the inter-layer relationships. To solve this problem, we propose anew model called Bidirectional Path Constraint by Label Weighting (BPc-LW). Its basic idea is to use a pre-defined label embedding matrix and a feed-forward neural network for information propagation between layers, while also designing a bidirectional label weighting method to constrain the predictions of each layer to be along the same path in the label hierarchy. In addition, we employ a contrastive learning-based method to enhance the discriminative capacity of the hierarchical embeddings. We compare our proposed method with recent few-shot HTC baseline models across 3 benchmark datasets, and the experimental results demonstrate the effectiveness of BPc-LW.

引用

页码：81 / 88

页数：8

共 50 条

[31] Learning Hierarchical Task Structures for Few-shot Graph Classification
Wang, Song
Dong, Yushun
Huang, Xiao
Chen, Chen
Li, Jundong
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (03)
[32] Few-shot Text Classification Method Based on Feature Optimization
Peng, Jing
Huo, Shuquan
JOURNAL OF WEB ENGINEERING, 2023, 22 (03): : 497 - 514
[33] Dynamic Memory Induction Networks for Few-Shot Text Classification
Geng, Ruiying
Li, Binhua
Li, Yongbin
Sun, Jian
Zhu, Xiaodan
58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1087 - 1094
[34] Feature Weighting and Boosting for Few-Shot Segmentation
Khoi Nguyen
Todorovic, Sinisa
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 622 - 631
[35] A review of few-shot classification
Lim, Jia Min
Lim, Kian Ming
Lee, Chin Poo
Lim, Jit Yan
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 275
[36] BCC: BIDIRECTIONAL CONSISTENCY CONSTRAINT METHOD FOR HIERARCHICAL TEXT CLASSIFICATION
Shen, Yinghan
Yan, Yu
Yin, Dechun
Shen, Huawei
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 11271 - 11275
[37] Meta-Learning for Multi-Label Few-Shot Classification
Simon, Christian
Koniusz, Piotr
Harandi, Mehrtash
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 346 - 355
[38] MetricPrompt: Prompting Model as a Relevance Metric for Few-shot Text Classification
Dong, Hongyuan
Zhang, Weinan
Che, Wanxiang
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 426 - 436
[39] Enhance Prototypical Network with Text Descriptions for Few-shot Relation Classification
Yang, Kaijia
Zheng, Nantao
Dai, Xinyu
He, Liang
Huang, Shujian
Chen, Jiajun
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2273 - 2276
[40] Noisy Channel Language Model Prompting for Few-Shot Text Classification
Min, Sewon
Lewis, Mike
Hajishirzi, Hannaneh
Zettlemoyer, Luke
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5316 - 5330

← 1 2 3 4 5 →