Few-shot Hierarchical Text Classification with Bidirectional Path Constraint by label weighting

Authors
Zhang, Mingbao [1, 2]
Song, Rui [4 ]
Li, Xiang [1, 3]
Tavares, Adriano [1 ]
Xu, Hao [4 ]
Affiliations
[1] Univ Minho, Braga, Portugal
[2] Neusoft Educ Technol Co Ltd, Shenyang, Peoples R China
[3] Dalian Neusoft Univ Informat, Dalian, Peoples R China
[4] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
Keywords
Text analysis; Multi-label classification; Few-shot learning; Weakly-supervised learning
DOI
10.1016/j.patrec.2025.01.025
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Hierarchical Text Classification (HTC) organizes candidate labels into a hierarchical structure and uses one or more paths within the hierarchy as the ground-truth labels. It has been applied to various downstream tasks, e.g., sentiment analysis and harmful text detection. Existing works often involve data-driven models that are trained on large-scale datasets. However, creating annotated datasets is labor-intensive and time-consuming. To address this issue, recent work has focused on the few-shot HTC task, where each class has only a few samples, e.g., 5. These approaches perform classification at each layer separately and leverage the prompt learning capability of pre-trained models like BERT. However, we find that these methods neglect the inter-layer relationships. To solve this problem, we propose a new model called Bidirectional Path Constraint by Label Weighting (BPc-LW). Its basic idea is to use a pre-defined label embedding matrix and a feed-forward neural network for information propagation between layers, while also designing a bidirectional label weighting method to constrain the predictions of each layer to lie along the same path in the label hierarchy. In addition, we employ a contrastive learning-based method to enhance the discriminative capacity of the hierarchical embeddings. We compare our proposed method with recent few-shot HTC baseline models across three benchmark datasets, and the experimental results demonstrate the effectiveness of BPc-LW.
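
The abstract does not give the formulas behind the bidirectional label weighting, so the following is only a toy sketch of the general idea of constraining per-layer predictions to one path, not the BPc-LW method itself. It assumes a two-level hierarchy and a simple product rule: each child score is scaled by its parent's score (top-down), and each parent score is scaled by its best child's score (bottom-up). The label names, the scores, and the functions `bidirectional_reweight` and `predict_path` are all hypothetical.

```python
# Hypothetical two-level hierarchy: child label -> parent label.
PARENT_OF = {
    "positive": "sentiment",
    "negative": "sentiment",
    "spam": "harmful",
    "abuse": "harmful",
}


def bidirectional_reweight(parent_probs, child_probs, parent_of):
    """Reweight each layer's scores using the other layer's scores.

    Assumed rule (not taken from the paper): a plain product of scores.
    """
    # Top-down: scale every child by the score of its parent.
    child_w = {c: p * parent_probs[parent_of[c]] for c, p in child_probs.items()}
    # Bottom-up: scale every parent by the score of its best child.
    best_child = {}
    for c, p in child_probs.items():
        par = parent_of[c]
        best_child[par] = max(best_child.get(par, 0.0), p)
    parent_w = {par: p * best_child.get(par, 0.0) for par, p in parent_probs.items()}
    return parent_w, child_w


def predict_path(parent_probs, child_probs, parent_of):
    """Pick the top label in each layer after reweighting."""
    parent_w, child_w = bidirectional_reweight(parent_probs, child_probs, parent_of)
    return max(parent_w, key=parent_w.get), max(child_w, key=child_w.get)


# Made-up per-layer scores whose independent argmax picks an inconsistent pair:
# "sentiment" at the top layer but "spam" at the bottom layer.
parent_probs = {"sentiment": 0.55, "harmful": 0.45}
child_probs = {"positive": 0.10, "negative": 0.15, "spam": 0.60, "abuse": 0.15}

independent = (max(parent_probs, key=parent_probs.get),
               max(child_probs, key=child_probs.get))
constrained = predict_path(parent_probs, child_probs, PARENT_OF)
print(independent, constrained)
```

Under this particular product rule, the largest reweighted parent score equals the largest reweighted child score, so the two selected labels fall on the same path whenever there are no ties. A learned weighting, as the abstract describes, would not necessarily share that property.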
Pages: 81-88
Page count: 8