CLUR: Uncertainty Estimation for Few-Shot Text Classification with Contrastive Learning

被引:1
|
作者
He, Jianfeng [1 ]
Zhang, Xuchao [2 ]
Lei, Shuo [1 ]
Alhamadani, Abdulaziz [1 ]
Chen, Fanglan [1 ]
Xiao, Bei [3 ]
Lu, Chang-Tien [1 ]
机构
[1] Virginia Tech, Falls Church, VA 22043 USA
[2] Microsoft, Redmond, WA USA
[3] Amer Univ, Washington, DC USA
基金
美国国家科学基金会;
关键词
Uncertainty estimation; few-shot; pseudo labels; contrastive learning;
D O I
10.1145/3580305.3599276
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot text classification has extensive application where the sample collection is expensive or complicated. When the penalty for classification errors is high, such as early threat event detection with scarce data, we expect to know "whether we should trust the classification results or reexamine them." This paper investigates the Uncertainty Estimation for Few-shot Text Classification (UEFTC), an unexplored research area. Given limited samples, a UEFTC model predicts an uncertainty score for a classification result, which is the likelihood that the classification result is false. However, many traditional uncertainty estimation models in text classification are unsuitable for implementing a UEFTC model. These models require numerous training samples, whereas the few-shot setting in UEFTC only provides a few or just one support sample for each class in an episode. We propose Contrastive Learning from Uncertainty Relations (CLUR) to address UEFTC. CLUR can be trained with only one support sample for each class with the help of pseudo uncertainty scores. Unlike previous works that manually set the pseudo uncertainty scores, CLUR self-adaptively learns them using our proposed uncertainty relations. Specifically, we explore four model structures in CLUR to investigate the performance of three common-used contrastive learning components in UEFTC and find that two of the components are effective. Experiment results prove that CLUR outperforms six baselines on four datasets, including an improvement of 4.52% AUPR on an RCV1 dataset in a 5-way 1-shot setting. Our code and data split for UEFTC are in https: //github.com/he159ok/CLUR_UncertaintyEst_FewShot_TextCls.
引用
收藏
页码:698 / 710
页数:13
相关论文
共 50 条
  • [1] ContrastNet: A Contrastive Learning Framework for Few-Shot Text Classification
    Chen, Junfan
    Zhang, Richong
    Mao, Yongyi
    Xu, Jie
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10492 - 10500
  • [2] Few-Shot Classification with Contrastive Learning
    Yang, Zhanyuan
    Wang, Jinghua
    Zhu, Yingying
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 293 - 309
  • [3] CPCL: Conceptual prototypical contrastive learning for Few-Shot text classification
    Cheng, Tao
    Cheng, Hua
    Fang, Yiquan
    Liu, Yufei
    Gao, Caiting
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 11963 - 11975
  • [4] Diversified Contrastive Learning For Few-Shot Classification
    Lu, Guangtong
    Li, Fanzhang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT I, 2023, 14254 : 147 - 158
  • [5] Spatial Contrastive Learning for Few-Shot Classification
    Ouali, Yassine
    Hudelot, Celine
    Tami, Myriam
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 671 - 686
  • [6] Multimodal variational contrastive learning for few-shot classification
    Pan, Meihong
    Shen, Hongbin
    APPLIED INTELLIGENCE, 2024, 54 (02) : 1879 - 1892
  • [7] Supervised Contrastive Learning for Few-Shot Action Classification
    Han, Hongfeng
    Fei, Nanyi
    Lu, Zhiwu
    Wen, Ji-Rong
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT III, 2023, 13715 : 512 - 528
  • [8] Multimodal variational contrastive learning for few-shot classification
    Meihong Pan
    Hongbin Shen
    Applied Intelligence, 2024, 54 : 1879 - 1892
  • [9] Few-shot learning for short text classification
    Yan, Leiming
    Zheng, Yuhui
    Cao, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) : 29799 - 29810
  • [10] Few-shot learning for short text classification
    Leiming Yan
    Yuhui Zheng
    Jie Cao
    Multimedia Tools and Applications, 2018, 77 : 29799 - 29810