Less is more: A closer look at semantic-based few-shot learning

被引:0
|
作者
Zhou, Chunpeng [1 ]
Yu, Zhi [2 ]
Yuan, Xilu [1 ]
Zhou, Sheng [2 ]
Bu, Jiajun [1 ]
Wang, Haishuai [1 ,3 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Zhejiang Key Lab Accessible Percept & Intelligent, Hangzhou 310000, Peoples R China
[2] Zhejiang Univ, Sch Software Technol, Ningbo 310027, Peoples R China
[3] Shanghai Artificial Intelligence Lab, Shanghai 200125, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Few-shot learning; Multi-modal learning; Feature representation; Image classification;
D O I
10.1016/j.inffus.2024.102672
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot Learning (FSL) aims to learn and distinguish new categories from a scant number of available samples, presenting a significant challenge in the realm of deep learning. Recent researchers have sought to leverage the additional semantic or linguistic information of scarce categories with a pre-trained language model to facilitate learning, thus partially alleviating the problem of insufficient supervision signals. Nonetheless, the full potential of the semantic information and pre-trained language model have been underestimated in the few-shot learning till now, resulting in limited performance enhancements. To address this, we propose a straightforward and efficacious framework for few-shot learning tasks, specifically designed to exploit the semantic information and language model. Specifically, we explicitly harness the zero-shot capability of the pre-trained language model with learnable prompts. And we directly add the visual feature with the textual feature for inference without the intricate designed fusion modules as in prior studies. Additionally, we apply the self-ensemble and distillation to further enhance performance. Extensive experiments conducted across four widely used few-shot datasets demonstrate that our simple framework achieves impressive results. Particularly noteworthy is its outstanding performance in the 1-shot learning task, surpassing the current state-of-the-art by an average of 3.3% in classification accuracy. Our code will be available at https://github.com/zhouchunpong/ SimpleFewShot.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Learning Orthogonal Prototypes for Generalized Few-shot Semantic Segmentation
    Liu, Sun-Ao
    Zhang, Yiheng
    Qiu, Zhaofan
    Xie, Hongtao
    Zhang, Yongdong
    Yao, Ting
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11319 - 11328
  • [22] SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning
    Lai, Jinxiang
    Yang, Siqian
    Wu, Wenlong
    Wu, Tao
    Jiang, Guannan
    Wang, Xi
    Liu, Jun
    Gao, Bin-Bin
    Zhang, Wei
    Xie, Yuan
    Wang, Chengjie
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8430 - 8437
  • [23] Prompting-to-Distill Semantic Knowledge for Few-Shot Learning
    Ji, Hong
    Gao, Zhi
    Ren, Jinchang
    Wang, Xing-ao
    Gao, Tianyi
    Sun, Wenbo
    Ma, Ping
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [24] Harnessing Multi-Semantic Hypergraph for Few-Shot Learning
    Chen, Hao
    Li, Linyan
    Xia, Zhenping
    Lyu, Fan
    Zhao, Liuqing
    Huang, Kaizhu
    Feng, Wei
    Hu, Fuyuan
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 232 - 244
  • [25] Semantic Guided Latent Parts Embedding for Few-Shot Learning
    Yang, Fengyuan
    Wang, Ruiping
    Chen, Xilin
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5436 - 5446
  • [26] Learning Foreground Information Bottleneck for few-shot semantic segmentation
    Hu, Yutao
    Huang, Xin
    Luo, Xiaoyan
    Han, Jungong
    Cao, Xianbin
    Zhang, Jun
    PATTERN RECOGNITION, 2024, 146
  • [27] A Dual Attention Network with Semantic Embedding for Few-Shot Learning
    Yan, Shipeng
    Zhang, Songyang
    He, Xuming
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9079 - 9086
  • [28] Part-Based Semantic Transform for Few-Shot Semantic Segmentation
    Yang, Boyu
    Wan, Fang
    Liu, Chang
    Li, Bohao
    Ji, Xiangyang
    Ye, Qixiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7141 - 7152
  • [29] Semantic Mask Reconstruction and Category Semantic Learning for few-shot image generation
    Xiao, Ting
    Cai, Yunjie
    Guan, Jiaoyan
    Wang, Zhe
    NEURAL NETWORKS, 2025, 183
  • [30] Generalized Few-shot Semantic Segmentation
    Tian, Zhuotao
    Lai, Xin
    Jiang, Li
    Liu, Shu
    Shu, Michelle
    Zhao, Hengshuang
    Jia, Jiaya
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11553 - 11562