Prompting-to-Distill Semantic Knowledge for Few-Shot Learning

被引:0
|
作者
Ji, Hong [1 ]
Gao, Zhi [1 ,2 ]
Ren, Jinchang [3 ]
Wang, Xing-ao [1 ]
Gao, Tianyi [1 ]
Sun, Wenbo [1 ]
Ma, Ping
机构
[1] Wuhan Univ, Sch Remote Sensing Informat Engn, Wuhan 430079, Peoples R China
[2] Hubei Luojia Lab, Wuhan 430079, Peoples R China
[3] Robert Gordon Univ, Natl Subsea Ctr, Aberdeen AB21 0BH, Scotland
基金
中国国家自然科学基金;
关键词
Attention mechanism; ChatGPT; CLIP; few-shot learning (FSL); multimodal knowledge;
D O I
10.1109/LGRS.2024.3414505
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recognizing visual patterns in low-data regime necessitates deep neural networks to glean generalized representations from limited training samples. In this letter, we propose a novel few-shot classification method, namely ProDFSL, leveraging multimodal knowledge and attention mechanism. We are inspired by recent advances of large language models and the great potential they have shown across a wide range of downstream tasks and tailor it to benefit the remote sensing community. We utilize ChatGPT to produce class-specific textual inputs for enabling CLIP with rich semantic information. To promote the adaptation of CLIP in remote sensing domain, we introduce a cross-modal knowledge generation module, which dynamically generates a group of soft prompts conditioned on the few-shot visual samples and further uses a shallow Transformer to model the dependencies between language sequences. Fusing the semantic information with few-shot visual samples, we build representative class prototypes, which are conducive to both inductive and transductive inference. In extensive experiments on standard benchmarks, our ProDFSL consistently outperforms the state of the art in few-shot learning (FSL).
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Collect and Select: Semantic Alignment Metric Learning for Few-Shot Learning
    Hao, Fusheng
    He, Fengxiang
    Cheng, Jun
    Wang, Lei
    Cao, Jianzhong
    Tao, Dacheng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8459 - 8468
  • [22] Symmetric Hallucination With Knowledge Transfer for Few-Shot Learning
    Wang, Shuo
    Zhang, Xinyu
    Wang, Meng
    He, Xiangnan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1797 - 1807
  • [23] Template-Free Prompting for Few-Shot Named Entity Recognition via Semantic-Enhanced Contrastive Learning
    He, Kai
    Mao, Rui
    Huang, Yucheng
    Gong, Tieliang
    Li, Chen
    Cambria, Erik
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 18357 - 18369
  • [24] Imposing Semantic Consistency of Local Descriptors for Few-Shot Learning
    Cheng, Jun
    Hao, Fusheng
    Liu, Liu
    Tao, Dacheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1587 - 1600
  • [25] Learning Orthogonal Prototypes for Generalized Few-shot Semantic Segmentation
    Liu, Sun-Ao
    Zhang, Yiheng
    Qiu, Zhaofan
    Xie, Hongtao
    Zhang, Yongdong
    Yao, Ting
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11319 - 11328
  • [26] SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning
    Lai, Jinxiang
    Yang, Siqian
    Wu, Wenlong
    Wu, Tao
    Jiang, Guannan
    Wang, Xi
    Liu, Jun
    Gao, Bin-Bin
    Zhang, Wei
    Xie, Yuan
    Wang, Chengjie
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8430 - 8437
  • [27] Harnessing Multi-Semantic Hypergraph for Few-Shot Learning
    Chen, Hao
    Li, Linyan
    Xia, Zhenping
    Lyu, Fan
    Zhao, Liuqing
    Huang, Kaizhu
    Feng, Wei
    Hu, Fuyuan
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 232 - 244
  • [28] Semantic Guided Latent Parts Embedding for Few-Shot Learning
    Yang, Fengyuan
    Wang, Ruiping
    Chen, Xilin
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5436 - 5446
  • [29] Mixer-Based Semantic Spread for Few-Shot Learning
    Cheng, Jun
    Hao, Fusheng
    He, Fengxiang
    Liu, Liu
    Zhang, Qieshi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 191 - 202
  • [30] Learning Foreground Information Bottleneck for few-shot semantic segmentation
    Hu, Yutao
    Huang, Xin
    Luo, Xiaoyan
    Han, Jungong
    Cao, Xianbin
    Zhang, Jun
    PATTERN RECOGNITION, 2024, 146