Prompting-to-Distill Semantic Knowledge for Few-Shot Learning

被引:0
|
作者
Ji, Hong [1 ]
Gao, Zhi [1 ,2 ]
Ren, Jinchang [3 ]
Wang, Xing-ao [1 ]
Gao, Tianyi [1 ]
Sun, Wenbo [1 ]
Ma, Ping
机构
[1] Wuhan Univ, Sch Remote Sensing Informat Engn, Wuhan 430079, Peoples R China
[2] Hubei Luojia Lab, Wuhan 430079, Peoples R China
[3] Robert Gordon Univ, Natl Subsea Ctr, Aberdeen AB21 0BH, Scotland
基金
中国国家自然科学基金;
关键词
Attention mechanism; ChatGPT; CLIP; few-shot learning (FSL); multimodal knowledge;
D O I
10.1109/LGRS.2024.3414505
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recognizing visual patterns in low-data regime necessitates deep neural networks to glean generalized representations from limited training samples. In this letter, we propose a novel few-shot classification method, namely ProDFSL, leveraging multimodal knowledge and attention mechanism. We are inspired by recent advances of large language models and the great potential they have shown across a wide range of downstream tasks and tailor it to benefit the remote sensing community. We utilize ChatGPT to produce class-specific textual inputs for enabling CLIP with rich semantic information. To promote the adaptation of CLIP in remote sensing domain, we introduce a cross-modal knowledge generation module, which dynamically generates a group of soft prompts conditioned on the few-shot visual samples and further uses a shallow Transformer to model the dependencies between language sequences. Fusing the semantic information with few-shot visual samples, we build representative class prototypes, which are conducive to both inductive and transductive inference. In extensive experiments on standard benchmarks, our ProDFSL consistently outperforms the state of the art in few-shot learning (FSL).
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Few-shot Learning with Prompting Methods
    2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA, 2023,
  • [2] MEAL: Stable and Active Learning for Few-Shot Prompting
    Koeksal, Abdullatif
    Schick, Timo
    Schuetze, Hinrich
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 506 - 517
  • [3] Commonsense Knowledge Prompting for Few-Shot Action Recognition in Videos
    Shi, Yuheng
    Wu, Xinxiao
    Lin, Hanxi
    Luo, Jiebo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8395 - 8405
  • [4] Learning Non-target Knowledge for Few-shot Semantic Segmentation
    Liu, Yuanwei
    Liu, Nian
    Cao, Qinglong
    Yao, Xiwen
    Han, Junwei
    Shao, Ling
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11563 - 11572
  • [5] LEARNING WITH MEMORY FOR FEW-SHOT SEMANTIC SEGMENTATION
    Lu, Hongchao
    Wei, Chao
    Deng, Zhidong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 629 - 633
  • [6] Adversarial Knowledge Stimulated Contrastive Prompting for Few-shot Language Learners
    Zheng, Kai
    Sun, Qingfeng
    Yang, Yaming
    Lv, Tengchao
    Pi, Yeyong
    Zhao, Changlin
    Xu, Fei
    Zhang, Qi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 13495 - 13507
  • [7] Dynamic Knowledge Path Learning for Few-Shot Learning
    Li, Jingzhu
    Yin, Zhe
    Yang, Xu
    Jiao, Jianbin
    Ding, Ye
    BIG DATA MINING AND ANALYTICS, 2025, 8 (02): : 479 - 495
  • [8] Adaptive Learning Knowledge Networks for Few-Shot Learning
    Yan, Minghao
    IEEE ACCESS, 2019, 7 : 119041 - 119051
  • [9] Learning to Compare Relation: Semantic Alignment for Few-Shot Learning
    Cao, Congqi
    Zhang, Yanning
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1462 - 1474
  • [10] Few-Shot Novel Concept Learning for Semantic Parsing
    Dan, Soham
    Bastani, Osbert
    Roth, Dan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 2064 - 2075