Prompting-to-Distill Semantic Knowledge for Few-Shot Learning

被引:0
|
作者
Ji, Hong [1 ]
Gao, Zhi [1 ,2 ]
Ren, Jinchang [3 ]
Wang, Xing-ao [1 ]
Gao, Tianyi [1 ]
Sun, Wenbo [1 ]
Ma, Ping
机构
[1] Wuhan Univ, Sch Remote Sensing Informat Engn, Wuhan 430079, Peoples R China
[2] Hubei Luojia Lab, Wuhan 430079, Peoples R China
[3] Robert Gordon Univ, Natl Subsea Ctr, Aberdeen AB21 0BH, Scotland
基金
中国国家自然科学基金;
关键词
Attention mechanism; ChatGPT; CLIP; few-shot learning (FSL); multimodal knowledge;
D O I
10.1109/LGRS.2024.3414505
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recognizing visual patterns in low-data regime necessitates deep neural networks to glean generalized representations from limited training samples. In this letter, we propose a novel few-shot classification method, namely ProDFSL, leveraging multimodal knowledge and attention mechanism. We are inspired by recent advances of large language models and the great potential they have shown across a wide range of downstream tasks and tailor it to benefit the remote sensing community. We utilize ChatGPT to produce class-specific textual inputs for enabling CLIP with rich semantic information. To promote the adaptation of CLIP in remote sensing domain, we introduce a cross-modal knowledge generation module, which dynamically generates a group of soft prompts conditioned on the few-shot visual samples and further uses a shallow Transformer to model the dependencies between language sequences. Fusing the semantic information with few-shot visual samples, we build representative class prototypes, which are conducive to both inductive and transductive inference. In extensive experiments on standard benchmarks, our ProDFSL consistently outperforms the state of the art in few-shot learning (FSL).
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Semantic-Based Few-Shot Classification by Psychometric Learning
    Yin, Lu
    Menkovski, Vlado
    Pei, Yulong
    Pechenizkiy, Mykola
    ADVANCES IN INTELLIGENT DATA ANALYSIS XX, IDA 2022, 2022, 13205 : 392 - 403
  • [32] A Dual Attention Network with Semantic Embedding for Few-Shot Learning
    Yan, Shipeng
    Zhang, Songyang
    He, Xuming
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9079 - 9086
  • [33] HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting
    Lu, Jiaying
    Shen, Jiaming
    Xiong, Bo
    Ma, Wenjing
    Staab, Steffen
    Yang, Carl
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2052 - 2056
  • [34] Semantic Mask Reconstruction and Category Semantic Learning for few-shot image generation
    Xiao, Ting
    Cai, Yunjie
    Guan, Jiaoyan
    Wang, Zhe
    NEURAL NETWORKS, 2025, 183
  • [35] Generalized Few-shot Semantic Segmentation
    Tian, Zhuotao
    Lai, Xin
    Jiang, Li
    Liu, Shu
    Shu, Michelle
    Zhao, Hengshuang
    Jia, Jiaya
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11553 - 11562
  • [36] Few-shot object detection with semantic enhancement and semantic prototype contrastive learning
    Huang, Lian
    Dai, Shaosheng
    He, Ziqiang
    KNOWLEDGE-BASED SYSTEMS, 2022, 252
  • [37] Knowledge Distillation Meets Few-Shot Learning: An Approach for Few-Shot Intent Classification Within and Across Domains
    Sauer, Anna
    Asaadi, Shima
    Kuech, Fabian
    PROCEEDINGS OF THE 4TH WORKSHOP ON NLP FOR CONVERSATIONAL AI, 2022, : 108 - 119
  • [38] Semantic Interaction Matching Network for Few-Shot Knowledge Graph Completion
    Luo, Pengfei
    Zhu, Xi
    Xu, Tong
    Zheng, Yi
    Chen, Enhong
    ACM TRANSACTIONS ON THE WEB, 2024, 18 (02)
  • [39] Embedding Generalized Semantic Knowledge Into Few-Shot Remote Sensing Segmentation
    Wang, Qi
    Jia, Yuyu
    Huang, Wei
    Gao, Junyu
    Li, Qiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [40] HybridPrompt: Domain-Aware Prompting for Cross-Domain Few-Shot Learning
    Wu, Jiamin
    Zhang, Tianzhu
    Zhang, Yongdong
    International Journal of Computer Vision, 132 (12): : 5681 - 5697