Prompting-to-Distill Semantic Knowledge for Few-Shot Learning

Cited by: 0

Authors:

Ji, Hong [1]

Gao, Zhi [1,2]

Ren, Jinchang [3]

Wang, Xing-ao [1]

Gao, Tianyi [1]

Sun, Wenbo [1]

Ma, Ping

Affiliations:

[1] Wuhan Univ, Sch Remote Sensing Informat Engn, Wuhan 430079, Peoples R China

[2] Hubei Luojia Lab, Wuhan 430079, Peoples R China

[3] Robert Gordon Univ, Natl Subsea Ctr, Aberdeen AB21 0BH, Scotland

Source:

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS | 2024 / Vol. 21

Funding:

National Natural Science Foundation of China;

Keywords:

Attention mechanism; ChatGPT; CLIP; few-shot learning (FSL); multimodal knowledge;

DOI:

10.1109/LGRS.2024.3414505

Chinese Library Classification (CLC) number:

P3 [Geophysics]; P59 [Geochemistry];

Discipline classification codes:

0708 ; 070902 ;

Abstract:

Recognizing visual patterns in the low-data regime requires deep neural networks to glean generalized representations from limited training samples. In this letter, we propose a novel few-shot classification method, namely ProDFSL, leveraging multimodal knowledge and an attention mechanism. We are inspired by recent advances in large language models and the great potential they have shown across a wide range of downstream tasks, and we tailor them to benefit the remote sensing community. We utilize ChatGPT to produce class-specific textual inputs that provide CLIP with rich semantic information. To promote the adaptation of CLIP to the remote sensing domain, we introduce a cross-modal knowledge generation module, which dynamically generates a group of soft prompts conditioned on the few-shot visual samples and further uses a shallow Transformer to model the dependencies between language sequences. Fusing the semantic information with few-shot visual samples, we build representative class prototypes, which are conducive to both inductive and transductive inference. In extensive experiments on standard benchmarks, our ProDFSL consistently outperforms the state of the art in few-shot learning (FSL).


Pages: 5
