Prompt-based learning for few-shot class-incremental learning

被引:1
作者
Yuan, Jicheng [1 ,2 ]
Chen, Hang [3 ]
Tian, Songsong [2 ]
Li, Wenfa [1 ]
Li, Lusi [4 ]
Ning, Enhao [1 ]
Zhang, Yugui [2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Intelligent Sci & Technol, Beijing 100083, Peoples R China
[2] Chinese Acad Sci, Inst Semicond, Beijing 100083, Peoples R China
[3] Zhejiang Chuchi Technol Co Ltd, Hangzhou 310000, Peoples R China
[4] Old Dominion Univ, Dept Comp Sci, Norfolk, VA 23529 USA
关键词
few-shot class-incremental learning; Prompt learning; Prototype classifier; Catastrophic forgetting;
D O I
10.1016/j.aej.2025.02.008
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Few-Shot Class-Incremental Learning (FSCIL) aims to enable deep neural networks to incrementally learn new tasks from a limited number of labeled samples, while retaining knowledge of previously learned tasks, mimicking the way humans learn. In this paper, we introduce a novel approach called Prompt Learning for FSCIL (PL-FSCIL), which leverages the power of prompts alongside a pre-trained Vision Transformer (ViT) model to effectively tackle the challenges of FSCIL. Our approach explores the feasibility of directly applying visual prompts in FSCIL, using a simplified model architecture. PL-FSCIL integrates two key prompts: the Domain Prompt and the FSCIL Prompt. Both are tensors incorporated into the attention layer of the ViT network to enhance its capabilities. The Domain Prompt helps the model adapt to new data domains, while the FSCIL Prompt, in combination with a prototype classifier, boosts the model's ability to handle incremental tasks. We evaluate the performance of PL-FSCIL on well-established benchmark datasets, including CIFAR-100 and CUB-200. The results demonstrate competitive performance, highlighting the method's promising potential for real-world applications, particularly in scenarios where high-quality labeled data is scarce. The source code is at: https://github.com/JichengYuan81/PL-FSCIL.
引用
收藏
页码:287 / 295
页数:9
相关论文
共 56 条
[11]   Visual Prompt Tuning [J].
Jia, Menglin ;
Tang, Luming ;
Chen, Bor-Chun ;
Cardie, Claire ;
Belongie, Serge ;
Hariharan, Bharath ;
Lim, Ser-Nam .
COMPUTER VISION - ECCV 2022, PT XXXIII, 2022, 13693 :709-727
[12]   S3C: Self-Supervised Stochastic Classifiers for Few-Shot Class-Incremental Learning [J].
Kalla, Jayateja ;
Biswas, Soma .
COMPUTER VISION, ECCV 2022, PT XXV, 2022, 13685 :432-448
[13]  
Kim D., 2023, 11 INT C LEARN REPR
[14]  
Krizhevsky A., 2009, LEARNING MULTIPLE LA
[15]   Deep learning [J].
LeCun, Yann ;
Bengio, Yoshua ;
Hinton, Geoffrey .
NATURE, 2015, 521 (7553) :436-444
[16]  
Lester B, 2021, Arxiv, DOI arXiv:2104.08691
[17]  
Li JZ, 2022, Arxiv, DOI arXiv:2207.01036
[18]  
Li XLS, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, P4582
[19]  
Lin JH, 2024, AAAI CONF ARTIF INTE, P3422
[20]  
Lin Y, 2021, ADV NEUR IN, V34