Prompt-based learning for few-shot class-incremental learning

被引:1
作者
Yuan, Jicheng [1 ,2 ]
Chen, Hang [3 ]
Tian, Songsong [2 ]
Li, Wenfa [1 ]
Li, Lusi [4 ]
Ning, Enhao [1 ]
Zhang, Yugui [2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Intelligent Sci & Technol, Beijing 100083, Peoples R China
[2] Chinese Acad Sci, Inst Semicond, Beijing 100083, Peoples R China
[3] Zhejiang Chuchi Technol Co Ltd, Hangzhou 310000, Peoples R China
[4] Old Dominion Univ, Dept Comp Sci, Norfolk, VA 23529 USA
关键词
few-shot class-incremental learning; Prompt learning; Prototype classifier; Catastrophic forgetting;
D O I
10.1016/j.aej.2025.02.008
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Few-Shot Class-Incremental Learning (FSCIL) aims to enable deep neural networks to incrementally learn new tasks from a limited number of labeled samples, while retaining knowledge of previously learned tasks, mimicking the way humans learn. In this paper, we introduce a novel approach called Prompt Learning for FSCIL (PL-FSCIL), which leverages the power of prompts alongside a pre-trained Vision Transformer (ViT) model to effectively tackle the challenges of FSCIL. Our approach explores the feasibility of directly applying visual prompts in FSCIL, using a simplified model architecture. PL-FSCIL integrates two key prompts: the Domain Prompt and the FSCIL Prompt. Both are tensors incorporated into the attention layer of the ViT network to enhance its capabilities. The Domain Prompt helps the model adapt to new data domains, while the FSCIL Prompt, in combination with a prototype classifier, boosts the model's ability to handle incremental tasks. We evaluate the performance of PL-FSCIL on well-established benchmark datasets, including CIFAR-100 and CUB-200. The results demonstrate competitive performance, highlighting the method's promising potential for real-world applications, particularly in scenarios where high-quality labeled data is scarce. The source code is at: https://github.com/JichengYuan81/PL-FSCIL.
引用
收藏
页码:287 / 295
页数:9
相关论文
共 56 条
[1]  
Brown TB, 2020, ADV NEUR IN, V33
[2]  
Buck Christian, 2018, INT C LEARN REPR
[3]   MetaFSCEL A Meta-Learning Approach for Few-Shot Class Incremental Learning [J].
Chi, Zhixiang ;
Gu, Li ;
Liu, Huan ;
Wang, Yang ;
Yu, Yuanhao ;
Tang, Jin .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :14146-14155
[4]  
Coates A., 2011, International Conference on Artificial Intelligence and Statistics, P215
[5]  
Dodge J, 2020, Arxiv, DOI [arXiv:2002.06305, DOI 10.48550/ARXIV.2002.06305]
[6]  
Dosovitskiy Alexey., 2021, PROC INT C LEARN REP, P2021, DOI DOI 10.48550/ARXIV.2010.11929
[7]  
Gao Tianyu., 2020, arXiv
[8]  
Griffin G., 2007, Caltech-256 object category dataset. Technical Report 7694
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   Constrained Few-shot Class-incremental Learning [J].
Hersche, Michael ;
Karunaratne, Geethan ;
Cherubini, Giovanni ;
Benini, Luca ;
Sebastian, Abu ;
Rahimi, Abbas .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :9047-9057