PKI: Prior knowledge-infused neural network for few-shot class-incremental learning

被引:0
作者
Bao, Kexin [1 ,2 ]
Lin, Fanzhao [3 ]
Wang, Zichen [4 ]
Li, Yong [1 ]
Zeng, Dan [5 ]
Ge, Shiming [1 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing 100092, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing 100049, Peoples R China
[3] NARI Technol Co Ltd, Nanjing 210000, Jiangsu, Peoples R China
[4] George Washington Univ, Sch Engn & Appl Sci, Washington, DC 20052 USA
[5] Shanghai Univ, Dept Commun Engn, Shanghai 200040, Peoples R China
关键词
Few-shot learning; Class-incremental learning; Catastrophic forgetting;
D O I
10.1016/j.neunet.2025.107724
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot class-incremental learning (FSCIL) aims to continually adapt a model on a limited number of new-class examples, facing two well-known challenges: catastrophic forgetting and overfitting to new classes. Existing methods tend to freeze more parts of network components and finetune others with an extra memory during incremental sessions. These methods emphasize preserving prior knowledge to ensure proficiency in recognizing old classes, thereby mitigating catastrophic forgetting. Meanwhile, constraining fewer parameters can help in overcoming overfitting with the assistance of prior knowledge. Following previous methods, we retain more prior knowledge and propose a prior knowledge-infused neural network (PKI) to facilitate FSCIL. PKI consists of a backbone, an ensemble of projectors, a classifier, and an extra memory. In each incremental session, we build a new projector and add it to the ensemble. Subsequently, we finetune the new projector and the classifier jointly with other frozen network components, ensuring the rich prior knowledge is utilized effectively. By cascading projectors, PKI integrates prior knowledge accumulated from previous sessions and learns new knowledge flexibly, which helps to recognize old classes and efficiently learn new classes. Further, to reduce the resource consumption associated with keeping many projectors, we design two variants of the prior knowledge-infused neural network (PKIV-1 and PKIV-2) to trade off a balance between resource consumption and performance by reducing the number of projectors. Extensive experiments on three popular benchmarks demonstrate that our approach outperforms state-of-the-art methods.
引用
收藏
页数:10
相关论文
共 71 条
[1]  
Achituve I., 2021, INT C MACH LEARN, P54
[2]   Semantics-Driven Generative Replay for Few-Shot Class Incremental Learning [J].
Agarwal, Aishwarya ;
Banerjee, Biplab ;
Cuzzolin, Fabio ;
Chaudhuri, Subhasis .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :5246-5254
[3]   Few-Shot Class Incremental Learning Leveraging Self-Supervised Features [J].
Ahmad, Touqeer ;
Dhamija, Akshay Raj ;
Cruz, Steve ;
Rabinowitz, Ryan ;
Li, Chunchun ;
Jafarzadeh, Mohsen ;
Boult, Terrance E. .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :3899-3909
[4]   OrCo: Towards Better Generalization via Orthogonality and Contrast for Few-Shot Class-Incremental Learning [J].
Ahmed, Noor ;
Kukleva, Anna ;
Schiele, Bernt .
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, :28762-28771
[5]  
Akyurek A., 2022, INT C LEARN REPR
[6]   Memory Aware Synapses: Learning What (not) to Forget [J].
Aljundi, Rahaf ;
Babiloni, Francesca ;
Elhoseiny, Mohamed ;
Rohrbach, Marcus ;
Tuytelaars, Tinne .
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 :144-161
[7]   A class-incremental learning approach for learning feature-compatible embeddings [J].
An, Hongchao ;
Yang, Jing ;
Zhang, Xiuhua ;
Ruan, Xiaoli ;
Wu, Yuankai ;
Li, Shaobo ;
Hu, Jianjun .
NEURAL NETWORKS, 2024, 180
[8]   IL2M: Class Incremental Learning With Dual Memory [J].
Belouadah, Eden ;
Popescu, Adrian .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :583-592
[9]   Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence [J].
Chaudhry, Arslan ;
Dokania, Puneet K. ;
Ajanthan, Thalaiyasingam ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 :556-572
[10]  
Chen K., 2021, INT C LEARN REPR