Class-incremental learning (CIL) is challenging due to catastrophic forgetting (CF), which is exacerbated in exemplar-free scenarios. To mitigate CF, knowledge distillation (KD), which leverages the old model as a teacher model, has been widely employed in CIL. However, a case study in our investigation reveals that the teacher model exhibits over-confidence on unseen new samples. In this article, we conduct empirical experiments and provide theoretical analysis to investigate this over-confidence phenomenon and the impact of KD in exemplar-free CIL, where access to old samples is unavailable. Building on our analysis, we propose a novel approach, Learning with Humbler Teacher (LwHT), which systematically selects an appropriate checkpoint model as a humbler teacher to mitigate CF. Furthermore, we explore using the nuclear norm to obtain an appropriate temporal ensemble that enhances model stability. Notably, LwHT outperforms the state-of-the-art approach by significant margins of 10.41%, 6.56%, and 4.31% across various settings while demonstrating superior model plasticity.
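To make the summarized ideas concrete, the following is a minimal illustrative sketch, not the paper's exact procedure: it assumes a PyTorch setting, uses a standard temperature-scaled KD loss, a hypothetical confidence-on-new-data heuristic for selecting a "humbler" checkpoint teacher, and a hypothetical nuclear-norm weighting of temporal-ensemble members. All function names and the selection/weighting heuristics are assumptions for illustration only.

```python
# Illustrative sketch only: distill from an earlier checkpoint ("humbler teacher")
# instead of the most recent model, and weight checkpoints by the nuclear norm
# of their feature matrices. Heuristics and names are hypothetical.
import torch
import torch.nn.functional as F


def kd_loss(student_logits, teacher_logits, temperature=2.0):
    """Standard KD loss: KL divergence between temperature-softened outputs."""
    log_p_s = F.log_softmax(student_logits / temperature, dim=1)
    p_t = F.softmax(teacher_logits / temperature, dim=1)
    return F.kl_div(log_p_s, p_t, reduction="batchmean") * temperature ** 2


@torch.no_grad()
def mean_confidence(model, loader, device="cpu"):
    """Average max-softmax confidence on (unseen) new-task data; a simple
    proxy for detecting over-confident teachers (assumed heuristic)."""
    total, n = 0.0, 0
    for x, _ in loader:
        probs = F.softmax(model(x.to(device)), dim=1)
        total += probs.max(dim=1).values.sum().item()
        n += x.size(0)
    return total / max(n, 1)


def pick_humbler_teacher(checkpoints, loader, device="cpu"):
    """Select the checkpoint with the lowest confidence on new samples,
    one plausible reading of choosing a 'humbler' teacher."""
    return min(checkpoints, key=lambda m: mean_confidence(m, loader, device))


@torch.no_grad()
def nuclear_norm_weights(feature_mats):
    """Weight temporal-ensemble members by the nuclear norm of their feature
    matrices (hypothetical weighting scheme), normalized to sum to one."""
    norms = torch.stack(
        [torch.linalg.matrix_norm(f, ord="nuc") for f in feature_mats]
    )
    return norms / norms.sum()
```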