Class similarity weighted knowledge distillation for few shot incremental learning

被引：5

作者：

Akmel, Feidu ^{[1
]}

Meng, Fanman ^{[1
]}

Wu, Qingbo ^{[1
]}

Chen, Shuai ^{[1
]}

Zhang, Runtong ^{[1
]}

Assefa, Maregu ^{[2
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu, Peoples R China

[2] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Peoples R China

来源：

NEUROCOMPUTING | 2024年 / 584卷

关键词：

Knowledge distillation; Semantic information; Few shot; Incremental learning;

D O I：

10.1016/j.neucom.2024.127587

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Few -shot class incremental learning illustrates the challenges of learning new concepts, where the learner can access only a small sample per concept. The standard incremental learning techniques cannot be applied directly because of the small number of samples for training. Moreover, catastrophic forgetting is the propensity of an Artificial Neural Network to fully and abruptly forget previously learned knowledge upon learning new knowledge. This problem happens due to a lack of supervision in older classes or an imbalance between the old and new classes. In this work, we propose a new distillation structure to tackle the forgetting and overfitting issues. Particularly, we suggest a dual distillation module that adaptably draws knowledge from two different but complementary teachers. The first teacher is the base model, which has been trained on large class data, and the second teacher is the updated model from the previous K-1 session, which contains the modified knowledge of previously observed new classes. Thus, the first teacher can reduce overfitting issues by transferring the knowledge obtained from the base classes to the new classes. While the second teacher can reduce knowledge forgetting by distilling knowledge from the previous model. Additionally, we use semantic information as word embedding to facilitate the distillation process. To align visual and semantic vectors, we used the attention mechanism of the embedding of visual data. With extensive experiments on different data sets such as Mini-ImageNet, CIFAR100, and CUB200, our model shows state-of-the-art performance compared to the existing few shot incremental learning methods.

引用

页数：11

共 50 条

[41] Semantic-visual Guided Transformer for Few-shot Class-incremental Learning [J].

Qiu, Wenhao ;

Fu, Sichao ;

Zhang, Jingyi ;

Lei, Chengxiang ;

Peng, Qinmu .

2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, :2885-2890

[42] Aggregatedf-average neural network applied to few-shot class incremental learning [J].

Vu, Mathieu ;

Chouzenoux, Emilie ;

Ben Ayed, Ismail ;

Pesquet, Jean-Christophe .

SIGNAL PROCESSING, 2025, 237

[43] Graph knowledge distillation-guided few-shot learning for hyperspectral image classification [J].

Li, Xiaolong ;

Ma, Huifang ;

Guo, Shuheng ;

Zhang, Di ;

Li, Zhixin .

JOURNAL OF SUPERCOMPUTING, 2025, 81 (11)

[44] Contrastive knowledge-augmented self-distillation approach for few-shot learning [J].

Zhang, Lixu ;

Shao, Mingwen ;

Chen, Sijie ;

Liu, Fukang .

JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (05)

[45] A pseudo-labeling approach based on knowledge distillation for graph few-shot learning [J].

Wu, Zongqian ;

Zhou, Peng ;

Wen, Guoqiu ;

Zhu, Xiaofeng .

INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (06)

[46] Layer-Specific Knowledge Distillation for Class Incremental Semantic Segmentation [J].

Wang, Qilong ;

Wu, Yiwen ;

Yang, Liu ;

Zuo, Wangmeng ;

Hu, Qinghua .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :1977-1989

[47] Incremental attribute learning by knowledge distillation method [J].

Kuang, Zhejun ;

Wang, Jingrui ;

Sun, Dawen ;

Zhao, Jian ;

Shi, Lijuan ;

Xiong, Xingbo .

JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2024, 11 (05) :259-283

[48] Incremental few-shot learning via implanting and consolidating [J].

Li, Yiting ;

Zhu, Haiyue ;

Ma, Jun ;

Xiang, Cheng ;

Vadakkepat, Prahlad .

NEUROCOMPUTING, 2023, 559

[49] BFCP: Pursue Better Forward Compatibility Pretraining for Few-Shot Class-Incremental Learning [J].

Fu, Zhiling ;

Wang, Zhe ;

Xu, Xinlei ;

Guo, Wei ;

Chi, Ziqiu ;

Yang, Hai ;

Du, Wenli .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,

[50] Mitigate forgetting in few-shot class-incremental learning using different image views [J].

Mazumder, Pratik ;

Singh, Pravendra .

NEURAL NETWORKS, 2023, 165 :999-1009

← 1 2 3 4 5 →