Preserving Locality in Vision Transformers for Class Incremental Learning

Citations: 3

Authors:

Zheng, Bowen [1]

Zhou, Wei [1]

Ye, Han-Jia [1]

Zhan, De-Chuan [1]

Affiliations:

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China

Source:

2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME | 2023

Funding:

National Key Research and Development Program of China;

Keywords:

Class Incremental Learning; Vision Transformer;

DOI:

10.1109/ICME55011.2023.00202

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Learning new classes without forgetting is crucial for classification models in real-world applications. Vision Transformers (ViTs) have recently achieved remarkable performance in Class Incremental Learning (CIL). Previous works mainly focus on block design and model expansion for ViTs. In this paper, however, we find that when a ViT is trained incrementally, its attention layers gradually lose concentration on local features. We call this phenomenon Locality Degradation in ViTs for CIL. Since low-level local information is crucial to the transferability of the representation, it is beneficial to preserve locality in the attention layers. We encourage the model to preserve more local information as training proceeds and devise a Locality-Preserved Attention (LPA) layer to emphasize the importance of local features. Specifically, we incorporate the local information directly into the vanilla attention and control the initial gradients of the vanilla attention by weighting it with a small initial value. Extensive experiments show that the representations facilitated by LPA capture more low-level general information, which is easier to transfer to follow-up tasks. The improved model achieves consistently better performance on CIFAR100 and ImageNet100. The source code is available at https://github.com/bwnzheng/LPA_ICME2023.
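
The abstract does not give the exact form of the LPA layer, so the following is only an illustrative sketch of the general idea and not the authors' implementation (see the repository above for that). It assumes a fixed Gaussian distance prior over patch positions as the "local information", adds the vanilla softmax attention scaled by a small weight `alpha`, and renormalizes each row. The names `local_prior`, `vanilla_attention`, `blended_attention`, `alpha`, and `sigma` are all hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def local_prior(grid, sigma=1.0):
    """Assumed locality term: each patch on a grid x grid layout attends
    to other patches with a Gaussian falloff in squared grid distance."""
    coords = [(r, c) for r in range(grid) for c in range(grid)]
    rows = []
    for r1, c1 in coords:
        logits = [
            -((r1 - r2) ** 2 + (c1 - c2) ** 2) / (2 * sigma ** 2)
            for r2, c2 in coords
        ]
        rows.append(softmax(logits))
    return rows


def vanilla_attention(q, k):
    """Standard scaled dot-product attention weights softmax(q k^T / sqrt(d))."""
    d = len(q[0])
    return [
        softmax([sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d) for kj in k])
        for qi in q
    ]


def blended_attention(q, k, grid, alpha=0.01, sigma=1.0):
    """Local prior plus alpha-weighted vanilla attention, renormalized per row.

    With a small alpha the vanilla branch contributes little at the start,
    and any gradient flowing into it is scaled by alpha / (1 + alpha).
    The renormalization is an assumption of this sketch.
    """
    local = local_prior(grid, sigma)
    vanilla = vanilla_attention(q, k)
    return [
        [(l + alpha * v) / (1.0 + alpha) for l, v in zip(lrow, vrow)]
        for lrow, vrow in zip(local, vanilla)
    ]
```

In this sketch, setting `alpha` to zero recovers the purely local prior, while letting it grow (for example as a learnable scalar) would shift weight toward content-based attention; whether the paper treats the weight this way is not stated in the abstract.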

Pages: 1157-1162

Page count: 6
