Preserving Locality in Vision Transformers for Class Incremental Learning

Cited by: 3
Authors
Zheng, Bowen [1]
Zhou, Wei [1]
Ye, Han-Jia [1]
Zhan, De-Chuan [1]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
Source
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME | 2023
Funding
National Key Research and Development Program of China;
Keywords
Class Incremental Learning; Vision Transformer;
DOI
10.1109/ICME55011.2023.00202
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Learning new classes without forgetting old ones is crucial for classification models in real-world applications. Vision Transformers (ViTs) have recently achieved remarkable performance in Class Incremental Learning (CIL). Previous works mainly focus on block design and model expansion for ViTs. However, we find that when a ViT is trained incrementally, its attention layers gradually lose their concentration on local features. We call this phenomenon Locality Degradation in ViTs for CIL. Since low-level local information is crucial to the transferability of the representation, it is beneficial to preserve locality in the attention layers. In this paper, we encourage the model to preserve more local information as training proceeds and devise a Locality-Preserved Attention (LPA) layer to emphasize the importance of local features. Specifically, we incorporate the local information directly into the vanilla attention and control the initial gradients of the vanilla attention by weighting it with a small initial value. Extensive experiments show that the representations facilitated by LPA capture more low-level general information, which is easier to transfer to follow-up tasks. The improved model achieves consistently better performance on CIFAR100 and ImageNet100. The source code is available at https://github.com/bwnzheng/LPA_ICME2023.
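The abstract does not give the LPA formula, so the following is only an illustrative sketch of the idea it describes: a local term is combined with vanilla softmax attention, and the vanilla term is scaled by a small weight. The Gaussian prior over patch-grid distance, the renormalization step, and the names `local_prior`, `blended_attention`, `sigma`, and `alpha` are all assumptions made for this example and are not taken from the paper or its repository. In a trainable layer, `alpha` would presumably be a learnable parameter that starts small; here it is a plain constant.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def local_prior(grid, sigma=1.0):
    """Hypothetical local term: each patch on a grid x grid layout attends
    to the others with Gaussian weights that decay with squared distance."""
    coords = [(r, c) for r in range(grid) for c in range(grid)]
    prior = []
    for r1, c1 in coords:
        scores = [
            -((r1 - r2) ** 2 + (c1 - c2) ** 2) / (2.0 * sigma ** 2)
            for r2, c2 in coords
        ]
        prior.append(softmax(scores))
    return prior


def blended_attention(scores, prior, alpha=0.01):
    """Assumed blend: prior + alpha * softmax(scores), renormalized per row.
    A small alpha keeps the content-based (vanilla) term weak at the start."""
    blended = []
    for score_row, prior_row in zip(scores, prior):
        vanilla = softmax(score_row)
        mixed = [p + alpha * v for p, v in zip(prior_row, vanilla)]
        total = sum(mixed)
        blended.append([m / total for m in mixed])
    return blended


# Usage: 9 patches on a 3x3 grid with uninformative (all-zero) attention scores.
grid = 3
prior = local_prior(grid)
scores = [[0.0] * (grid * grid) for _ in range(grid * grid)]
attn = blended_attention(scores, prior, alpha=0.01)
```

With a small `alpha`, each row of `attn` stays close to the local prior, so a patch attends mostly to itself and its neighbours; raising `alpha` lets the content-based scores take over.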
Pages: 1157-1162
Number of pages: 6