Class-Incremental Learning Method With Fast Update and High Retainability Based on Broad Learning System

Cited by: 8
Authors
Du, Jie [1]
Liu, Peng [2]
Vong, Chi-Man [2]
Chen, Chuangquan [3]
Wang, Tianfu [1]
Chen, C. L. Philip [4,5]
Affiliations
[1] Shenzhen Univ, Natl Reg Key Technol Engn Lab Med Ultrasound, Guangdong Key Lab Biomed Measurements & Ultrasound, Sch Biomed Engn, Hlth Sci Ctr, Shenzhen 518060, Peoples R China
[2] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
[3] Wuyi Univ, Fac Intelligent Mfg, Jiangmen 529020, Peoples R China
[4] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[5] South China Univ Technol, Pazhou Lab, Guangzhou 510335, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Training; Task analysis; Learning systems; Data models; Predictive models; Correlation; Support vector machines; Broad learning system (BLS); catastrophic forgetting; class correlations; class-incremental learning (CIL); recursive update rule;
DOI
10.1109/TNNLS.2023.3259016
CLC Number
TP18 [Artificial Intelligence Theory]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
Machine learning aims to generate a predictive model from a training dataset of a fixed number of known classes. However, many real-world applications (such as health monitoring and elderly care) are data streams in which new data arrive continually within a short time. Such new data may even belong to previously unknown classes. Hence, class-incremental learning (CIL) is necessary, which incrementally and rapidly updates an existing model with the data of new classes while retaining the existing knowledge of old classes. However, most current CIL methods are designed based on deep models that require a computationally expensive training and update process. In addition, deep-learning-based CIL (DCIL) methods typically employ stochastic gradient descent (SGD) as the optimizer, which forgets the old knowledge to a certain extent. In this article, a broad learning system-based CIL (BLS-CIL) method with fast update and high retainability of old class knowledge is proposed. Traditional BLS is a fast and effective shallow neural network, but it performs poorly on CIL tasks. Our proposed BLS-CIL overcomes these issues and provides the following: 1) high accuracy due to our novel class-correlation loss function that considers the correlations between old and new classes; 2) significantly short training/update time due to the newly derived closed-form solution for our class-correlation loss, without iterative optimization; and 3) high retainability of old class knowledge due to our newly derived recursive update rule for CIL (RULL), which does not replay the exemplars of all old classes, in contrast to exemplar-replaying methods with the SGD optimizer. The proposed BLS-CIL has been evaluated over 12 real-world datasets, including seven tabular/numerical datasets and six image datasets, and the compared methods include one shallow network and seven classical or state-of-the-art DCIL methods. Experimental results show that our BLS-CIL significantly improves the classification performance over a shallow network by a large margin (8.80%-48.42%). It also achieves comparable or even higher accuracy than DCIL methods, while greatly reducing the training time from hours to minutes and the update time from minutes to seconds.
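To make the abstract's two mechanisms concrete (a closed-form solution instead of SGD, and a recursive update that needs no old-class replay), here is a minimal NumPy sketch. It is not the authors' BLS-CIL: the class-correlation loss is simplified to plain ridge regression, the recursive rule is the standard Woodbury/recursive-least-squares identity rather than the paper's RULL, and all names (broad_features, W_feat, K_inv, etc.) are illustrative assumptions.

    import numpy as np

    rng = np.random.default_rng(0)

    def broad_features(X, W_feat, W_enh):
        # Random feature nodes followed by enhancement nodes;
        # the broad expansion is A = [Z | H].
        Z = np.tanh(X @ W_feat)
        H = np.tanh(Z @ W_enh)
        return np.hstack([Z, H])

    d, n_feat, n_enh, lam = 10, 32, 64, 1e-2
    W_feat = rng.standard_normal((d, n_feat))
    W_enh = rng.standard_normal((n_feat, n_enh))

    # Initial training on three old classes: closed-form ridge solution
    # W = (A^T A + lam*I)^-1 A^T Y, with no iterative optimizer.
    X_old = rng.standard_normal((200, d))
    Y_old = np.eye(3)[rng.integers(0, 3, 200)]      # one-hot labels
    A = broad_features(X_old, W_feat, W_enh)
    K_inv = np.linalg.inv(A.T @ A + lam * np.eye(n_feat + n_enh))
    W_out = K_inv @ A.T @ Y_old

    # A new class arrives: widen the output layer and update recursively
    # from the new batch alone (no replay of old-class exemplars).
    X_new = rng.standard_normal((50, d))
    A_new = broad_features(X_new, W_feat, W_enh)
    Y_new = np.hstack([np.zeros((50, 3)), np.ones((50, 1))])   # class 4
    W_out = np.hstack([W_out, np.zeros((n_feat + n_enh, 1))])

    # Woodbury identity refreshes the cached inverse using only new data.
    S = np.linalg.inv(np.eye(50) + A_new @ K_inv @ A_new.T)
    K_inv -= K_inv @ A_new.T @ S @ A_new @ K_inv
    W_out += K_inv @ A_new.T @ (Y_new - A_new @ W_out)  # RLS correction

    print(np.argmax(broad_features(X_new, W_feat, W_enh) @ W_out, axis=1))

In this simplified ridge setting, the Woodbury/RLS correction is algebraically exact: the recursively updated weights equal those from retraining on old plus new data together, which illustrates the sense in which a closed-form recursive rule can retain old-class knowledge without exemplar replay or an SGD optimizer.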
Pages: 11332-11345
Number of pages: 14