Class-Incremental Learning Method With Fast Update and High Retainability Based on Broad Learning System

Cited by: 8
Authors
Du, Jie [1]
Liu, Peng [2]
Vong, Chi-Man [2]
Chen, Chuangquan [3]
Wang, Tianfu [1]
Chen, C. L. Philip [4,5]
Affiliations
[1] Shenzhen Univ, Natl Reg Key Technol Engn Lab Med Ultrasound, Guangdong Key Lab Biomed Measurements & Ultrasound, Sch Biomed Engn, Hlth Sci Ctr, Shenzhen 518060, Peoples R China
[2] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
[3] Wuyi Univ, Fac Intelligent Mfg, Jiangmen 529020, Peoples R China
[4] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[5] South China Univ Technol, Pazhou Lab, Guangzhou 510335, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Training; Task analysis; Learning systems; Data models; Predictive models; Correlation; Support vector machines; Broad learning system (BLS); catastrophic forgetting; class correlations; class-incremental learning (CIL); recursive update rule;
DOI
10.1109/TNNLS.2023.3259016
CLC Number
TP18 [Artificial Intelligence Theory]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
Machine learning aims to generate a predictive model from a training dataset of a fixed number of known classes. However, many real-world applications (such as health monitoring and elderly care) are data streams in which new data arrive continually within a short time. Such new data may even belong to previously unknown classes. Hence, class-incremental learning (CIL) is necessary, which incrementally and rapidly updates an existing model with the data of new classes while retaining the existing knowledge of old classes. However, most current CIL methods are designed based on deep models that require a computationally expensive training and update process. In addition, deep-learning-based CIL (DCIL) methods typically employ stochastic gradient descent (SGD) as the optimizer, which forgets the old knowledge to a certain extent. In this article, a broad learning system-based CIL (BLS-CIL) method with fast update and high retainability of old class knowledge is proposed. Traditional BLS is a fast and effective shallow neural network, but it performs poorly on CIL tasks. Our proposed BLS-CIL overcomes these issues and provides the following: 1) high accuracy due to our novel class-correlation loss function that considers the correlations between old and new classes; 2) significantly short training/update time due to the newly derived closed-form solution for our class-correlation loss, without iterative optimization; and 3) high retainability of old class knowledge due to our newly derived recursive update rule for CIL (RULL), which does not replay the exemplars of all old classes, in contrast to exemplar-replaying methods with the SGD optimizer. The proposed BLS-CIL has been evaluated over 12 real-world datasets, including seven tabular/numerical datasets and six image datasets, and the compared methods include one shallow network and seven classical or state-of-the-art DCIL methods. Experimental results show that our BLS-CIL significantly improves the classification performance over a shallow network by a large margin (8.80%-48.42%). It also achieves comparable or even higher accuracy than DCIL methods, while greatly reducing the training time from hours to minutes and the update time from minutes to seconds.
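To make the abstract's two mechanisms concrete (a closed-form solution instead of SGD, and a recursive update that needs no old-class replay), here is a minimal NumPy sketch. It is not the authors' BLS-CIL: the class-correlation loss is simplified to plain ridge regression, the recursive rule is the standard Woodbury/recursive-least-squares identity rather than the paper's RULL, and all names (broad_features, W_feat, K_inv, etc.) are illustrative assumptions.

    import numpy as np

    rng = np.random.default_rng(0)

    def broad_features(X, W_feat, W_enh):
        # Random feature nodes followed by enhancement nodes;
        # the broad expansion is A = [Z | H].
        Z = np.tanh(X @ W_feat)
        H = np.tanh(Z @ W_enh)
        return np.hstack([Z, H])

    d, n_feat, n_enh, lam = 10, 32, 64, 1e-2
    W_feat = rng.standard_normal((d, n_feat))
    W_enh = rng.standard_normal((n_feat, n_enh))

    # Initial training on three old classes: closed-form ridge solution
    # W = (A^T A + lam*I)^-1 A^T Y, with no iterative optimizer.
    X_old = rng.standard_normal((200, d))
    Y_old = np.eye(3)[rng.integers(0, 3, 200)]      # one-hot labels
    A = broad_features(X_old, W_feat, W_enh)
    K_inv = np.linalg.inv(A.T @ A + lam * np.eye(n_feat + n_enh))
    W_out = K_inv @ A.T @ Y_old

    # A new class arrives: widen the output layer and update recursively
    # from the new batch alone (no replay of old-class exemplars).
    X_new = rng.standard_normal((50, d))
    A_new = broad_features(X_new, W_feat, W_enh)
    Y_new = np.hstack([np.zeros((50, 3)), np.ones((50, 1))])   # class 4
    W_out = np.hstack([W_out, np.zeros((n_feat + n_enh, 1))])

    # Woodbury identity refreshes the cached inverse using only new data.
    S = np.linalg.inv(np.eye(50) + A_new @ K_inv @ A_new.T)
    K_inv -= K_inv @ A_new.T @ S @ A_new @ K_inv
    W_out += K_inv @ A_new.T @ (Y_new - A_new @ W_out)  # RLS correction

    print(np.argmax(broad_features(X_new, W_feat, W_enh) @ W_out, axis=1))

In this simplified ridge setting, the Woodbury/RLS correction is algebraically exact: the recursively updated weights equal those from retraining on old plus new data together, which illustrates the sense in which a closed-form recursive rule can retain old-class knowledge without exemplar replay or an SGD optimizer.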
Pages: 11332-11345
Number of pages: 14