Class-Incremental Learning Method With Fast Update and High Retainability Based on Broad Learning System

被引：8

作者：

Du, Jie ^{[1
]}

Liu, Peng ^{[2
]}

Vong, Chi-Man ^{[2
]}

Chen, Chuangquan ^{[3
]}

Wang, Tianfu ^{[1
]}

Chen, C. L. Philip ^{[4
,5
]}

机构：

[1] Shenzhen Univ, Natl Reg Key Technol Engn Lab Med Ultrasound, Guangdong Key Lab Biomed Measurements & Ultrasound, Sch Biomed Engn,Hlth Sci Ctr, Shenzhen 518060, Peoples R China

[2] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China

[3] Wuyi Univ, Fac Intelligent Mfg, Jiangmen 529020, Peoples R China

[4] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China

[5] South China Univ Technol, Pazhou Lab, Guangzhou 510335, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Training; Task analysis; Learning systems; Data models; Predictive models; Correlation; Support vector machines; Broad learning system (BLS); catastrophic forgetting; class correlations; class-incremental learning (CIL); recursive update rule;

D O I：

10.1109/TNNLS.2023.3259016

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Machine learning aims to generate a predictive model from a training dataset of a fixed number of known classes. However, many real-world applications (such as health monitoring and elderly care) are data streams in which new data arrive continually in a short time. Such new data may even belong to previously unknown classes. Hence, class-incremental learning (CIL) is necessary, which incrementally and rapidly updates an existing model with the data of new classes while retaining the existing knowledge of old classes. However, most current CIL methods are designed based on deep models that require a computationally expensive training and update process. In addition, deep learning based CIL (DCIL) methods typically employ stochastic gradient descent (SGD) as an optimizer that forgets the old knowledge to a certain extent. In this article, a broad learning system-based CIL (BLS-CIL) method with fast update and high retainability of old class knowledge is proposed. Traditional BLS is a fast and effective shallow neural network, but it does not work well on CIL tasks. However, our proposed BLS-CIL can overcome these issues and provide the following: 1) high accuracy due to our novel class-correlation loss function that considers the correlations between old and new classes; 2) significantly short training/update time due to the newly derived closed-form solution for our class-correlation loss without iterative optimization; and 3) high retainability of old class knowledge due to our newly derived recursive update rule for CIL (RULL) that does not replay the exemplars of all old classes, as contrasted to the exemplars-replaying methods with the SGD optimizer. The proposed BLS-CIL has been evaluated over 12 real-world datasets, including seven tabular/numerical datasets and six image datasets, and the compared methods include one shallow network and seven classical or state-of-the-art DCIL methods. Experimental results show that our BIL-CIL can significantly improve the classification performance over a shallow network by a large margin (8.80%-48.42%). It also achieves comparable or even higher accuracy than DCIL methods, but greatly reduces the training time from hours to minutes and the update time from minutes to seconds.

引用

页码：11332 / 11345

页数：14

共 58 条

[41] Model Behavior Preserving for Class-Incremental Learning [J].

Liu, Yu ;

Hong, Xiaopeng ;

Tao, Xiaoyu ;

Dong, Songlin ;

Shi, Jingang ;

Gong, Yihong .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) :7529-7540

[42] Stacked Broad Learning System: From Incremental Flatted Structure to Deep Model [J].

Liu, Zhulin ;

Chen, C. L. Philip ;

Feng, Shuang ;

Feng, Qiying ;

Zhang, Tong .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (01) :209-222

[43]

Principe W. Liu, 2011, KERNEL ADAPTIVE FILT

[44]

Rajasegaran J, 2019, ADV NEUR IN, V32

[45] iCaRL: Incremental Classifier and Representation Learning [J].

Rebuffi, Sylvestre-Alvise ;

Kolesnikov, Alexander ;

Sperl, Georg ;

Lampert, Christoph H. .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5533-5542

[46] ImageNet Large Scale Visual Recognition Challenge [J].

Russakovsky, Olga ;

Deng, Jia ;

Su, Hao ;

Krause, Jonathan ;

Satheesh, Sanjeev ;

Ma, Sean ;

Huang, Zhiheng ;

Karpathy, Andrej ;

Khosla, Aditya ;

Bernstein, Michael ;

Berg, Alexander C. ;

Fei-Fei, Li .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) :211-252

[47] Continual Learning for Real-World Autonomous Systems: Algorithms, Challenges and Frameworks [J].

Shaheen, Khadija ;

Hanif, Muhammad Abdullah ;

Hasan, Osman ;

Shafique, Muhammad .

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 105 (01)

[48]

Vakili M., 2021, ARXIV

[49]

Woolson R., 2007, Wiley Encyclopedia Clin. Trials, P1, DOI DOI 10.1002/9780471462422.EOCT979

[50]

Xiaoyu Tao, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12364), P254, DOI 10.1007/978-3-030-58529-7_16

← 1 2 3 4 5 6 →