Online Continual Learning in Acoustic Scene Classification: An Empirical Study

被引:2
作者
Ha, Donghee [1 ,2 ]
Kim, Mooseop [1 ,2 ]
Jeong, Chi Yoon [1 ,2 ]
机构
[1] Elect & Telecommun Res Inst, Artificial Intelligence Res Lab, 218 Gajeong Ro, Daejeon 34129, Brazil
[2] Univ Sci & Technol, Artificial Intelligence, 217 Gajeong Ro, Daejeon 34113, South Korea
关键词
acoustic scene classification; catastrophic forgetting; continual learning; intransigence; online learning; RECOGNITION;
D O I
10.3390/s23156893
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Numerous deep learning methods for acoustic scene classification (ASC) have been proposed to improve the classification accuracy of sound events. However, only a few studies have focused on continual learning (CL) wherein a model continually learns to solve issues with task changes. Therefore, in this study, we systematically analyzed the performance of ten recent CL methods to provide guidelines regarding their performances. The CL methods included two regularization-based methods and eight replay-based methods. First, we defined realistic and difficult scenarios such as online class-incremental (OCI) and online domain-incremental (ODI) cases for three public sound datasets. Then, we systematically analyzed the performance of each CL method in terms of average accuracy, average forgetting, and training time. In OCI scenarios, iCaRL and SCR showed the best performance for small buffer sizes, and GDumb showed the best performance for large buffer sizes. In ODI scenarios, SCR adopting supervised contrastive learning consistently outperformed the other methods, regardless of the memory buffer size. Most replay-based methods have an almost constant training time, regardless of the memory buffer size, and their performance increases with an increase in the memory buffer size. Based on these results, we must first consider GDumb/SCR for the continual learning methods for ASC.
引用
收藏
页数:19
相关论文
共 57 条
[1]  
Rusu AA, 2016, Arxiv, DOI [arXiv:1606.04671, DOI 10.43550/ARXIV:1606.04671, DOI 10.48550/ARXIV.1606.04671]
[2]   A Review of Deep Learning Based Methods for Acoustic Scene Classification [J].
Abesser, Jakob .
APPLIED SCIENCES-BASEL, 2020, 10 (06)
[3]  
Aljundi R., 2019, P ADV NEUR INF PROC, V32
[4]  
Aljundi R, 2019, ADV NEUR IN, V32
[5]   A comprehensive study of class incremental learning algorithms for visual tasks [J].
Belouadah, Eden ;
Popescu, Adrian ;
Kanellos, Ioannis .
NEURAL NETWORKS, 2021, 135 :38-54
[6]  
Biesialska Magdalena, 2020, P 28 INT C COMP LING, P6523, DOI 10.18653/v1/2020.coling-main.574
[7]   Environmental sound recognition: a survey [J].
Chachada, Sachin ;
Kuo, C. -C. Jay .
APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2014, 3
[8]  
Chaudhry A., 2019, 7 INT C LEARNING REP
[9]  
Chaudhry A., 2018, P ECCV
[10]  
Chaudhry Arslan, 2019, arXiv