CNN hyper-parameter optimization for environmental sound classification

被引:34
作者
Inik, Ozkan [1 ]
机构
[1] Tokat Gaziosmanpasa Univ, Dept Comp Engn, Tokat, Turkiye
关键词
Environmental sound classification (ESC); CNN; Particle swarm optimization (PSO); Hyper -parameter optimization; Urbansound8k; ESC-50; PARTICLE SWARM OPTIMIZATION; NEURAL-NETWORK; SURVEILLANCE; ARCHITECTURES; RECOGNITION; FEATURES;
D O I
10.1016/j.apacoust.2022.109168
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Environmental sounds are being used widely in our lives. It is especially used in tasks such as managing smart cities, location determination, surveillance systems, machine hearing, and environmental monitoring. The main method for this, environmental sound classification (ESC), has been increasingly studied in recent years. However, the classification of these sounds is more difficult than other sounds because there are too many parameters that generate noise. The study tried to find the convolutional neural network (CNN) model that gave the highest accuracy for ESC tasks with the optimization of hyper-parameters. For this purpose, the Particle Swarm Optimization (PSO) algorithm was rearranged to represent the CNN architecture. Thus, the hyper-parameters in CNN are represented exactly without any transformation during optimization. Studies were carried out on the ESC-10, ESC-50, and Urbansound8k data sets, which are state-of-art for ESC tasks. Some data augmentation techniques have been used for data sets in the training of CNN models. The CNN models, which were obtained with PSO, achieved success rates of 98.64 % for ESC-10, 93.71 % for ESC-50, and 98.45 % for Urbansound8k, respectively. These results are the best accuracy values obtained with the pure CNN model when compared with previous studies. As a result, it has been made possible to automatically design CNN models for the classification of urban sounds, giving high classification accuracy. Thus, researchers who do not know much about CNN design can use this method in their desired datasets without the need for expert knowledge.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:25
相关论文
共 76 条
[1]   End-to-end environmental sound classification using a 1D convolutional neural network [J].
Abdoli, Sajjad ;
Cardinal, Patrick ;
Koerich, Alessandro Lameiras .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 136 :252-263
[2]   Noisy vehicle surveillance camera: A system to deter noisy vehicle in smart city [J].
Agha, Apoory ;
Ranjan, Rishabh ;
Gan, Woon-Seng .
APPLIED ACOUSTICS, 2017, 117 :236-245
[3]   An automated environmental sound classification methods based on statistical and textural feature [J].
Akbal, Erhan .
APPLIED ACOUSTICS, 2020, 167
[4]   A study of the accuracy of mobile technology for measuring urban noise pollution in large scale participatory sensing campaigns [J].
Aumond, Pierre ;
Lavandier, Catherine ;
Ribeiro, Carlos ;
Boix, Elisa Gonzalez ;
Kambona, Kennedy ;
D'Hondt, Ellie ;
Delaitre, Pauline .
APPLIED ACOUSTICS, 2017, 117 :219-226
[5]  
Aytar Y, 2016, ADV NEUR IN, V29
[6]   Acoustic Scene Classification [J].
Barchiesi, Daniele ;
Giannoulis, Dimitrios ;
Stowell, Dan ;
Plumbley, Mark D. .
IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (03) :16-34
[7]   Feature Learning With Matrix Factorization Applied to Acoustic Scene Classification [J].
Bisot, Victor ;
Serizel, Romain ;
Essid, Slim ;
Richard, Gael .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) :1216-1229
[8]   Classifying environmental sounds using image recognition networks [J].
Boddapati, Venkatesh ;
Petef, Andrej ;
Rasmusson, Jim ;
Lundberg, Lars .
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 :2048-2056
[9]   Urban noise recognition with convolutional neural network [J].
Cao, Jiuwen ;
Cao, Min ;
Wang, Jianzhong ;
Yin, Chun ;
Wang, Danping ;
Vidal, Pierre-Paul .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (20) :29021-29041
[10]   Environmental sound classification with dilated convolutions [J].
Chen, Yan ;
Guo, Qian ;
Liang, Xinyan ;
Wang, Jiang ;
Qian, Yuhua .
APPLIED ACOUSTICS, 2019, 148 :123-132