CNN hyper-parameter optimization for environmental sound classification

被引:34
作者
Inik, Ozkan [1 ]
机构
[1] Tokat Gaziosmanpasa Univ, Dept Comp Engn, Tokat, Turkiye
关键词
Environmental sound classification (ESC); CNN; Particle swarm optimization (PSO); Hyper -parameter optimization; Urbansound8k; ESC-50; PARTICLE SWARM OPTIMIZATION; NEURAL-NETWORK; SURVEILLANCE; ARCHITECTURES; RECOGNITION; FEATURES;
D O I
10.1016/j.apacoust.2022.109168
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Environmental sounds are being used widely in our lives. It is especially used in tasks such as managing smart cities, location determination, surveillance systems, machine hearing, and environmental monitoring. The main method for this, environmental sound classification (ESC), has been increasingly studied in recent years. However, the classification of these sounds is more difficult than other sounds because there are too many parameters that generate noise. The study tried to find the convolutional neural network (CNN) model that gave the highest accuracy for ESC tasks with the optimization of hyper-parameters. For this purpose, the Particle Swarm Optimization (PSO) algorithm was rearranged to represent the CNN architecture. Thus, the hyper-parameters in CNN are represented exactly without any transformation during optimization. Studies were carried out on the ESC-10, ESC-50, and Urbansound8k data sets, which are state-of-art for ESC tasks. Some data augmentation techniques have been used for data sets in the training of CNN models. The CNN models, which were obtained with PSO, achieved success rates of 98.64 % for ESC-10, 93.71 % for ESC-50, and 98.45 % for Urbansound8k, respectively. These results are the best accuracy values obtained with the pure CNN model when compared with previous studies. As a result, it has been made possible to automatically design CNN models for the classification of urban sounds, giving high classification accuracy. Thus, researchers who do not know much about CNN design can use this method in their desired datasets without the need for expert knowledge.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:25
相关论文
共 76 条
[21]   Particle swarm optimization of deep neural networks architectures for image classification [J].
Fernandes Junior, Francisco Erivaldo ;
Yen, Gary G. .
SWARM AND EVOLUTIONARY COMPUTATION, 2019, 49 :62-74
[22]  
Geiger JT, 2015, EUR SIGNAL PR CONF, P714, DOI 10.1109/EUSIPCO.2015.7362476
[23]   CNN architecture optimization using bio-inspired algorithms for breast cancer detection in infrared images [J].
Goncalves, Caroline Barcelos ;
Souza, Jefferson R. ;
Fernandes, Henrique .
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 142
[24]   Environmental sound monitoring using machine learning on mobile devices [J].
Green, Marc ;
Murphy, Damian .
APPLIED ACOUSTICS, 2020, 159
[25]  
Heittola T, 2010, EUR SIGNAL PR CONF, P1272
[26]  
Huang J, 2002, IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, pA253
[27]  
Inik O., 2017, Gaziosmanpasa Bilimsel Arastirma Dergisi (GBAD), V6, P85
[28]  
Intani P, 2013, 2013 13TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2013), P1751, DOI 10.1109/ICCAS.2013.6704220
[29]  
Kennedy J., 2001, Swarm Intelligence, DOI [10.1016/B978-1-55860-595-4.X5000-1, DOI 10.1016/B978-1-55860-595-4.X5000-1]
[30]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90