Classification of Environmental Sounds with Convolutional Neural Networks

Cited by: 0

Authors:

Dincer, Yalcin [1]

Inik, Ozkan [2]

Affiliations:

[1] Bingol Univ, Tekn Bilimler Meslek Yuksekokulu, Bilgisayar Teknol Bolumu, Bingol, Turkiye

[2] Tokat Gaziosmanpasa Univ, Muhendislik & Mimarlik Fak, Bilgisayar Muhendisligi Bolumu, Tokat, Turkiye

Source:

KONYA JOURNAL OF ENGINEERING SCIENCES | 2023 / Vol. 11 / No. 02

Keywords:

Deep Learning; Convolutional Neural Network; Environmental Sound Classification; ESC10; UrbanSound8K; SURVEILLANCE; MATRIX; RECOGNITION;

DOI:

10.36306/konjes.1201558

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08 ;

Abstract:

Sound data is critical for predicting the effects of environmental activities and for gathering information about the settings in which they occur. It provides basic information about urban concerns such as noise pollution, security systems, health care, and local services, which makes Environmental Sound Classification (ESC) increasingly important. Because the volume of data is growing and analysis time is limited, new and powerful artificial intelligence methods are needed that can identify sounds automatically and instantly. Such methods can be built on Convolutional Neural Network (CNN) models, which have achieved high accuracy in other fields. This study therefore proposes a new CNN-based method for classifying two ESC datasets, ESC10 and UrbanSound8K. The sounds are first converted into image format, and novel CNN models are then designed to classify these images. For each dataset, the model with the highest accuracy was selected from among the multiple CNN models designed. The sound recordings in each dataset were converted to images with dimensions of 32x32x3 and 224x224x3, yielding four image datasets. The CNN models developed to classify these datasets are named ESC10_ESA32, ESC10_ESA224, URBANSOUND8K_ESA32, and URBANSOUND8K_ESA224, respectively. The models were trained using 10-fold cross-validation, and their average accuracy rates are 80.75%, 82.25%, 88.60%, and 84.33%, respectively. Compared with baseline studies in the literature on the same datasets, these models achieve better results.
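The two mechanical steps named in the abstract, converting a recording into a fixed-size three-channel image and evaluating with 10-fold cross-validation, can be sketched in plain Python. This is only an illustrative sketch under stated assumptions. The abstract does not say which time-frequency representation the authors used, so a log-magnitude spectrogram is assumed here. The function names (`sound_to_image`, `k_fold_splits`, `mean_accuracy`), the frame length and hop size, the synthetic test tone, and the sample count of 400 are all hypothetical choices made for the example, not details taken from the paper.

```python
import cmath
import math
import random


def sound_to_image(signal, size=32, frame_len=64, hop=32):
    """Turn a 1-D signal into a size x size x 3 image (nested lists, values 0-255).

    Assumption: a log-magnitude spectrogram serves as the image; the abstract
    does not state which representation the authors chose.
    """
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one frame")
    n_bins = frame_len // 2 + 1
    spec = []  # spec[t][f] = log magnitude of frequency bin f in frame t
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        row = []
        for f in range(n_bins):
            # Naive DFT for a single bin (slow, but dependency-free).
            acc = sum(x * cmath.exp(-2j * math.pi * f * n / frame_len)
                      for n, x in enumerate(frame))
            row.append(math.log1p(abs(acc)))
        spec.append(row)
    lo = min(min(r) for r in spec)
    hi = max(max(r) for r in spec)
    scale = 255.0 / (hi - lo) if hi > lo else 0.0
    image = []
    for y in range(size):
        f = y * n_bins // size            # nearest-neighbour pick along frequency
        line = []
        for x in range(size):
            t = x * len(spec) // size     # nearest-neighbour pick along time
            v = int((spec[t][f] - lo) * scale)
            line.append([v, v, v])        # replicate the grey value into 3 channels
        image.append(line)
    return image


def k_fold_splits(n_samples, k=10, seed=0):
    """Yield (train_indices, test_indices) pairs for k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k disjoint folds of near-equal size
    for i in range(k):
        train = [j for m, fold in enumerate(folds) if m != i for j in fold]
        yield train, folds[i]


def mean_accuracy(fold_accuracies):
    """Average the per-fold accuracies into a single reported figure."""
    return sum(fold_accuracies) / len(fold_accuracies)


# Hypothetical inputs: a synthetic 440 Hz tone sampled at 8 kHz, and 400 samples.
tone = [math.sin(2 * math.pi * 440 * n / 8000) for n in range(2048)]
img = sound_to_image(tone, size=32)
splits = list(k_fold_splits(400, k=10))
```

Passing `size=224` would produce the larger image format in the same way. The split shown is a simple shuffled partition and is not stratified by class; whether the authors stratified their folds is not stated in the abstract.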


Pages: 24
