Deep CNN Framework for Environmental Sound Classification using Weighting Filters

被引：0

作者：

Tang, Baolong ^{[1
]}

Li, Yuanqing ^{[1
]}

Li, Xuesheng ^{[1
]}

Xu, Limei ^{[1
]}

Yan, Yingchun ^{[1
]}

Yang, Qin ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Aeronaut & Astronaut, Chengdu 611731, Peoples R China

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA) | 2019年

基金：

中国国家自然科学基金;

关键词：

Environment Sound Classification; CNN; Dropout; Weighting Filters; NEURAL-NETWORKS;

D O I：

10.1109/icma.2019.8816567

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep convolutional neural networks have been used to classify environmental sound recently. The classification system with high performance often requires a large well-labled dataset. The cost of tagging audio segments correctly and completely is quite high thus the deep learning models need to have high generalization ability if a weakly-tagged dataset is used. An algorithm named Weighting Filters algorithm(WF) which can be considered as an improved algorithm based on Dropout is proposed in this paper to enhance the generalization ability of models. To implement the Weighting Filters algorithm, an extra layer trained by backpropagation algorithm is introduced to produce a series of weighted filters. The simulation results show that the Weighting Filters algorithm is an effective way to improve the generalization ability of the model. Further more, a deep convolutional neural network using weighting filters algorithm is proposed for the applications of environmental sound classification. The main contributions of this paper are as follows: First, we proposed an effective algorithm WF based on Dropout, and secondly, we proposed a CNN-based framework using WF(CNN-WF) for environmental sound classification. The results obtained on ESC-50 demonstrate that the CNN-based framework we proposed has considerable performance for environmental sound classification.

引用

页码：2303 / 2308

页数：6

共 50 条

[31] Malaria parasite classification framework using a novel channel squeezed and boosted CNN [J].

Khan, Saddam Hussain ;

Shah, Najmus Saher ;

Nuzhat, Rabia ;

Majid, Abdul ;

Alquhayz, Hani ;

Khan, Asifullah .

MICROSCOPY, 2022, 71 (05) :271-282

[32] Fungi affected fruit leaf disease classification using deep CNN architecture [J].

Gaikwad S.S. ;

Rumma S.S. ;

Hangarge M. .

International Journal of Information Technology, 2022, 14 (7) :3815-3824

[33] SPECTROGRAM-BASED CLASSIFICATION OF SPOKEN FOUL LANGUAGE USING DEEP CNN [J].

Wazir, Abdulaziz Saleh Ba ;

Karim, Hezerul Abdul ;

Abdullah, Mohd Haris Lye ;

Mansor, Sarina ;

AlDahoul, Nouar ;

Fauzi, Mohammad Faizal Ahmad ;

See, John .

2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,

[34] Environmental sound classification with dilated convolutions [J].

Chen, Yan ;

Guo, Qian ;

Liang, Xinyan ;

Wang, Jiang ;

Qian, Yuhua .

APPLIED ACOUSTICS, 2019, 148 :123-132

[35] Lung sound classification using deep neural networks with pre-training － Comparison of the performance between CNN, LSTM and convolutional LSTM－ [J].

Wakamoto R. ;

Mabu S. ;

Kido S. ;

Kuremoto T. .

Mabu, Shingo (mabu@yamaguchi-u.ac.jp), 1600, Institute of Electrical Engineers of Japan (140) :1402-1409

[36] From Baselines to DenseNet: A Deep Learning Framework for CNN Optimization and Augmentation [J].

Shatnawi, Hazim ;

Abusager, Mahmoud ;

Saquer, Jamil .

PROCEEDINGS OF THE 2025 ACM SOUTHEAST CONFERENCE, ACMSE 2025, 2025, :2-11

[37] Classification of heart sound signals using a novel deep WaveNet model [J].

Oh, Shu Lih ;

Jahmunah, V ;

Ooi, Chui Ping ;

Tan, Ru-San ;

Ciaccio, Edward J. ;

Yamakawa, Toshitaka ;

Tanabe, Masayuki ;

Kobayashi, Makiko ;

Acharya, U. Rajendra .

COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 196

[38] Simplified swarm optimisation for CNN hyperparameters: a sound classification approach [J].

Liu, Zhenyao ;

Yeh, Wei-Chang .

INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2024, 20 (01) :93-113

[39] CNN-BLSTM based deep learning framework for eukaryotic kinome classification: An explainability based approach [J].

John, Chinju ;

Sahoo, Jayakrushna ;

Sajan, Irish K. ;

Madhavan, Manu ;

Mathew, Oommen K. .

COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2024, 112

[40] Deep and CNN fusion method for binaural sound source localisation [J].

Jiang, Shilong ;

Wu, Lulu ;

Yuan, Peipei ;

Sun, Yongheng ;

Liu, Hong .

JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13) :511-516

← 1 2 3 4 5 →