A novel spiral pattern and 2D M4 pooling based environmental sound classification method

Citations: 11
Authors
Tuncer, Turker [1]
Subasi, Abdulhamit [2]
Ertam, Fatih [1]
Dogan, Sengul [1]
Affiliations
[1] Firat Univ, Technol Fac, Dept Digital Forens Engn, Elazig, Turkey
[2] Effat Univ, Coll Engn, Dept Informat Syst, Jeddah, Saudi Arabia
Keywords
Environmental sound classification; Spiral pattern; 2D M4 pooling; Deep neural network; Machine learning; Digital forensics; CONVOLUTIONAL NEURAL-NETWORKS; RECOGNITION; BINARY; SYSTEM
DOI
10.1016/j.apacoust.2020.107508
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Environmental sound classification (ESC) is one of the crucial problems in signal processing, digital forensics, and machine learning. Several ESC methods have been presented to obtain highly accurate models. In this work, a novel multilevel ESC method is presented. It uses two novel algorithms: Spiral Pattern and two-dimensional maximum, minimum, median, and mean (2D-M4) pooling. Using these two algorithms, a 9-level feature generation approach is presented. Since the proposed Spiral Pattern has nine arrows, it extracts 9 and 18 bits using the signum and ternary functions, respectively. As a result, 1536 features are extracted at each level, and 15,360 features in total are generated from the 0th to the 9th level. To select the discriminative features, neighbourhood component analysis (NCA) is used, and the 700 most distinctive features are selected. In the classification phase, a deep neural network is trained and tested on the ESC-10 and ESC-50 datasets. Average classification accuracies of 98.75% and 85.75% were achieved with 10-fold cross-validation on the ESC-10 and ESC-50 datasets, respectively. The experimental results reveal that the proposed Spiral Pattern and 2D-M4 pooling based ESC method is superior to the human auditory system (HAS) for environmental sound classification. (C) 2020 Elsevier Ltd. All rights reserved.
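The pooling step and the feature counts in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives neither the block size nor how the four pooled outputs are combined, so the non-overlapping 2x2 blocks and the helper name `m4_pool_2d` are assumptions. The split of 1536 into three 512-bin histograms (one 9-bit signum code plus two 9-bit ternary codes) is likewise only one reading that matches the stated numbers.

```python
import numpy as np


def m4_pool_2d(x, block=2):
    """Hypothetical helper: pool non-overlapping block x block tiles of a
    2D array with maximum, minimum, median and mean (the four M4 statistics).
    The block size of 2 is an assumption, not taken from the paper."""
    x = np.asarray(x, dtype=float)
    rows = (x.shape[0] // block) * block  # crop so the tiles fit exactly
    cols = (x.shape[1] // block) * block
    tiles = x[:rows, :cols].reshape(rows // block, block, cols // block, block)
    # Bring the two within-tile axes together and flatten them.
    tiles = tiles.transpose(0, 2, 1, 3).reshape(rows // block, cols // block, -1)
    return {
        "max": tiles.max(axis=-1),
        "min": tiles.min(axis=-1),
        "median": np.median(tiles, axis=-1),
        "mean": tiles.mean(axis=-1),
    }


demo = np.arange(16).reshape(4, 4)
pooled = m4_pool_2d(demo)

# Feature-count arithmetic from the abstract.
BITS_PER_CODE = 9        # nine arrows in the Spiral Pattern (from the abstract)
CODES_PER_LEVEL = 3      # assumption: 1 signum code + 2 ternary codes (9 + 18 bits)
features_per_level = CODES_PER_LEVEL * 2 ** BITS_PER_CODE
levels = len(range(0, 10))  # 0th to 9th level, as stated in the abstract
total_features = features_per_level * levels
```

Under these assumptions `features_per_level` comes to 1536 and `total_features` to 15,360, which agrees with the figures quoted in the abstract.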
Pages: 11