A new lateral geniculate nucleus pattern-based environmental sound classification using a new large sound dataset

被引：15

作者：

Tasci, Burak ^{[1
]}

Acharya, Madhav R. ^{[2
]}

Barua, Prabal Datta ^{[3
,4
]}

Yildiz, Arif Metehan ^{[5
]}

Gun, Mehmet Veysel ^{[5
]}

Keles, Tugce ^{[5
]}

Dogan, Sengul ^{[5
]}

Tuncer, Turker ^{[5
]}

机构：

[1] Firat Univ, Vocat Sch Tech Sci, TR-23119 Elazig, Turkey

[2] Manipal Acad Higher Educ, Dept Biomed Engn, Manipal, Karnataka, India

[3] Univ Southern Queensland, Sch Business Informat Syst, Toowoomba, Qld 4350, Australia

[4] Univ Technol Sydney, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia

[5] Firat Univ, Coll Technol, Dept Digital Forens Engn, Elazig, Turkey

来源：

APPLIED ACOUSTICS | 2022年 / 196卷

关键词：

Environmental sound classification; Human auditory system; Human visual system; Sound classification; Hand-modeled learning method; HEARING;

D O I：

10.1016/j.apacoust.2022.108897

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Background and purpose: One of the essential purposes of sound classification is to achieve similar/over classification ability of the human auditory system (HAS). A new dataset and a biologically inspired feature extraction function have been proposed to realize this aim. We have developed a highly accurate sound classification architecture using the proposed biological-inspired feature extraction function. Materials and methods: In this research, a new environmental sound classification (ESC) dataset has been collected as a testbed, and this dataset contains 5000 sounds with 50 classes. Moreover, the collected ESC sound dataset is balanced. A new hand-modeled sound classification model has been proposed to classify sounds of this dataset. This model consists of (i) feature generation using a new lateral geniculate nucleus pattern (LGNPat), statistical moments and discrete wavelet transform (DWT), (ii) loop-based (iterative) neighborhood component analysis (INCA) based feature selection, (iii) classification using the selected features by k nearest neighbors (kNN) with 10-fold cross-validation. The proposed sound classification architecture is an extendable model. In this aspect, a new generation of hand-modeled sound/onedimensional signal classification methods can be proposed. Results: The presented hand-modeled learning method was applied to the ESC dataset acquired, and our LGNPat-based model attained 93.34% classification. Conclusions: The computed 93.34% on the collected ESC dataset has demonstrated the success of our model. Moreover, the collected dataset has been publicly published. In this aspect, the published dataset can be used to improve advanced sound classification models. (C) 2022 Elsevier Ltd. All rights reserved.

引用

页数：11

共 51 条

[1] BARF: A new direct and cross-based binary residual feature fusion with uncertainty-aware module for medical image classification [J].

Abdar, Moloud ;

Fahami, Mohammad Amin ;

Chakrabarti, Satarupa ;

Khosravi, Abbas ;

Plawiak, Pawel ;

Acharya, U. Rajendra ;

Tadeusiewicz, Ryszard ;

Nahavandi, Saeid .

INFORMATION SCIENCES, 2021, 577 (577) :353-378

[2]

Adams J. W., 2021, Handbook to service the deaf and hard of hearing: A bridge to accessibility

[3]

[Anonymous], 2015, Int. J. ue-Service, Sci. Technol., DOI DOI 10.14257/IJUNESST.2015.8.1.12

[4] A novel biometric recognition method based on multi kernelled bijection octal pattern using gait sound [J].

Aydemir, Emrah ;

Tuncer, Turker ;

Dogan, Sengul ;

Unsal, Musa .

APPLIED ACOUSTICS, 2021, 173

[5] CNN-RNN and Data Augmentation Using Deep Convolutional Generative Adversarial Network for Environmental Sound Classification [J].

Bahmei, Behnaz ;

Birmingham, Elina ;

Arzanpour, Siamak .

IEEE SIGNAL PROCESSING LETTERS, 2022, 29 :682-686

[6]

Bandara M, 2017, DESIGN ROAD SIDE THR

[7]

Banuroopa K., 2022, INT J NONLINEAR ANAL, V12, P2125, DOI 10.22075/ijnaa.2022.6049

[8] Classifying environmental sounds using image recognition networks [J].

Boddapati, Venkatesh ;

Petef, Andrej ;

Rasmusson, Jim ;

Lundberg, Lars .

KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS, 2017, 112 :2048-2056

[9] The world report on hearing, 2021 [J].

Chadha, Shelly ;

Kamenov, Kaloyan ;

Cieza, Alarcos .

BULLETIN OF THE WORLD HEALTH ORGANIZATION, 2021, 99 (04) :242-+

[10] A Comprehensive Review of Polyphonic Sound Event Detection [J].

Chan, T. K. ;

Chin, Cheng Siong .

IEEE ACCESS, 2020, 8 :103339-103373

← 1 2 3 4 5 6 →