Environmental Noise Classification Using Convolutional Neural Networks with Input Transform for Hearing Aids

被引：7

作者：

Park, Gyuseok ^{[1
]}

Lee, Sangmin ^{[1
]}

机构：

[1] Inha Univ, Dept Elect Engn, Incheon 22212, South Korea

来源：

INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH | 2020年 / 17卷 / 07期

基金：

新加坡国家研究基金会;

关键词：

hearing loss; hearing aids; environmental noise; deep learning; convolutional neural networks; SOUND CLASSIFICATION; SPEECH; INTELLIGIBILITY;

D O I：

10.3390/ijerph17072270

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Hearing aids are essential for people with hearing loss, and noise estimation and classification are some of the most important technologies used in devices. This paper presents an environmental noise classification algorithm for hearing aids that uses convolutional neural networks (CNNs) and image signals transformed from sound signals. The algorithm was developed using the data of ten types of noise acquired from living environments where such noises occur. Spectrogram images transformed from sound data are used as the input of the CNNs after processing of the images by a sharpening mask and median filter. The classification results of the proposed algorithm were compared with those of other noise classification methods. A maximum correct classification accuracy of 99.25% was achieved by the proposed algorithm for a spectrogram time length of 1 s, with the correct classification accuracy decreasing with increasing spectrogram time length up to 8 s. For a spectrogram time length of 8 s and using the sharpening mask and median filter, the classification accuracy was 98.73%, which is comparable with the 98.79% achieved by the conventional method for a time length of 1 s. The proposed hearing aid noise classification algorithm thus offers less computational complexity without compromising on performance.

引用

页数：17

共 26 条

[1] Convolutional Neural Networks for Speech Recognition
Abdel-Hamid, Ossama
Mohamed, Abdel-Rahman
Jiang, Hui
Deng, Li
Penn, Gerald
Yu, Dong
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) : 1533 - 1545
[2] [Anonymous], 1998, ISIS TECH REP
[3] [Anonymous], 2002, Spectrum and spectral density estimation by the Discrete Fourier transform
[4] [Anonymous], ARXIV14085882
[5] Beale H.D., 1996, Neural Networks: A Comprehensive Foundation
[6] Sound classification in hearing aids inspired by auditory scene analysis
Büchler, M
Allegro, S
Launer, S
Dillier, N
[J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (18) : 2991 - 3002
[7] A tutorial on Support Vector Machines for pattern recognition
Burges, CJC
[J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) : 121 - 167
[8] DEVELOPMENT OF THE SPEECH-INTELLIGIBILITY RATING (SIR) TEST FOR HEARING-AID COMPARISONS
COX, RM
MCDANIEL, DM
[J]. JOURNAL OF SPEECH AND HEARING RESEARCH, 1989, 32 (02): : 347 - 352
[9] Dillon H., 2008, Hearing Aids
[10] Duda R. O., 1973, Pattern Classification and Scene Analysis, V3

← 1 2 3 →