Environmental Noise Classification Using Convolutional Neural Networks with Input Transform for Hearing Aids

被引:7
作者
Park, Gyuseok [1 ]
Lee, Sangmin [1 ]
机构
[1] Inha Univ, Dept Elect Engn, Incheon 22212, South Korea
基金
新加坡国家研究基金会;
关键词
hearing loss; hearing aids; environmental noise; deep learning; convolutional neural networks; SOUND CLASSIFICATION; SPEECH; INTELLIGIBILITY;
D O I
10.3390/ijerph17072270
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Hearing aids are essential for people with hearing loss, and noise estimation and classification are some of the most important technologies used in devices. This paper presents an environmental noise classification algorithm for hearing aids that uses convolutional neural networks (CNNs) and image signals transformed from sound signals. The algorithm was developed using the data of ten types of noise acquired from living environments where such noises occur. Spectrogram images transformed from sound data are used as the input of the CNNs after processing of the images by a sharpening mask and median filter. The classification results of the proposed algorithm were compared with those of other noise classification methods. A maximum correct classification accuracy of 99.25% was achieved by the proposed algorithm for a spectrogram time length of 1 s, with the correct classification accuracy decreasing with increasing spectrogram time length up to 8 s. For a spectrogram time length of 8 s and using the sharpening mask and median filter, the classification accuracy was 98.73%, which is comparable with the 98.79% achieved by the conventional method for a time length of 1 s. The proposed hearing aid noise classification algorithm thus offers less computational complexity without compromising on performance.
引用
收藏
页数:17
相关论文
共 26 条
  • [1] Convolutional Neural Networks for Speech Recognition
    Abdel-Hamid, Ossama
    Mohamed, Abdel-Rahman
    Jiang, Hui
    Deng, Li
    Penn, Gerald
    Yu, Dong
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) : 1533 - 1545
  • [2] [Anonymous], 1998, ISIS TECH REP
  • [3] [Anonymous], 2002, Spectrum and spectral density estimation by the Discrete Fourier transform
  • [4] [Anonymous], ARXIV14085882
  • [5] Beale H.D., 1996, Neural Networks: A Comprehensive Foundation
  • [6] Sound classification in hearing aids inspired by auditory scene analysis
    Büchler, M
    Allegro, S
    Launer, S
    Dillier, N
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (18) : 2991 - 3002
  • [7] A tutorial on Support Vector Machines for pattern recognition
    Burges, CJC
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) : 121 - 167
  • [8] DEVELOPMENT OF THE SPEECH-INTELLIGIBILITY RATING (SIR) TEST FOR HEARING-AID COMPARISONS
    COX, RM
    MCDANIEL, DM
    [J]. JOURNAL OF SPEECH AND HEARING RESEARCH, 1989, 32 (02): : 347 - 352
  • [9] Dillon H., 2008, Hearing Aids
  • [10] Duda R. O., 1973, Pattern Classification and Scene Analysis, V3