Deep Learning-Based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients

被引:60
|
作者
Lai, Ying-Hui [1 ]
Tsao, Yu [2 ]
Lu, Xugang [3 ]
Chen, Fei [4 ]
Su, Yu-Ting [5 ]
Chen, Kuang-Chao [6 ,7 ]
Chen, Yu-Hsuan [8 ]
Chen, Li-Ching [7 ]
Li, Lieber Po-Hung [7 ,9 ]
Lee, Chin-Hui [10 ]
机构
[1] Natl Yang Ming Univ, Dept Biomed Engn, Taipei, Taiwan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
[3] Natl Inst Informat & Commun Technol, Tokyo, Japan
[4] Southern Univ Sci & Technol, Dept Elect & Elect Engn, Shenzhen, Peoples R China
[5] Natl Taiwan Normal Univ, Dept Mechatron Engn, Taipei, Taiwan
[6] Far Eastern Mem Hosp, Dept Otolaryngol, New Taipei, Taiwan
[7] Cheng Hsin Gen Hosp, Dept Otolaryngol, 45 Cheng Hsin St, Taipei, Taiwan
[8] Cheng Hsin Gen Hosp, Dept Internal Med, Taipei, Taiwan
[9] Natl Yang Ming Univ, Sch Med, Fac Med, Taipei, Taiwan
[10] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
来源
EAR AND HEARING | 2018年 / 39卷 / 04期
基金
中国国家自然科学基金;
关键词
Cochlear implant; Deep denoising autoencoder; Deep learning; Noise reduction; DENOISING AUTOENCODER; SUBSPACE APPROACH; NEURAL-NETWORKS; VOCODED SPEECH; DYNAMIC-RANGE; RECOGNITION; ENHANCEMENT; HEARING; PERFORMANCE; ALGORITHMS;
D O I
10.1097/AUD.0000000000000537
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Objective: We investigate the clinical effectiveness of a novel deep learning-based noise reduction (NR) approach under noisy conditions with challenging noise types at low signal to noise ratio (SNR) levels for Mandarin-speaking cochlear implant (CI) recipients. Design: The deep learning-based NR approach used in this study consists of two modules: noise classifier (NC) and deep denoising autoencoder (DDAE), thus termed (NC + DDAE). In a series of comprehensive experiments, we conduct qualitative and quantitative analyses on the NC module and the overall NC + DDAE approach. Moreover, we evaluate the speech recognition performance of the NC + DDAE NR and classical single-microphone NR approaches for Mandarin-speaking CI recipients under different noisy conditions. The testing set contains Mandarin sentences corrupted by two types of maskers, two-talker babble noise, and a construction jackhammer noise, at 0 and 5 dB SNR levels. Two conventional NR techniques and the proposed deep learning-based approach are used to process the noisy utterances. We qualitatively compare the NR approaches by the amplitude envelope and spectrogram plots of the processed utterances. Quantitative objective measures include (1) normalized covariance measure to test the intelligibility of the utterances processed by each of the NR approaches; and (2) speech recognition tests conducted by nine Mandarin-speaking CI recipients. These nine CI recipients use their own clinical speech processors during testing. Results: The experimental results of objective evaluation and listening test indicate that under challenging listening conditions, the proposed NC + DDAE NR approach yields higher intelligibility scores than the two compared classical NR techniques, under both matched and mismatched training-testing conditions. Conclusions: When compared to the two well-known conventional NR techniques under challenging listening condition, the proposed NC + DDAE NR approach has superior noise suppression capabilities and gives less distortion for the key speech envelope information, thus, improving speech recognition more effectively for Mandarin CI recipients. The results suggest that the proposed deep learning-based NR approach can potentially be integrated into existing CI signal processors to overcome the degradation of speech perception caused by noise.
引用
收藏
页码:795 / 809
页数:15
相关论文
共 50 条
  • [1] A Deep Learning based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients in the Presence of Competing Speech Noise
    Wang, Syu-Siang
    Tsao, Yu
    Wang, Hsiao-Lan Sharon
    Lai, Ying-Hui
    Li, Lieber Po-Hung
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 808 - 812
  • [2] PREDICTING THE EFFECT OF AGC ON SPEECH INTELLIGIBILITY OF COCHLEAR IMPLANT RECIPIENTS IN NOISE
    Khing, Phyu P.
    Ambikairajah, Eliathamby
    Swanson, Brett A.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8061 - 8065
  • [3] Speech enhancement based on harmonic estimation combined with MMSE to improve speech intelligibility for cochlear implant recipients
    Wang, Dongmei
    Hansen, John H. L.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 186 - 190
  • [4] The impact of reverberation on speech intelligibility in cochlear implant recipients
    Kressner, Abigail Anne
    Westermann, Adam
    Buchholz, Jorg M.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 144 (02): : 1113 - 1122
  • [5] Effect of Noise Reduction Gain Errors on Simulated Cochlear Implant Speech Intelligibility
    Kressner, Abigail A.
    May, Tobias
    Dau, Torsten
    TRENDS IN HEARING, 2019, 23
  • [6] Effect of noise and reverberation on speech intelligibility for cochlear implant recipients in realistic sound environments
    Badajoz-Davila, Javier
    Buchholz, Jorg M.
    Van-Hoesel, Richard
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 147 (05): : 3538 - 3549
  • [7] Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users
    Goehring, Tobias
    Bolner, Federico
    Monaghan, Jessica J. M.
    van Dijk, Bas
    Zarowski, Andrzej
    Bleeck, Stefan
    HEARING RESEARCH, 2017, 344 : 183 - 194
  • [8] A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation
    Lai, Ying-Hui
    Chen, Fei
    Wang, Syu-Siang
    Lu, Xugang
    Tsao, Yu
    Lee, Chin-Hui
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2017, 64 (07) : 1568 - 1578
  • [9] Effects of a transient noise reduction algorithm on speech intelligibility in noise, noise tolerance and perceived annoyance in cochlear implant users
    Dingemanse, J. Gertjan
    Vroegop, Jantien L.
    Goedegebure, Andre
    INTERNATIONAL JOURNAL OF AUDIOLOGY, 2018, 57 (05) : 360 - 369
  • [10] Effect of adaptive beamforming and noise reduction algorithms on speech intelligibility and noise tolerance in bimodal cochlear implant users
    Michels, Anne
    Oukheira, Yassine
    Brendel, Martina
    Aschendorff, Antje
    Arndt, Susan
    Wesarg, Thomas
    COCHLEAR IMPLANTS INTERNATIONAL, 2022, 23 (03) : 148 - 157