Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement

Cited by: 12
Authors
Green, Tim [1]
Hilkhuysen, Gaston [1]
Huckvale, Mark [1]
Rosen, Stuart [1]
Brookes, Mike [2]
Moore, Alastair [2]
Naylor, Patrick [2]
Lightburn, Leo [2]
Xue, Wei [2]
Affiliations
[1] UCL, Dept Speech Hearing & Phonet Sci, Chandler House,2 Wakefield St, London WC1N 1PF, England
[2] Imperial Coll, Dept Elect & Elect Engn, London, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
Hearing loss; binaural hearing; cocktail party listening; spatial hearing; intelligibility; noise; separation; reception; array
DOI
10.1177/23312165211068629
Chinese Library Classification (CLC) codes
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification codes
100104 ; 100213 ;
Abstract
A signal processing approach combining beamforming with mask-informed speech enhancement was assessed by measuring sentence recognition in listeners with mild-to-moderate hearing impairment in adverse listening conditions that simulated the output of behind-the-ear hearing aids in a noisy classroom. Two types of beamforming were compared: binaural, with the two microphones of each aid treated as a single array, and bilateral, where independent left and right beamformers were derived. Binaural beamforming produces a narrower beam, maximising improvement in signal-to-noise ratio (SNR), but eliminates the spatial diversity that is preserved in bilateral beamforming. Each beamformer type was optimised for the true target position and implemented with and without additional speech enhancement in which spectral features extracted from the beamformer output were passed to a deep neural network trained to identify time-frequency regions dominated by target speech. Additional conditions comprising binaural beamforming combined with speech enhancement implemented using Wiener filtering or modulation-domain Kalman filtering were tested in normally-hearing (NH) listeners. Both beamformer types gave substantial improvements relative to no processing, with significantly greater benefit for binaural beamforming. Performance with additional mask-informed enhancement was poorer than with beamforming alone, for both beamformer types and both listener groups. In NH listeners the addition of mask-informed enhancement produced significantly poorer performance than both other forms of enhancement, neither of which differed from the beamformer alone. In summary, the additional improvement in SNR provided by binaural beamforming appeared to outweigh loss of spatial information, while speech understanding was not further improved by the mask-informed enhancement method implemented here.
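The trade-off described in the abstract can be illustrated with a toy numerical sketch. It is not the authors' processing chain. It assumes a frontal target that is already time-aligned at every microphone, independent white noise at each microphone, a plain delay-and-sum beamformer, and an oracle ratio mask standing in for the deep neural network's mask estimate. All names (`delay_and_sum`, `frames_to_spec`, `snr_masked`, and so on) are hypothetical.

```python
import numpy as np

def snr_db(target, residual):
    """SNR in dB given the target and whatever is left besides the target."""
    return 10.0 * np.log10(np.sum(target ** 2) / np.sum(residual ** 2))

def delay_and_sum(channels):
    """Average already time-aligned channels (rows are microphones)."""
    return np.mean(channels, axis=0)

def frames_to_spec(x, frame_len):
    """Split into non-overlapping rectangular frames and take one-sided FFTs."""
    return np.fft.rfft(x.reshape(-1, frame_len), axis=1)

def spec_to_signal(spec, frame_len):
    """Invert frames_to_spec (exact for non-overlapping rectangular frames)."""
    return np.fft.irfft(spec, n=frame_len, axis=1).reshape(-1)

rng = np.random.default_rng(0)
fs, n_samples, frame_len = 16000, 16384, 512
t = np.arange(n_samples) / fs
target = np.sin(2 * np.pi * 440.0 * t)        # toy stand-in for target speech
noise = rng.standard_normal((4, n_samples))   # assumed independent noise, 4 mics

# Time-aligned target is identical at every mic, so averaging leaves it unchanged.
snr_single = snr_db(target, noise[0])                      # one microphone
snr_bilateral = snr_db(target, delay_and_sum(noise[:2]))   # two mics of one aid
bf_noise = delay_and_sum(noise)                            # all four mics
snr_binaural = snr_db(target, bf_noise)

# Oracle ratio mask computed from the known target and noise spectra.
# A real system must estimate this mask from the noisy signal alone.
T = frames_to_spec(target, frame_len)
N = frames_to_spec(bf_noise, frame_len)
mask = np.abs(T) ** 2 / (np.abs(T) ** 2 + np.abs(N) ** 2 + 1e-12)
enhanced = spec_to_signal(mask * (T + N), frame_len)
snr_masked = snr_db(target, enhanced - target)   # residual includes distortion

print(f"single mic      : {snr_single:6.2f} dB")
print(f"bilateral (2)   : {snr_bilateral:6.2f} dB")
print(f"binaural  (4)   : {snr_binaural:6.2f} dB")
print(f"binaural + mask : {snr_masked:6.2f} dB")
```

Under these idealised assumptions, doubling the number of averaged microphones adds roughly 3 dB of SNR, which is the sense in which a four-microphone binaural array can outperform two independent two-microphone bilateral arrays. The oracle mask raises SNR further in this toy case, but an SNR gain is not an intelligibility gain: the study found that listeners performed worse with the estimated mask than with beamforming alone.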
Pages: 16