Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement

Cited by: 12

Authors:

Green, Tim [1]

Hilkhuysen, Gaston [1]

Huckvale, Mark [1]

Rosen, Stuart [1]

Brookes, Mike [2]

Moore, Alastair [2]

Naylor, Patrick [2]

Lightburn, Leo [2]

Xue, Wei [2]

Affiliations:

[1] UCL, Dept Speech Hearing & Phonet Sci, Chandler House, 2 Wakefield St, London WC1N 1PF, England

[2] Imperial Coll, Dept Elect & Elect Engn, London, England

Source:

TRENDS IN HEARING | 2022 / Vol. 26

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC);

Keywords:

Hearing loss; binaural hearing; cocktail party listening; spatial hearing; BINAURAL HEARING; INTELLIGIBILITY; NOISE; SEPARATION; RECEPTION; ARRAY;

DOI:

10.1177/23312165211068629

Chinese Library Classification (CLC):

R36 [Pathology]; R76 [Otorhinolaryngology];

Discipline classification codes:

100104 ; 100213 ;

Abstract:

A signal processing approach combining beamforming with mask-informed speech enhancement was assessed by measuring sentence recognition in listeners with mild-to-moderate hearing impairment in adverse listening conditions that simulated the output of behind-the-ear hearing aids in a noisy classroom. Two types of beamforming were compared: binaural, with the two microphones of each aid treated as a single array, and bilateral, where independent left and right beamformers were derived. Binaural beamforming produces a narrower beam, maximising improvement in signal-to-noise ratio (SNR), but eliminates the spatial diversity that is preserved in bilateral beamforming. Each beamformer type was optimised for the true target position and implemented with and without additional speech enhancement in which spectral features extracted from the beamformer output were passed to a deep neural network trained to identify time-frequency regions dominated by target speech. Additional conditions comprising binaural beamforming combined with speech enhancement implemented using Wiener filtering or modulation-domain Kalman filtering were tested in normally-hearing (NH) listeners. Both beamformer types gave substantial improvements relative to no processing, with significantly greater benefit for binaural beamforming. Performance with additional mask-informed enhancement was poorer than with beamforming alone, for both beamformer types and both listener groups. In NH listeners the addition of mask-informed enhancement produced significantly poorer performance than both other forms of enhancement, neither of which differed from the beamformer alone. In summary, the additional improvement in SNR provided by binaural beamforming appeared to outweigh loss of spatial information, while speech understanding was not further improved by the mask-informed enhancement method implemented here.


Pages: 16

50 items in total

[31] Effect of compression release time of a hearing aid on sentence recognition and the quality judgment of speech [J].

Shetty, Hemanth Narayan ;

Raju, Suma .

NOISE & HEALTH, 2019, 21 (103) :232-241

[32] Assessing speech recognition abilities with digits in noise in cochlear implant and hearing aid users [J].

Kaandorp, Marre W. ;

Smits, Cas ;

Merkus, Paul ;

Goverts, S. Theo ;

Festen, Joost M. .

INTERNATIONAL JOURNAL OF AUDIOLOGY, 2015, 54 (01) :48-57

[33] Conditional Emission Densities for Combining Speech Enhancement and Recognition Systems [J].

Sehr, Armin ;

Yoshioka, Takuya ;

Delcroix, Marc ;

Kinoshita, Keisuke ;

Nakatani, Tomohiro ;

Maas, Roland ;

Kellermann, Walter .

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, :3469-3473

[34] Effect of Hearing Aid Bandwidth on Speech Recognition Performance of Listeners Using a Cochlear Implant and Contralateral Hearing Aid (Bimodal Hearing) [J].

Neuman, Arlene C. ;

Svirsky, Mario A. .

EAR AND HEARING, 2013, 34 (05) :553-561

[35] Assessment of hearing aid algorithms using a master hearing aid: the influence of hearing aid experience on the relationship between speech recognition and cognitive capacity [J].

Raehlmann, Sebastian ;

Meis, Markus ;

Schulte, Michael ;

Kiessling, Juergen ;

Walger, Martin ;

Meister, Hartmut .

INTERNATIONAL JOURNAL OF AUDIOLOGY, 2018, 57 :S105-S111

[36] Speech Recognition for Bilaterally Asymmetric and Symmetric Hearing Aid Microphone Modes in Simulated Classroom Environments [J].

Ricketts, Todd A. ;

Picou, Erin M. .

EAR AND HEARING, 2013, 34 (05) :601-609

[37] Evaluation of Speech Recognition Skills in Different Noises with the Turkish Matrix Sentence Test in Hearing Aid Users [J].

Cildir, Bunyamin ;

Tokgoz-Yilmaz, Suna .

TURKISH ARCHIVES OF OTORHINOLARYNGOLOGY, 2021, 59 (02) :133-138

[38] The Effect of Hearing Aid Bandwidth and Configuration of Hearing Loss on Bimodal Speech Recognition in Cochlear Implant Users [J].

Neuman, Arlene C. ;

Zeman, Annette ;

Neukam, Jonathan ;

Wang, Binhuan ;

Svirsky, Mario A. .

EAR AND HEARING, 2019, 40 (03) :621-635

[39] Measuring Speech Intelligibility and Hearing-Aid Benefit Using Everyday Conversational Sentences in Real-World Environments [J].

Miles, Kelly ;

Beechey, Timothy ;

Best, Virginia ;

Buchholz, Jorg .

FRONTIERS IN NEUROSCIENCE, 2022, 16

[40] Multi-objective learning based speech enhancement method to increase speech quality and intelligibility for hearing aid device users [J].

Lai, Ying-Hui ;

Zheng, Wei-Zhong .

BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2019, 48 :35-45
