BINAURAL MASK-INFORMED SPEECH ENHANCEMENT FOR HEARING AIDS WITH HEAD TRACKING

被引:0
作者
Moore, Alastair H. [1 ]
Lightburn, Leo [1 ]
Xue, Wei [1 ]
Naylor, Patrick A. [1 ]
Brookes, Mike [1 ]
机构
[1] Imperial Coll London, Elect & Elect Engn, Exhibit Rd, London, England
来源
2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC) | 2018年
基金
英国工程与自然科学研究理事会;
关键词
Beamforming; Speech enhancement; Time-frequency mask; Assisted listening; Head rotation; INTELLIGIBILITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An end-to-end speech enhancement system for hearing aids is proposed which seeks to improve the intelligibility of binaural speech in noise during head movement. The system uses a reference beamformer whose look direction is informed by knowledge of the head orientation and the a priori known direction of the desired source. From this a time-frequency mask is estimated using a deep neural network. The binaural signals are obtained using bilateral beamformers followed by a classical minimum mean square error speech enhancer, modified to use the estimated mask as a speech presence probability prior. In simulated experiments, the improvement in a binaural intelligibility metric (DBSTOI) given by the proposed system relative to beamforming alone corresponds to an SNR improvement of 4 to 6 dB. Results also demonstrate the individual contributions of incorporating the mask and the head orientation-aware beam steering to the proposed system.
引用
收藏
页码:461 / 465
页数:5
相关论文
共 31 条
[1]   Predicting the Intelligibility of Noisy and Nonlinearly Processed Binaural Speech [J].
Andersen, Asger Heidemann ;
de Haan, Jan Mark ;
Tan, Zheng-Hua ;
Jensen, Jesper .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) :1908-1920
[2]  
[Anonymous], 1993, ITU T RECOMMENDATION
[3]  
[Anonymous], 1999, ART VOIC
[4]  
[Anonymous], 2010, P INT WORKSH AC ECH
[5]  
ANSI, 1997, METH CALC SPEECH INT, pS35
[6]   System identification in the short-time Fourier transform domain with crossband filtering [J].
Avargel, Yekutiel ;
Cohen, Israel .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04) :1305-1319
[7]  
Brandstein M., 2001, MICROPHONE ARRAYS SI
[8]  
Brookes D. M., 1997, VOICEBOX: A speech processing toolbox for MATLAB
[9]   Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises [J].
Chen, Jitong ;
Wang, Yuxuan ;
Yoho, Sarah E. ;
Wang, DeLiang ;
Healy, Eric W. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 139 (05) :2604-2612
[10]   Noise estimation by minima controlled recursive averaging for robust speech enhancement [J].
Cohen, I ;
Berdugo, B .
IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) :12-15