A Binaural Steering Beamformer System for Enhancing a Moving Speech Source

被引:38
作者
Adiloglu, Kamil [1 ,2 ]
Kayser, Hendrik [2 ,3 ]
Baumgaertel, Regina M. [2 ,3 ]
Rennebeck, Sanja [2 ,3 ,4 ]
Dietz, Mathias [2 ,3 ]
Hohmann, Volker [1 ,2 ,3 ]
机构
[1] HorTech gGmbH, Marie Curie Str 2, D-26129 Oldenburg, Germany
[2] Cluster Excellence Hearing4all, Oldenburg, Germany
[3] Carl von Ossietzky Univ Oldenburg, Med Phys, D-26111 Oldenburg, Germany
[4] Jade Hsch, Oldenburg, Germany
关键词
audio signal localization; signal enhancement; speech intelligibility; objective evaluation; perceptual evaluation; MICROPHONE; ENHANCEMENT;
D O I
10.1177/2331216515618903
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In many daily life communication situations, several sound sources are simultaneously active. While normal-hearing listeners can easily distinguish the target sound source from interfering sound sources-as long as target and interferers are spatially or spectrally separated-and concentrate on the target, hearing-impaired listeners and cochlear implant users have difficulties in making such a distinction. In this article, we propose a binaural approach composed of a spatial filter controlled by a direction-of-arrival estimator to track and enhance a moving target sound. This approach was implemented on a real-time signal processing platform enabling experiments with test subjects in situ. To evaluate the proposed method, a data set of sound signals with a single moving sound source in an anechoic diffuse noise environment was generated using virtual acoustics. The proposed steering method was compared with a fixed (nonsteering) method that enhances sound from the frontal direction in an objective evaluation and subjective experiments using this database. In both cases, the obtained results indicated a significant improvement in speech intelligibility and quality compared with the unprocessed signal. Furthermore, the proposed method outperformed the nonsteering method.
引用
收藏
页数:13
相关论文
共 29 条
[1]  
[Anonymous], ITG C VOIC COMM SPRA
[2]  
[ASA ANSI], 1997, AM NAT STAND METH CA
[3]   A Binaural CI Research Platform for Oticon Medical SP/XP Implants Enabling ITD/ILD and Variable Rate Processing [J].
Backus, B. ;
Adiloglu, K. ;
Herzke, T. .
TRENDS IN HEARING, 2015, 19
[4]   Comparing Binaural Pre-processing Strategies II: Speech Intelligibility of Bilateral Cochlear Implant Users [J].
Baumgaertel, Regina M. ;
Hu, Hongmei ;
Krawczyk-Becker, Martin ;
Marquardt, Daniel ;
Herzke, Tobias ;
Coleman, Graham ;
Adiloglu, Kamil ;
Bomke, Katrin ;
Plotz, Karsten ;
Gerkmann, Timo ;
Doclo, Simon ;
Kollmeier, Birger ;
Hohmann, Volker ;
Dietz, Mathias .
TRENDS IN HEARING, 2015, 19
[5]  
Baumgartel R. M., 2015, P 18 JAHR DTSCH GES
[6]  
Bitzer J, 2001, DIGITAL SIGNAL PROC, P19
[7]   Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests [J].
Brand, T ;
Kollmeier, B .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (06) :2801-2810
[8]   Advanced Beamformers for Cochlear Implant Users: Acute Measurement of Speech Perception in Challenging Listening Conditions [J].
Buechner, Andreas ;
Dyballa, Karl-Heinz ;
Hehrmann, Phillipp ;
Fredelake, Stefan ;
Lenarz, Thomas .
PLOS ONE, 2014, 9 (04)
[9]   Reduced-bandwidth Multi-channel Wiener Filter based binaural noise reduction and localization cue preservation in binaural hearing aids [J].
Cornelis, Bram ;
Moonen, Marc ;
Wouters, Jan .
SIGNAL PROCESSING, 2014, 99 :1-16
[10]   Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay [J].
Gerkmann, Timo ;
Hendriks, Richard C. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04) :1383-1393