A Binaural Steering Beamformer System for Enhancing a Moving Speech Source

被引:37
作者
Adiloglu, Kamil [1 ,2 ]
Kayser, Hendrik [2 ,3 ]
Baumgaertel, Regina M. [2 ,3 ]
Rennebeck, Sanja [2 ,3 ,4 ]
Dietz, Mathias [2 ,3 ]
Hohmann, Volker [1 ,2 ,3 ]
机构
[1] HorTech gGmbH, Marie Curie Str 2, D-26129 Oldenburg, Germany
[2] Cluster Excellence Hearing4all, Oldenburg, Germany
[3] Carl von Ossietzky Univ Oldenburg, Med Phys, D-26111 Oldenburg, Germany
[4] Jade Hsch, Oldenburg, Germany
来源
TRENDS IN HEARING | 2015年 / 19卷
关键词
audio signal localization; signal enhancement; speech intelligibility; objective evaluation; perceptual evaluation; MICROPHONE; ENHANCEMENT;
D O I
10.1177/2331216515618903
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In many daily life communication situations, several sound sources are simultaneously active. While normal-hearing listeners can easily distinguish the target sound source from interfering sound sources-as long as target and interferers are spatially or spectrally separated-and concentrate on the target, hearing-impaired listeners and cochlear implant users have difficulties in making such a distinction. In this article, we propose a binaural approach composed of a spatial filter controlled by a direction-of-arrival estimator to track and enhance a moving target sound. This approach was implemented on a real-time signal processing platform enabling experiments with test subjects in situ. To evaluate the proposed method, a data set of sound signals with a single moving sound source in an anechoic diffuse noise environment was generated using virtual acoustics. The proposed steering method was compared with a fixed (nonsteering) method that enhances sound from the frontal direction in an objective evaluation and subjective experiments using this database. In both cases, the obtained results indicated a significant improvement in speech intelligibility and quality compared with the unprocessed signal. Furthermore, the proposed method outperformed the nonsteering method.
引用
收藏
页数:13
相关论文
共 29 条
  • [1] [Anonymous], ITG C VOIC COMM SPRA
  • [2] [ASA ANSI], 1997, AM NAT STAND METH CA
  • [3] A Binaural CI Research Platform for Oticon Medical SP/XP Implants Enabling ITD/ILD and Variable Rate Processing
    Backus, B.
    Adiloglu, K.
    Herzke, T.
    [J]. TRENDS IN HEARING, 2015, 19
  • [4] Comparing Binaural Pre-processing Strategies II: Speech Intelligibility of Bilateral Cochlear Implant Users
    Baumgaertel, Regina M.
    Hu, Hongmei
    Krawczyk-Becker, Martin
    Marquardt, Daniel
    Herzke, Tobias
    Coleman, Graham
    Adiloglu, Kamil
    Bomke, Katrin
    Plotz, Karsten
    Gerkmann, Timo
    Doclo, Simon
    Kollmeier, Birger
    Hohmann, Volker
    Dietz, Mathias
    [J]. TRENDS IN HEARING, 2015, 19
  • [5] Baumgartel R. M., 2015, P 18 JAHR DTSCH GES
  • [6] Bitzer J, 2001, DIGITAL SIGNAL PROC, P19
  • [7] Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests
    Brand, T
    Kollmeier, B
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (06) : 2801 - 2810
  • [8] Advanced Beamformers for Cochlear Implant Users: Acute Measurement of Speech Perception in Challenging Listening Conditions
    Buechner, Andreas
    Dyballa, Karl-Heinz
    Hehrmann, Phillipp
    Fredelake, Stefan
    Lenarz, Thomas
    [J]. PLOS ONE, 2014, 9 (04):
  • [9] Reduced-bandwidth Multi-channel Wiener Filter based binaural noise reduction and localization cue preservation in binaural hearing aids
    Cornelis, Bram
    Moonen, Marc
    Wouters, Jan
    [J]. SIGNAL PROCESSING, 2014, 99 : 1 - 16
  • [10] Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay
    Gerkmann, Timo
    Hendriks, Richard C.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): : 1383 - 1393