Speech enhancement based on the general transfer function GSC and postfiltering

被引:103
作者
Gannot, S [1 ]
Cohen, I
机构
[1] Bar Ilan Univ, Sch Engn, IL-52900 Ramat Gan, Israel
[2] Technion Israel Inst Technol, Fac Elect Engn, IL-32000 Haifa, Israel
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2004年 / 12卷 / 06期
关键词
generalized sidelobe canceller; microphone arrays; nonstationarity; postfiltering; speech enhancement;
D O I
10.1109/TSA.2004.834599
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In speech enhancement applications microphone array postfiltering allows additional reduction of noise components at a beamformer output. Among microphone array structures the recently proposed general transfer function generalized sidelobe canceller (TF-GSC) has shown impressive noise reduction abilities in a directional noise field, while still maintaining low speech distortion. However, in a diffused noise field less significant noise reduction is obtainable. The performance is even further degraded when the noise signal is nonstationary. In this contribution we propose three postfiltering methods for improving the performance of microphone arrays. Two of which are based on single-channel speech enhancers and making use of recently proposed algorithms concatenated to the beamformer output. The third is a multichannel speech enhancer which exploits noise-only components constructed within the TF-GSC structure. This work concentrates on the assessment of the proposed postfiltering structures. An extensive experimental study, which consists of both objective and subjective evaluation in various noise fields, demonstrates the advantage of the multichannel postfiltering compared to the single-channel techniques.
引用
收藏
页码:561 / 571
页数:11
相关论文
共 27 条
  • [1] [Anonymous], P IEEE ICASSP 88
  • [2] [Anonymous], 1993, S111986 ANSI
  • [3] [Anonymous], P INT WORKSH AC ECH
  • [4] Multi-microphone noise reduction techniques as front-end devices for speech recognition
    Bitzer, J
    Simmer, KU
    Kammeyer, KD
    [J]. SPEECH COMMUNICATION, 2001, 34 (1-2) : 3 - 12
  • [5] BITZER J, 1999, P INT WORKSH AC ECH, P100
  • [6] BOLL SF, 1983, SPEECH ENHANCEMENT, P61
  • [7] Speech enhancement using a mixture-maximum model
    Burshtein, D
    Gannot, S
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (06): : 341 - 351
  • [8] BURSHTEIN D, 1999, P 6 EUR C SPEECH COM, V6, P2591
  • [9] Cohen I, 2002, INT CONF ACOUST SPEE, P901
  • [10] Noise estimation by minima controlled recursive averaging for robust speech enhancement
    Cohen, I
    Berdugo, B
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) : 12 - 15