Speech enhancement based on the general transfer function GSC and postfiltering

被引：104

作者：

Gannot, S ^{[1
]}

Cohen, I

机构：

[1] Bar Ilan Univ, Sch Engn, IL-52900 Ramat Gan, Israel

[2] Technion Israel Inst Technol, Fac Elect Engn, IL-32000 Haifa, Israel

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2004年 / 12卷 / 06期

关键词：

generalized sidelobe canceller; microphone arrays; nonstationarity; postfiltering; speech enhancement;

D O I：

10.1109/TSA.2004.834599

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In speech enhancement applications microphone array postfiltering allows additional reduction of noise components at a beamformer output. Among microphone array structures the recently proposed general transfer function generalized sidelobe canceller (TF-GSC) has shown impressive noise reduction abilities in a directional noise field, while still maintaining low speech distortion. However, in a diffused noise field less significant noise reduction is obtainable. The performance is even further degraded when the noise signal is nonstationary. In this contribution we propose three postfiltering methods for improving the performance of microphone arrays. Two of which are based on single-channel speech enhancers and making use of recently proposed algorithms concatenated to the beamformer output. The third is a multichannel speech enhancer which exploits noise-only components constructed within the TF-GSC structure. This work concentrates on the assessment of the proposed postfiltering structures. An extensive experimental study, which consists of both objective and subjective evaluation in various noise fields, demonstrates the advantage of the multichannel postfiltering compared to the single-channel techniques.

引用

页码：561 / 571

页数：11

共 27 条

[1]

[Anonymous], P IEEE ICASSP 88

[2]

[Anonymous], 1993, S111986 ANSI

[3]

[Anonymous], P INT WORKSH AC ECH

[4] Multi-microphone noise reduction techniques as front-end devices for speech recognition [J].

Bitzer, J ;

Simmer, KU ;

Kammeyer, KD .

SPEECH COMMUNICATION, 2001, 34 (1-2) :3-12

[5]

BITZER J, 1999, P INT WORKSH AC ECH, P100

[6]

BOLL SF, 1983, SPEECH ENHANCEMENT, P61

[7] Speech enhancement using a mixture-maximum model [J].

Burshtein, D ;

Gannot, S .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (06) :341-351

[8]

BURSHTEIN D, 1999, P 6 EUR C SPEECH COM, V6, P2591

[9]

Cohen I, 2002, INT CONF ACOUST SPEE, P901

[10] Noise estimation by minima controlled recursive averaging for robust speech enhancement [J].

Cohen, I ;

Berdugo, B .

IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) :12-15

← 1 2 3 →