A Microphone Array Beamformer for the Performance Enhancement of Speech Recognizer in Car

Cited by: 0
Authors
Han, Chul-Hee
Kang, Hong-Goo
Hwang, Youngsoo
Youn, Dae-Hee
Institution
Source
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | 2005 / Vol. 24 / No. 07
Keywords
Microphone Array; Relative Transfer Function; Near-Field; MVDR Beamformer; Speech Enhancement; Speech Recognition; RTF-MVDR;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper proposes a microphone array beamforming algorithm that reduces the signal distortion caused by reverberation and the near-field effect in a car environment. When reverberation or the near-field effect is present, an optimum beamformer should be constructed with a steering vector consisting of the transfer functions between the source and the microphones, but it is generally difficult to estimate these transfer functions on-line without knowledge of the source signal. Instead, a sub-optimal beamforming algorithm that reduces signal distortion is proposed. It is constructed with steering vectors consisting of the relative transfer functions between a reference sensor and the other sensors. To evaluate the performance of the proposed algorithm, we recorded a noisy speech database in a car and performed speech recognition experiments with the Hidden Markov Model Toolkit (HTK) released by Cambridge University. In the best case, the recognition rate of the proposed algorithm was 15 percent higher than that of conventional far-field beamformers.
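The idea of building an MVDR beamformer from relative transfer functions (RTFs) can be illustrated with a minimal sketch for a single frequency bin. It is not the authors' implementation: it uses the textbook closed-form MVDR solution w = R^{-1} h / (h^H R^{-1} h), and every name and number in it (`rtf`, `noise_cov`, `speech_at_ref`, the three-microphone setup) is a hypothetical assumption. In practice the RTFs and the noise covariance would have to be estimated from recordings, which this sketch does not attempt.

```python
# Sketch of MVDR weights built from a relative-transfer-function (RTF) steering
# vector for one frequency bin. All names and numbers are illustrative
# assumptions, not values from the paper.

def solve(a, b):
    """Solve a @ x = b for a small complex system (Gaussian elimination, partial pivoting)."""
    n = len(b)
    # Work on an augmented copy so the inputs are not modified.
    m = [list(row) + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0j] * n
    for r in reversed(range(n)):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def rtf_mvdr_weights(noise_cov, rtf):
    """Return w = R^{-1} h / (h^H R^{-1} h), where h is the RTF steering vector."""
    r_inv_h = solve(noise_cov, rtf)
    denom = sum(h.conjugate() * v for h, v in zip(rtf, r_inv_h))
    return [v / denom for v in r_inv_h]


def beamform(weights, mic_bins):
    """Apply y = w^H x to one frequency bin of the microphone signals."""
    return sum(w.conjugate() * x for w, x in zip(weights, mic_bins))


# Hypothetical 3-microphone example at a single frequency bin.
rtf = [1 + 0j, 0.8 - 0.3j, 0.5 + 0.4j]  # rtf[0] = 1: the reference microphone
noise_cov = [
    [2.0 + 0j, 0.3 + 0.1j, 0.1 - 0.2j],
    [0.3 - 0.1j, 1.5 + 0j, 0.2 + 0.05j],
    [0.1 + 0.2j, 0.2 - 0.05j, 1.0 + 0j],
]  # Hermitian and diagonally dominant, hence positive definite
weights = rtf_mvdr_weights(noise_cov, rtf)

# Speech as observed at the reference microphone, mapped to the others via the RTFs.
speech_at_ref = 0.7 - 0.2j
mic_bins = [h * speech_at_ref for h in rtf]
output = beamform(weights, mic_bins)
```

Because the weights satisfy the distortionless constraint w^H h = 1, the noise-free `output` equals `speech_at_ref`: the beamformer recovers the speech as it arrives at the reference microphone, including the room response on that path, rather than the dry source signal. This is why only the ratios between sensor transfer functions are needed, and not the source-to-microphone transfer functions themselves.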
Pages: 423-430
Page count: 8