Analysis of CFA-BF: Novel combined fixed/adaptive beamforming for robust speech recognition in real car environments

被引:6
作者
Hansen, John H. L. [1 ]
Zhang, Xianxian [1 ]
机构
[1] Univ Texas Dallas, CRSS, Dept Elect Engn, Erik Jonsson Sch Engn & Comp Sci, Richardson, TX 75083 USA
关键词
Array processing; Robust speech recognition; In-vehicle speech systems; Beamforming; ADAPTIVE BEAMFORMER; ENHANCEMENT; NOISE; ALGORITHM;
D O I
10.1016/j.specom.2009.09.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Among a number of studies which have investigated various speech enhancement and processing schemes for in-vehicle speech systems, the delay-and-sum beamforming (DASB) and adaptive beamforming are two typical methods that both have their advantages and disadvantages. In this paper, we propose a novel combined fixed/adaptive beamforming solution (CFA-BF) based on previous work for speech enhancement and recognition in real moving car environments, which seeks to take advantage of both methods. The working scheme of CFA-BF consists of two steps: source location calibration and target signal enhancement. The first step is to pre-record the transfer functions between the speaker and microphone array from different potential source positions using adaptive beamforming under quiet environments; and the second step is to use this pre-recorded information to enhance the desired speech when the car is running on the road. An evaluation using extensive actual car speech data from the CU-Move Corpus shows that the method can decrease WER for speech recognition by up to 30% over a single channel scenario and improve speech quality via the SEGSNR measure by up to 1 dB on the average. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:134 / 149
页数:16
相关论文
共 1 条
  • [1] CSA-BF: A constrained switched adaptive beamformer for speech enhancement and recognition in real car environments
    Zhang, XX
    Hansen, JHL
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (06): : 733 - 745