Multichannel Eigenspace Beamforming in a Reverberant Noisy Environment With Multiple Interfering Speech Signals

被引:212
|
作者
Markovich, Shmulik [1 ]
Gannot, Sharon [1 ]
Cohen, Israel [2 ]
机构
[1] Bar Ilan Univ, Sch Chem, IL-52900 Ramat Gan, Israel
[2] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
关键词
Array signal processing; interference cancellation; speech enhancement; subspace methods; SUBSPACE APPROACH; SEPARATION; ALGORITHM; DOMAIN;
D O I
10.1109/TASL.2009.2016395
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In many practical environments we wish to extract several desired speech signals, which are contaminated by nonstationary and stationary interfering signals. The desired signals may also be subject to distortion imposed by the acoustic room impulse responses (RIRs). In this paper, a linearly constrained minimum variance (LCMV) beamformer is designed for extracting the desired signals from multimicrophone measurements. The beamformer satisfies two sets of linear constraints. One set is dedicated to maintaining the desired signals, while the other set is chosen to mitigate both the stationary and nonstationary interferences. Unlike classical beamformers, which approximate the RIRs as delay-only filters, we take into account the entire RIR [or its respective acoustic transfer function (ATF)]. The LCMV beamformer is then reformulated in a generalized side-lobe canceler (GSC) structure, consisting of a fixed beamformer (FBF), blocking matrix (BM), and adaptive noise canceler (ANC). It is shown that for spatially white noise field, the beamformer reduces to a FBF, satisfying the constraint sets, without power minimization. It is shown that the application of the adaptive ANC contributes to interference reduction, but only when the constraint sets are not completely satisfied. We show that relative transfer functions (RTFs), which relate the desired speech sources and the microphones, and a basis for the interference subspace suffice for constructing the beamformer. The RTFs are estimated by applying the generalized eigenvalue decomposition (GEVD) procedure to the power spectral density (PSD) matrices of the received signals and the stationary noise. A basis for the interference subspace is estimated by collecting eigenvectors, calculated in segments where nonstationary interfering sources are active and the desired sources are inactive. The rank of the basis is then reduced by the application of the orthogonal triangular decomposition (QRD). This procedure relaxes the common requirement for nonoverlapping activity periods of the interference sources. A comprehensive experimental study in both simulated and real environments demonstrates the performance of the proposed beamformer.
引用
收藏
页码:1071 / 1086
页数:16
相关论文
共 33 条
  • [1] Combined LCMV-TRINICON Beamforming for Separating Multiple Speech Sources in Noisy and Reverberant Environments
    Markovich-Golan, Shmulik
    Gannot, Sharon
    Kellermann, Walter
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (02) : 320 - 332
  • [2] SPEECH RECOGNITION IN A NOISY AND REVERBERANT ENVIRONMENT WITH AND WITHOUT EARMUFFS
    PEKKARINEN, E
    VILJANEN, V
    SALMIVALLI, A
    SUONPAA, J
    AUDIOLOGY, 1990, 29 (05): : 286 - 293
  • [3] Maximum likelihood approach to speech enhancement for noisy reverberant signals
    Yoshioka, Takuya
    Nakatani, Tomohiro
    Hikichi, Takafumi
    Miyoshi, Masato
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4585 - 4588
  • [4] Experimental study of robust acoustic beamforming for speech acquisition in reverberant and noisy environments
    Zhao, Yingke
    Jensen, Jesper Rindom
    Jensen, Tobias Lindstrom
    Chen, Jingdong
    Christensen, Mads Graesboll
    APPLIED ACOUSTICS, 2020, 170
  • [5] A COMPARISON BETWEEN ALTERNATIVE BEAMFORMING STRATEGIES FOR INTERFERENCE CANCELATION IN NOISY AND REVERBERANT ENVIRONMENT
    Markovich, Shmulik
    Gannot, Sharon
    Cohen, Israel
    2008 IEEE 25TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, VOLS 1 AND 2, 2008, : 203 - +
  • [6] Time difference of arrival estimation of speech source in a noisy and reverberant environment
    Dvorkind, TG
    Gannot, S
    SIGNAL PROCESSING, 2005, 85 (01) : 177 - 204
  • [7] A multiresolution approach to blind separation of speech signals in a reverberant environment
    Ikram, MZ
    Morgan, DR
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 2757 - 2760
  • [8] UNSUPERVISED BEAMFORMING BASED ON MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR NOISY SPEECH RECOGNITION
    Shimada, Kazuki
    Bando, Yoshiaki
    Mimura, Masato
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5734 - 5738
  • [9] Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech
    Cauchi, Benjamin
    Kodrasi, Ina
    Rehr, Robert
    Gerlach, Stephan
    Jukic, Ante
    Gerkmann, Timo
    Doclo, Simon
    Goetze, Stefan
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015,
  • [10] Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech
    Benjamin Cauchi
    Ina Kodrasi
    Robert Rehr
    Stephan Gerlach
    Ante Jukić
    Timo Gerkmann
    Simon Doclo
    Stefan Goetze
    EURASIP Journal on Advances in Signal Processing, 2015