Multichannel Eigenspace Beamforming in a Reverberant Noisy Environment With Multiple Interfering Speech Signals

被引:212
|
作者
Markovich, Shmulik [1 ]
Gannot, Sharon [1 ]
Cohen, Israel [2 ]
机构
[1] Bar Ilan Univ, Sch Chem, IL-52900 Ramat Gan, Israel
[2] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
关键词
Array signal processing; interference cancellation; speech enhancement; subspace methods; SUBSPACE APPROACH; SEPARATION; ALGORITHM; DOMAIN;
D O I
10.1109/TASL.2009.2016395
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In many practical environments we wish to extract several desired speech signals, which are contaminated by nonstationary and stationary interfering signals. The desired signals may also be subject to distortion imposed by the acoustic room impulse responses (RIRs). In this paper, a linearly constrained minimum variance (LCMV) beamformer is designed for extracting the desired signals from multimicrophone measurements. The beamformer satisfies two sets of linear constraints. One set is dedicated to maintaining the desired signals, while the other set is chosen to mitigate both the stationary and nonstationary interferences. Unlike classical beamformers, which approximate the RIRs as delay-only filters, we take into account the entire RIR [or its respective acoustic transfer function (ATF)]. The LCMV beamformer is then reformulated in a generalized side-lobe canceler (GSC) structure, consisting of a fixed beamformer (FBF), blocking matrix (BM), and adaptive noise canceler (ANC). It is shown that for spatially white noise field, the beamformer reduces to a FBF, satisfying the constraint sets, without power minimization. It is shown that the application of the adaptive ANC contributes to interference reduction, but only when the constraint sets are not completely satisfied. We show that relative transfer functions (RTFs), which relate the desired speech sources and the microphones, and a basis for the interference subspace suffice for constructing the beamformer. The RTFs are estimated by applying the generalized eigenvalue decomposition (GEVD) procedure to the power spectral density (PSD) matrices of the received signals and the stationary noise. A basis for the interference subspace is estimated by collecting eigenvectors, calculated in segments where nonstationary interfering sources are active and the desired sources are inactive. The rank of the basis is then reduced by the application of the orthogonal triangular decomposition (QRD). This procedure relaxes the common requirement for nonoverlapping activity periods of the interference sources. A comprehensive experimental study in both simulated and real environments demonstrates the performance of the proposed beamformer.
引用
收藏
页码:1071 / 1086
页数:16
相关论文
共 33 条
  • [21] A robust pitch detection algorithm for speech signals in a practical noisy environment
    Shahnaz, C.
    Zhu, W. -P.
    Ahmad, M. O.
    2007 50TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-3, 2007, : 330 - 333
  • [22] Enhancement of speech signals in a noisy environment based on wavelet based adaptive filtering
    Dept. of Electronics and Communication Engineering, Govt. College of Engineering, Tirunelveli, India
    Int. J. Signal Process. Image Process. Pattern Recogn., 9 (69-76): : 69 - 76
  • [23] Comparison of speech intelligibility of elderly aged 60-69 years and young adults in the noisy and reverberant environment
    Zeng, Jiazhong
    Peng, Jianxin
    Zhao, Yuezhe
    APPLIED ACOUSTICS, 2020, 159
  • [24] Speech recognition in reverberant and noisy environments employing multiple feature extractors and i-vector speaker adaptation
    Md Jahangir Alam
    Vishwa Gupta
    Patrick Kenny
    Pierre Dumouchel
    EURASIP Journal on Advances in Signal Processing, 2015
  • [25] Speech recognition in reverberant and noisy environments employing multiple feature extractors and i-vector speaker adaptation
    Alam, Md Jahangir
    Gupta, Vishwa
    Kenny, Patrick
    Dumouchel, Pierre
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015, : 1 - 13
  • [26] Method for time-of-flight estimation of low frequency acoustic signals in reverberant and noisy environment with sparse impulse response
    Elfering, Michael
    Annas, Sven
    Jantzen, Hans-Arno
    Janoske, Uwe
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2022, 33 (04)
  • [27] Single-Channel Speech Dereverberation in Noisy Environment for Non-Orthogonal Signals
    Fahim, Abdullah
    Samarasinghe, Prasanga N.
    Abhayapala, Thushara D.
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2018, 104 (06) : 1041 - 1055
  • [28] Recovering speech intelligibility with deep learning and multiple microphones in noisy-reverberant situations for people using cochlear implants
    Cambridge Hearing Group, Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, Cambridge
    CB2 7EF, United Kingdom
    J. Acoust. Soc. Am., 2024, 6 (3833-3847):
  • [29] MIMO beamforming system for speech enhancement in realistic environment with multiple noise sources
    Mohammed J.R.
    International Journal of Speech Technology, 2018, 21 (3) : 671 - 680
  • [30] Recovering speech intelligibility with deep learning and multiple microphones in noisy-reverberant situations for people using cochlear implants
    Gaultier, Clement
    Goehring, Tobias
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2024, 155 (06): : 3833 - 3847