Signal-Dependent Penalty Functions for Robust Acoustic Multi-Channel Equalization

被引:7
作者
Kodrasi, Ina [1 ,2 ]
Doclo, Simon [1 ,2 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Signal Proc Grp, D-26129 Oldenburg, Germany
[2] Carl von Ossietzky Univ Oldenburg, Cluster Excellence Hearing4All, D-26129 Oldenburg, Germany
关键词
Acoustic multi-channel equalization; ADMM; sparsity; signal-dependent penalty function; SPEECH DEREVERBERATION; NOISE; REVERBERATION; PREDICTION; HEARING;
D O I
10.1109/TASLP.2017.2699326
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Acoustic multi-channel equalization techniques, which aim to achieve dereverberation by reshaping the room impulse responses (RIRs) between the source and the microphone array, are known to be highly sensitive to RIR perturbations. In order to increase the robustness againstRIR perturbations, several signal-independent methods have been proposed, which only rely on the available perturbed RIRs and do not incorporate any knowledge about the output signal. This paper presents a novel signal-dependent method to increase the robustness of equalization techniques by enforcing the output signal to exhibit spectrotemporal characteristics of a clean speech signal. Motivated by the sparse nature of clean speech, we propose to extend the cost function of state-of-the-art least squares equalization techniques, i.e., the multiple-input/output inverse theorem (MINT), relaxed multi-channel least squares (RMCLS), and partial multi-channel equalization based on MINT (PMINT), with a signal-dependent penalty function promoting sparsity of the output signal in the short-time Fourier transform domain. Three conventionally used sparsity-promoting penalty functions are investigated, i.e., the l0-norm, the l1-norm, and the weighted l1-norm, and the sparsitypromoting reshaping filters are iteratively computed using the alternating direction method of multipliers. Simulation results for several acoustic systems and RIR perturbations demonstrate that incorporating sparsity-promoting penalty functions significantly increases the robustness of MINT, RMCLS, and PMINT, with the weighted l1-norm typically outperforming the l0-norm and the l1-norm. Furthermore, it is shown that the weighted l1-norm sparsity-promoting PMINT technique outperforms the other sparsity-promoting techniques in terms of perceptual speech quality. Finally, it is shown that the signal-dependent weighted l1-norm sparsity-promoting PMINT technique yields a similar or better dereverberation performance than the signal-independent regularized PMINT technique, confirming the advantage of using signal-dependent penalty functions for robust dereverberation filter design.
引用
收藏
页码:1512 / 1525
页数:14
相关论文
共 57 条
  • [21] Hu M, 2015, EUR SIGNAL PR CONF, P2476, DOI 10.1109/EUSIPCO.2015.7362830
  • [22] Jukic A., 2016, P AES INT C DER REV
  • [23] Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors
    Jukic, Ante
    van Waterschoot, Toon
    Gerkmann, Timo
    Doclo, Simon
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (09) : 1509 - 1520
  • [24] Combined Acoustic MIMO Channel Crosstalk Cancellation and Room Impulse Response Reshaping
    Jungmann, Jan Ole
    Mazur, Radoslaw
    Kallinger, Markus
    Mei, Tiemin
    Mertins, Alfred
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06): : 1829 - 1842
  • [25] KALLINGER M, 2006, INT CONF ACOUST SPEE, P101
  • [26] Frequency domain selective tap adaptive algorithms for sparse system identification
    Khong, Andy W. H.
    Lin, Xiang Shawn
    Doroslocki, Milos
    Naylor, Patrick A.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 229 - +
  • [27] Kodrasi I, 2016, INT CONF ACOUST SPEE, P166, DOI 10.1109/ICASSP.2016.7471658
  • [28] Regularization for Partial Multichannel Equalization for Speech Dereverberation
    Kodrasi, Ina
    Goetze, Stefan
    Doclo, Simon
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1879 - 1890
  • [29] Kodrasi I, 2012, EUR SIGNAL PR CONF, P2442
  • [30] Blind System Identification Using Sparse Learning for TDOA Estimation of Room Reflections
    Kowalczyk, Konrad
    Habets, Emanuel A. P.
    Kellermann, Walter
    Naylor, Patrick A.
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (07) : 653 - 656