Signal-Dependent Penalty Functions for Robust Acoustic Multi-Channel Equalization

被引：7

作者：

Kodrasi, Ina ^{[1
,2
]}

Doclo, Simon ^{[1
,2
]}

机构：

[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Signal Proc Grp, D-26129 Oldenburg, Germany

[2] Carl von Ossietzky Univ Oldenburg, Cluster Excellence Hearing4All, D-26129 Oldenburg, Germany

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2017年 / 25卷 / 07期

关键词：

Acoustic multi-channel equalization; ADMM; sparsity; signal-dependent penalty function; SPEECH DEREVERBERATION; NOISE; REVERBERATION; PREDICTION; HEARING;

D O I：

10.1109/TASLP.2017.2699326

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Acoustic multi-channel equalization techniques, which aim to achieve dereverberation by reshaping the room impulse responses (RIRs) between the source and the microphone array, are known to be highly sensitive to RIR perturbations. In order to increase the robustness againstRIR perturbations, several signal-independent methods have been proposed, which only rely on the available perturbed RIRs and do not incorporate any knowledge about the output signal. This paper presents a novel signal-dependent method to increase the robustness of equalization techniques by enforcing the output signal to exhibit spectrotemporal characteristics of a clean speech signal. Motivated by the sparse nature of clean speech, we propose to extend the cost function of state-of-the-art least squares equalization techniques, i.e., the multiple-input/output inverse theorem (MINT), relaxed multi-channel least squares (RMCLS), and partial multi-channel equalization based on MINT (PMINT), with a signal-dependent penalty function promoting sparsity of the output signal in the short-time Fourier transform domain. Three conventionally used sparsity-promoting penalty functions are investigated, i.e., the l0-norm, the l1-norm, and the weighted l1-norm, and the sparsitypromoting reshaping filters are iteratively computed using the alternating direction method of multipliers. Simulation results for several acoustic systems and RIR perturbations demonstrate that incorporating sparsity-promoting penalty functions significantly increases the robustness of MINT, RMCLS, and PMINT, with the weighted l1-norm typically outperforming the l0-norm and the l1-norm. Furthermore, it is shown that the weighted l1-norm sparsity-promoting PMINT technique outperforms the other sparsity-promoting techniques in terms of perceptual speech quality. Finally, it is shown that the signal-dependent weighted l1-norm sparsity-promoting PMINT technique yields a similar or better dereverberation performance than the signal-independent regularized PMINT technique, confirming the advantage of using signal-dependent penalty functions for robust dereverberation filter design.

引用

页码：1512 / 1525

页数：14

共 57 条

[21] Hu M, 2015, EUR SIGNAL PR CONF, P2476, DOI 10.1109/EUSIPCO.2015.7362830
[22] Jukic A., 2016, P AES INT C DER REV
[23] Multi-Channel Linear Prediction-Based Speech Dereverberation With Sparse Priors
Jukic, Ante
van Waterschoot, Toon
Gerkmann, Timo
Doclo, Simon
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (09) : 1509 - 1520
[24] Combined Acoustic MIMO Channel Crosstalk Cancellation and Room Impulse Response Reshaping
Jungmann, Jan Ole
Mazur, Radoslaw
Kallinger, Markus
Mei, Tiemin
Mertins, Alfred
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06): : 1829 - 1842
[25] KALLINGER M, 2006, INT CONF ACOUST SPEE, P101
[26] Frequency domain selective tap adaptive algorithms for sparse system identification
Khong, Andy W. H.
Lin, Xiang Shawn
Doroslocki, Milos
Naylor, Patrick A.
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 229 - +
[27] Kodrasi I, 2016, INT CONF ACOUST SPEE, P166, DOI 10.1109/ICASSP.2016.7471658
[28] Regularization for Partial Multichannel Equalization for Speech Dereverberation
Kodrasi, Ina
Goetze, Stefan
Doclo, Simon
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1879 - 1890
[29] Kodrasi I, 2012, EUR SIGNAL PR CONF, P2442
[30] Blind System Identification Using Sparse Learning for TDOA Estimation of Room Reflections
Kowalczyk, Konrad
Habets, Emanuel A. P.
Kellermann, Walter
Naylor, Patrick A.
[J]. IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (07) : 653 - 656

← 1 2 3 4 5 6 →