A General Framework for Incorporating Time-Frequency Domain Sparsity in Multichannel Speech Dereverberation

被引：7

作者：

Jukic, Ante ^{[1
,2
]}

van Waterschoot, Toon ^{[3
]}

Gerkmann, Timo ^{[4
]}

Doclo, Simon ^{[1
,2
]}

机构：

[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Oldenburg, Germany

[2] Cluster Excellence Hearing4All, Oldenburg, Germany

[3] Katholieke Univ Leuven, Dept Elect Engn ESAT STADIUS ETC, Leuven, Belgium

[4] Univ Hamburg, Dept Informat, Hamburg, Germany

来源：

JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2017年 / 65卷 / 1-2期

关键词：

LINEAR PREDICTION; SOURCE SEPARATION; SHRINKAGE; REVERBERATION; SYSTEMS;

D O I：

10.17743/jaes.2016.0064

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Blind multichannel speech dereverberation methods based on multichannel linear prediction (MCLP) estimate the dereverberated speech component without any knowledge of the room acoustics by estimating and subtracting the undesired reverberant component from the reference microphone signal. In this paper we present a general framework for incorporating sparsity in the time-frequency domain into MCLP-based speech dereverberation. The presented framework enables to use either a wideband or a narrowband signal model with either an analysis or a synthesis sparsity prior for the desired speech component and generalizes stateof-the-art MCLP-based speech dereverberation methods, which is shown both analytically as well as using simulations.

引用

页码：17 / 30

页数：14

共 48 条

[1] Audio Inpainting [J].

Adler, Amir ;

Emiya, Valentin ;

Jafari, Maria G. ;

Elad, Michael ;

Gribonval, Remi ;

Plumbley, Mark D. .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (03) :922-932

[2]

[Anonymous], 2010, P INT WORKSH AC ECH

[3]

[Anonymous], 2014, INT C ACOUSTICS SPEE

[4]

[Anonymous], 1983, AUGMENTED LAGRANGIAN, DOI DOI 10.1016/S0168-2024(08)70028-6

[5] Sparse Reverberant Audio Source Separation via Reweighted Analysis [J].

Arberet, Simon ;

Vandergheynst, Pierre ;

Carrillo, Rafael E. ;

Thiran, Jean-Philippe ;

Wiaux, Yves .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07) :1391-1402

[6] Adapted and Adaptive Linear Time-Frequency Representations [J].

Balazs, Peter ;

Doerfler, Monika ;

Kowalski, Matthieu ;

Torresani, Bruno .

IEEE SIGNAL PROCESSING MAGAZINE, 2013, 30 (06) :20-31

[7] A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems [J].

Beck, Amir ;

Teboulle, Marc .

SIAM JOURNAL ON IMAGING SCIENCES, 2009, 2 (01) :183-202

[8] Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners [J].

Beutelmann, Rainer ;

Brand, Thomas .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (01) :331-342

[9]

Bofill P., 2000, Second International Workshop on Independent Component Analysis and Blind Signal Separation. Proceedings, P87

[10] Underdetermined blind source separation using sparse representations [J].

Bofill, P ;

Zibulevsky, M .

SIGNAL PROCESSING, 2001, 81 (11) :2353-2362

← 1 2 3 4 5 →