AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION

被引：0

作者：

Li, Xiaofei ^{[1
]}

Girin, Laurent ^{[1
,2
,3
]}

Horaud, Radu ^{[1
]}

机构：

[1] INRIA Grenoble Rhone Alpes, Montbonnot St Martin, France

[2] GIPSA Lab, St Martin Dheres, France

[3] Univ Grenoble Alpes, Grenoble, France

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年

基金：

欧盟第七框架计划;

关键词：

Source separation; convolutive transfer function; l(1)-norm regularization; RELATIVE TRANSFER-FUNCTION; MIXTURES; IDENTIFICATION; APPROXIMATION; SHRINKAGE;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely-used multiplicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with a l(1)-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.

引用

页码：541 / 545

页数：5

共 50 条

[21] Combining Superdirective Beamforming and Frequency-Domain Blind Source Separation for Highly Reverberant Signals [J].

Wang, Lin ;

Ding, Heping ;

Yin, Fuliang .

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,

[22] Reverberant Audio Blind Source Separation via Local Convolutive Independent Vector Analysis [J].

Feng, Fangchen ;

Begdadi, Azeddine .

2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,

[23] Convolutive Audio Source Separation Using Robust ICA and Reduced Likelihood Ratio Jump [J].

Mallis, Dimitrios ;

Sgouros, Thomas ;

Mitianoudis, Nikolaos .

ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2016, 2016, 475 :230-241

[24] Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function [J].

Li, Xiaofei ;

Gannot, Sharon ;

Girin, Laurent ;

Horaud, Radu .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) :1755-1768

[25] Source Separation Based on Transfer Function between Microphones and its Dispersion [J].

Kohmura, Sayuri ;

Togawa, Taro ;

Otani, Takeshi .

2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017,

[26] Audio source separation with multiple microphones on time-frequency representations [J].

Sawada, Hiroshi .

INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING XI, 2013, 8750

[27] Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking [J].

Reju, Vaninirappuputhenpurayil Gopalan ;

Koh, Soo Ngee ;

Soon, Ing Yann .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01) :101-116

[28] Design of Frequency-domain Controlled Source for Electromagnetic Prospecting Based on Multi-frequency Resonance [J].

Zhu, Wang ;

Liu, Yan ;

Zhu, Xuegui .

PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, :1819-1822

[29] Underdetermined blind separation of audio sources from the time-frequency representation of their convolutive mixtures [J].

Aissa-El-Bey, Abdeldjalil ;

Abed-Meraim, Karim ;

Grenier, Yves .

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, :153-156

[30] Time-Domain Audio Source Separation With Neural Networks Based on Multiresolution Analysis [J].

Nakamura, Tomohiko ;

Kozuka, Shihori ;

Saruwatari, Hiroshi .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 :1687-1701

← 1 2 3 4 5 →