AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION

Citations: 0
Authors
Li, Xiaofei [1]
Girin, Laurent [1, 2, 3]
Horaud, Radu [1]
Affiliations
[1] INRIA Grenoble Rhone Alpes, Montbonnot St Martin, France
[2] GIPSA Lab, St Martin Dheres, France
[3] Univ Grenoble Alpes, Grenoble, France
Source
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017
Funding
European Union Seventh Framework Programme (FP7);
Keywords
Source separation; convolutive transfer function; ℓ1-norm regularization; RELATIVE TRANSFER-FUNCTION; MIXTURES; IDENTIFICATION; APPROXIMATION; SHRINKAGE;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely used multiplicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with an ℓ1-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.
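The estimation step described in the abstract can be illustrated with a minimal sketch for a single frequency bin. This is not the authors' implementation: the solver (plain ISTA with complex soft-thresholding), the function names `ctf_matrix`, `soft_threshold` and `lasso_ista`, the filter array layout and the parameter `lam` are all assumptions made for illustration. The sketch assumes the CTF taps are known (the semi-oracle setting) and that each mixture frame is the frame-wise convolution of the source frames with those taps.

```python
import numpy as np


def ctf_matrix(filters, n_frames):
    """Build the frame-wise convolution matrix for one frequency bin.

    filters: complex array of shape (I, J, Q), hypothetical layout with
             I microphones, J sources and Q CTF taps per filter.
    Returns A of shape (I * n_frames, J * n_frames) so that A @ s gives the
    stacked mixture frames, with the convolution truncated to n_frames.
    """
    n_mics, n_srcs, n_taps = filters.shape
    A = np.zeros((n_mics * n_frames, n_srcs * n_frames), dtype=complex)
    for i in range(n_mics):
        for j in range(n_srcs):
            for q in range(n_taps):
                for p in range(q, n_frames):
                    A[i * n_frames + p, j * n_frames + p - q] = filters[i, j, q]
    return A


def soft_threshold(z, t):
    """Complex soft-thresholding: shrink magnitudes by t, keep phases."""
    mag = np.abs(z)
    scale = np.maximum(1.0 - t / np.maximum(mag, 1e-12), 0.0)
    return scale * z


def lasso_ista(A, x, lam, n_iter=500):
    """Minimize 0.5 * ||x - A s||^2 + lam * ||s||_1 with ISTA (assumed solver)."""
    step = 1.0 / np.linalg.norm(A, 2) ** 2  # 1 / Lipschitz constant of the gradient
    s = np.zeros(A.shape[1], dtype=complex)
    for _ in range(n_iter):
        grad = A.conj().T @ (A @ s - x)
        s = soft_threshold(s - step * grad, step * lam)
    return s


def objective(A, x, s, lam):
    """Lasso cost: mixture fitting term plus l1 penalty."""
    return 0.5 * np.linalg.norm(x - A @ s) ** 2 + lam * np.sum(np.abs(s))


# Toy under-determined example: 2 microphones, 3 sparse sources, 4 CTF taps.
rng = np.random.default_rng(0)
n_mics, n_srcs, n_taps, n_frames = 2, 3, 4, 40
filters = rng.standard_normal((n_mics, n_srcs, n_taps)) \
    + 1j * rng.standard_normal((n_mics, n_srcs, n_taps))
s_true = np.zeros(n_srcs * n_frames, dtype=complex)
active = rng.choice(s_true.size, size=10, replace=False)
s_true[active] = rng.standard_normal(10) + 1j * rng.standard_normal(10)

A = ctf_matrix(filters, n_frames)
x = A @ s_true
s_hat = lasso_ista(A, x, lam=0.1)
```

In a full system this would be repeated independently for every frequency bin of the short-time Fourier transform, and the separated spectra would be inverted back to the time domain. The regularization weight `lam` trades mixture fit against sparsity and would need tuning on real data.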
Pages: 541-545
Number of pages: 5
Related papers
50 in total
[21]   Combining Superdirective Beamforming and Frequency-Domain Blind Source Separation for Highly Reverberant Signals [J].
Wang, Lin ;
Ding, Heping ;
Yin, Fuliang .
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,
[22]   Reverberant Audio Blind Source Separation via Local Convolutive Independent Vector Analysis [J].
Feng, Fangchen ;
Begdadi, Azeddine .
2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
[23]   Convolutive Audio Source Separation Using Robust ICA and Reduced Likelihood Ratio Jump [J].
Mallis, Dimitrios ;
Sgouros, Thomas ;
Mitianoudis, Nikolaos .
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2016, 2016, 475 :230-241
[24]   Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function [J].
Li, Xiaofei ;
Gannot, Sharon ;
Girin, Laurent ;
Horaud, Radu .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) :1755-1768
[25]   Source Separation Based on Transfer Function between Microphones and its Dispersion [J].
Kohmura, Sayuri ;
Togawa, Taro ;
Otani, Takeshi .
2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017,
[26]   Audio source separation with multiple microphones on time-frequency representations [J].
Sawada, Hiroshi .
INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING XI, 2013, 8750
[27]   Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking [J].
Reju, Vaninirappuputhenpurayil Gopalan ;
Koh, Soo Ngee ;
Soon, Ing Yann .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01) :101-116
[28]   Design of Frequency-domain Controlled Source for Electromagnetic Prospecting Based on Multi-frequency Resonance [J].
Zhu, Wang ;
Liu, Yan ;
Zhu, Xuegui .
PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, :1819-1822
[29]   Underdetermined blind separation of audio sources from the time-frequency representation of their convolutive mixtures [J].
Aissa-El-Bey, Abdeldjalil ;
Abed-Meraim, Karim ;
Grenier, Yves .
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, :153-156
[30]   Time-Domain Audio Source Separation With Neural Networks Based on Multiresolution Analysis [J].
Nakamura, Tomohiko ;
Kozuka, Shihori ;
Saruwatari, Hiroshi .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 :1687-1701