AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION

Cited by: 0

Authors:

Li, Xiaofei [1]

Girin, Laurent [1, 2, 3]

Horaud, Radu [1]

Affiliations:

[1] INRIA Grenoble Rhone Alpes, Montbonnot St Martin, France

[2] GIPSA Lab, St Martin Dheres, France

[3] Univ Grenoble Alpes, Grenoble, France

Source:

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017

Funding:

European Union Seventh Framework Programme (FP7);

Keywords:

Source separation; convolutive transfer function; ℓ1-norm regularization; RELATIVE TRANSFER-FUNCTION; MIXTURES; IDENTIFICATION; APPROXIMATION; SHRINKAGE;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification code:

070206 ; 082403 ;

Abstract:

This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely-used multiplicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with an ℓ1-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.
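
As a rough illustration of the kind of optimization the abstract describes, the Python sketch below solves an ℓ1-regularized least-squares (Lasso) problem for a single frequency bin, where each microphone frame is modelled as a short convolution over frames of the source coefficients (the CTF idea). Everything specific here is an assumption made for illustration and is not taken from the paper: the function names (ctf_matrix, soft_threshold, objective, lasso_ista), the ISTA solver, the penalty weight lam, and the random filters and sources used in the demo.

```python
import numpy as np


def ctf_matrix(filters, n_frames):
    """Build the frame-wise convolution matrix for one frequency bin.

    filters has shape (n_mics, n_src, n_taps): the assumed-known CTF from
    each source to each microphone. The returned matrix A satisfies
    x = A @ s.ravel() for sources s of shape (n_src, n_frames), where x
    stacks the mixture frames of all microphones.
    """
    n_mics, n_src, n_taps = filters.shape
    A = np.zeros((n_mics * n_frames, n_src * n_frames), dtype=complex)
    for i in range(n_mics):
        for j in range(n_src):
            for q in range(n_taps):
                for p in range(q, n_frames):
                    A[i * n_frames + p, j * n_frames + p - q] = filters[i, j, q]
    return A


def soft_threshold(z, t):
    """Complex soft-thresholding: shrink each magnitude by t, keep the phase."""
    mag = np.abs(z)
    return z * np.maximum(1.0 - t / np.maximum(mag, 1e-12), 0.0)


def objective(A, x, s, lam):
    """Mixture fitting cost plus l1 penalty."""
    return 0.5 * np.linalg.norm(x - A @ s) ** 2 + lam * np.sum(np.abs(s))


def lasso_ista(A, x, lam, n_iter=500):
    """Minimize objective(A, x, s, lam) over s with plain ISTA iterations."""
    step = 1.0 / np.linalg.norm(A, 2) ** 2  # 1 / (largest singular value)^2
    s = np.zeros(A.shape[1], dtype=complex)
    for _ in range(n_iter):
        grad = A.conj().T @ (A @ s - x)
        s = soft_threshold(s - step * grad, step * lam)
    return s


# Synthetic under-determined demo: 3 sparse sources, 2 microphones.
rng = np.random.default_rng(0)
n_mics, n_src, n_taps, n_frames = 2, 3, 4, 40
filters = (rng.standard_normal((n_mics, n_src, n_taps))
           + 1j * rng.standard_normal((n_mics, n_src, n_taps)))
s_true = np.zeros((n_src, n_frames), dtype=complex)
active = rng.random((n_src, n_frames)) < 0.15
s_true[active] = (rng.standard_normal(active.sum())
                  + 1j * rng.standard_normal(active.sum()))

A = ctf_matrix(filters, n_frames)
x = A @ s_true.ravel()
lam = 0.05
s_hat = lasso_ista(A, x, lam).reshape(n_src, n_frames)
print("objective at zero:    ", objective(A, x, np.zeros(A.shape[1]), lam))
print("objective at estimate:", objective(A, x, s_hat.ravel(), lam))
```

In a full separation system a problem of this form would be solved independently at every frequency bin, and the estimated coefficients would then be passed through an inverse short-time Fourier transform. The dense matrix built here is only practical for small toy sizes.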


Pages: 541-545

Number of pages: 5

