AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION

被引:0
作者
Li, Xiaofei [1 ]
Girin, Laurent [1 ,2 ,3 ]
Horaud, Radu [1 ]
机构
[1] INRIA Grenoble Rhone Alpes, Montbonnot St Martin, France
[2] GIPSA Lab, St Martin Dheres, France
[3] Univ Grenoble Alpes, Grenoble, France
来源
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年
基金
欧盟第七框架计划;
关键词
Source separation; convolutive transfer function; l(1)-norm regularization; RELATIVE TRANSFER-FUNCTION; MIXTURES; IDENTIFICATION; APPROXIMATION; SHRINKAGE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely-used multiplicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with a l(1)-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.
引用
收藏
页码:541 / 545
页数:5
相关论文
共 50 条
[31]   Frequency-domain optimization of fixed-structure controllers [J].
van Solingen, E. ;
van Wingerden, J. W. ;
Oomen, T. .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2018, 28 (12) :3784-3805
[32]   Acoustic source separation of convolutive mixtures based on intensity vector statistics [J].
Gunel, Banu ;
Hacihabiboglu, Hueseyin ;
Kondoz, Ahmet M. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (04) :748-756
[33]   Eliminating the Permutation Ambiguity of Convolutive Blind Source Separation by Using Coupled Frequency Bins [J].
Xie, Kan ;
Zhou, Guoxu ;
Yang, Junjie ;
He, Zhaoshui ;
Xie, Shengli .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) :589-599
[34]   Cycle GAN-Based Audio Source Separation Using Time–Frequency Masking [J].
Sujo Joseph ;
Rajeev Rajan .
Circuits, Systems, and Signal Processing, 2023, 42 :1163-1180
[35]   Convolutive blind source separation based on joint block Toeplitzation and block-inner diagonalization [J].
Xu, Xian-Feng ;
Feng, Da-Zheng ;
Zheng, Wei Xing ;
Zhang, Hua .
SIGNAL PROCESSING, 2010, 90 (01) :119-133
[36]   Adapted Deflation Approach for Referenced Contrast Optimization in Blind MIMO Convolutive Source Separation [J].
Dubroca, Remi ;
De Luigi, Christophe ;
Moreau, Eric .
INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 :243-250
[37]   Cooperative Optimization-Based Frequency-Domain Curve Fitting: Enhancing Accuracy and Efficiency [J].
Sato, Shimpei ;
Maeda, Yoshihiro .
IEEE OPEN JOURNAL OF THE INDUSTRIAL ELECTRONICS SOCIETY, 2025, 6 :309-319
[38]   Cycle GAN-Based Audio Source Separation Using Time-Frequency Masking [J].
Joseph, Sujo ;
Rajan, Rajeev .
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (02) :1163-1180
[39]   Frequency-Domain Pearson Distribution Approach for Independent Component Analysis (FD-Pearson-ICA) in Blind Source Separation [J].
Solvang, Hiroko Kato ;
Nagahara, Yuichi ;
Araki, Shoko ;
Sawada, Hiroshi ;
Makino, Shoji .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04) :639-649
[40]   Homotopy optimisation based NMF for audio source separation [J].
Koundinya, Sriharsha ;
Karmakar, Abhijit .
IET SIGNAL PROCESSING, 2018, 12 (09) :1099-1106