AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION

被引:0
作者
Li, Xiaofei [1 ]
Girin, Laurent [1 ,2 ,3 ]
Horaud, Radu [1 ]
机构
[1] INRIA Grenoble Rhone Alpes, Montbonnot St Martin, France
[2] GIPSA Lab, St Martin Dheres, France
[3] Univ Grenoble Alpes, Grenoble, France
来源
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年
基金
欧盟第七框架计划;
关键词
Source separation; convolutive transfer function; l(1)-norm regularization; RELATIVE TRANSFER-FUNCTION; MIXTURES; IDENTIFICATION; APPROXIMATION; SHRINKAGE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely-used multiplicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with a l(1)-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.
引用
收藏
页码:541 / 545
页数:5
相关论文
共 50 条
  • [1] AN EM ALGORITHM FOR AUDIO SOURCE SEPARATION BASED ON THE CONVOLUTIVE TRANSFER FUNCTION
    Li, Xiaofei
    Girin, Laurent
    Horaud, Radu
    2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 56 - 60
  • [2] A frequency domain method for blind source separation of convolutive audio mixtures
    Rahbar, K
    Reilly, JP
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 832 - 844
  • [3] Blind source separation based on time-domain optimization of a frequency-domain independence criterion
    Mei, Tiemin
    Xi, Jiangtao
    Yin, Fuliang
    Mertins, Alfred
    Chicharo, Joe F.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 2075 - 2085
  • [4] Expectation-maximisation for speech source separation using convolutive transfer function
    Li, Xiaofei
    Girin, Laurent
    Horaud, Radu
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2019, 4 (01) : 47 - 53
  • [5] Convolutive Transfer Function-Based Multichannel Nonnegative Matrix Factorization for Overdetermined Blind Source Separation
    Wang, Taihui
    Yang, Feiran
    Yang, Jun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 802 - 815
  • [6] Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function
    Li, Xiaofei
    Girin, Laurent
    Gannot, Sharon
    Horaud, Radu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (03) : 645 - 659
  • [7] A PARTITIONED FREQUENCY DOMAIN ALGORITHM FOR CONVOLUTIVE BLIND SOURCE SEPARATION
    Scarpiniti, Michele
    Picaro, Andrea
    Parisi, Raffaele
    Uncini, Aurelio
    2009 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2009, : 411 - 416
  • [8] Joint dereverberation and blind source separation using a hybrid autoregressive and convolutive transfer function-based model
    Liu, Shengdong
    Yang, Feiran
    Chen, Rilin
    Yang, Jun
    APPLIED ACOUSTICS, 2024, 224
  • [9] Convolutive transfer function-based independent component analysis for overdetermined blind source separation
    Wang, Taihui
    Yang, Feiran
    Li, Nan
    Zhang, Chen
    Yang, Jun
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 22 - 26
  • [10] Frequency-domain implementation of a time-domain blind separation algorithm for convolutive mixtures of sources
    Ohata, Masashi
    Matsuoka, Kiyotoshi
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2007, 4666 : 528 - +