AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION

被引:0
作者
Li, Xiaofei [1 ]
Girin, Laurent [1 ,2 ,3 ]
Horaud, Radu [1 ]
机构
[1] INRIA Grenoble Rhone Alpes, Montbonnot St Martin, France
[2] GIPSA Lab, St Martin Dheres, France
[3] Univ Grenoble Alpes, Grenoble, France
来源
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年
基金
欧盟第七框架计划;
关键词
Source separation; convolutive transfer function; l(1)-norm regularization; RELATIVE TRANSFER-FUNCTION; MIXTURES; IDENTIFICATION; APPROXIMATION; SHRINKAGE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely-used multiplicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with a l(1)-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.
引用
收藏
页码:541 / 545
页数:5
相关论文
共 50 条
  • [21] Reverberant Audio Blind Source Separation via Local Convolutive Independent Vector Analysis
    Feng, Fangchen
    Begdadi, Azeddine
    2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
  • [22] Convolutive Audio Source Separation Using Robust ICA and Reduced Likelihood Ratio Jump
    Mallis, Dimitrios
    Sgouros, Thomas
    Mitianoudis, Nikolaos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2016, 2016, 475 : 230 - 241
  • [23] Source Separation Based on Transfer Function between Microphones and its Dispersion
    Kohmura, Sayuri
    Togawa, Taro
    Otani, Takeshi
    2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017,
  • [24] Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function
    Li, Xiaofei
    Gannot, Sharon
    Girin, Laurent
    Horaud, Radu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) : 1755 - 1768
  • [25] Audio source separation with multiple microphones on time-frequency representations
    Sawada, Hiroshi
    INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING XI, 2013, 8750
  • [26] Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
    Reju, Vaninirappuputhenpurayil Gopalan
    Koh, Soo Ngee
    Soon, Ing Yann
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 101 - 116
  • [27] Design of Frequency-domain Controlled Source for Electromagnetic Prospecting Based on Multi-frequency Resonance
    Zhu, Wang
    Liu, Yan
    Zhu, Xuegui
    PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 1819 - 1822
  • [28] Underdetermined blind separation of audio sources from the time-frequency representation of their convolutive mixtures
    Aissa-El-Bey, Abdeldjalil
    Abed-Meraim, Karim
    Grenier, Yves
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 153 - 156
  • [29] Time-Domain Audio Source Separation With Neural Networks Based on Multiresolution Analysis
    Nakamura, Tomohiko
    Kozuka, Shihori
    Saruwatari, Hiroshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1687 - 1701
  • [30] Frequency-domain optimization of fixed-structure controllers
    van Solingen, E.
    van Wingerden, J. W.
    Oomen, T.
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2018, 28 (12) : 3784 - 3805