AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION

被引:0
作者
Li, Xiaofei [1 ]
Girin, Laurent [1 ,2 ,3 ]
Horaud, Radu [1 ]
机构
[1] INRIA Grenoble Rhone Alpes, Montbonnot St Martin, France
[2] GIPSA Lab, St Martin Dheres, France
[3] Univ Grenoble Alpes, Grenoble, France
来源
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017年
基金
欧盟第七框架计划;
关键词
Source separation; convolutive transfer function; l(1)-norm regularization; RELATIVE TRANSFER-FUNCTION; MIXTURES; IDENTIFICATION; APPROXIMATION; SHRINKAGE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely-used multiplicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with a l(1)-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.
引用
收藏
页码:541 / 545
页数:5
相关论文
共 50 条
[41]   Genetic algorithm optimized frequency-domain convolutional blind source separation for multiple leakage locations in water supply pipeline [J].
Liu, Hongjin ;
Fang, Hongyuan ;
Yu, Xiang ;
Xia, Yangyang .
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2025, 40 (09) :1235-1252
[42]   A Survey of Optimization Methods for Independent Vector Analysis in Audio Source Separation [J].
Guo, Ruiming ;
Luo, Zhongqiang ;
Li, Mingchun .
SENSORS, 2023, 23 (01)
[43]   A Unifying View on Blind Source Separation of Convolutive Mixtures Based on Independent Component Analysis [J].
Brendel, Andreas ;
Haubner, Thomas ;
Kellermann, Walter .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 :816-830
[44]   ON THE USE OF CONTEXTUAL TIME-FREQUENCY INFORMATION FOR FULL-BAND CLUSTERING-BASED CONVOLUTIVE BLIND SOURCE SEPARATION [J].
Atcheson, Matt ;
Jafari, Ingrid ;
Togneri, Roberto ;
Nordholm, Sven .
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[45]   Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation [J].
Lee, Seokjin ;
Park, Sang Ha ;
Sung, Koeng-Mo .
IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (01) :43-46
[46]   Convolutive Blind Source Separation for Communication Signals Based on the Sliding Z-Transform [J].
Jia, Yinjie ;
Xu, Pengfei .
IEEE ACCESS, 2020, 8 :41213-41219
[47]   Underdetermined Convolutive Blind Source Separation via Frequency Bin-Wise Clustering and Permutation Alignment [J].
Sawada, Hiroshi ;
Araki, Shoko ;
Makino, Shoji .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (03) :516-527
[48]   Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models [J].
Itakura, Kousuke ;
Bando, Yoshiaki ;
Nakamura, Eita ;
Itoyama, Katsutoshi ;
Yoshii, Kazuyoshi ;
Kawahara, Tatsuya .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) :831-846
[49]   Underdetermined Blind Source Separation by Parallel Factor Analysis in Time-Frequency Domain [J].
Yang, Liu ;
Lv, Jun ;
Xiang, Yong .
COGNITIVE COMPUTATION, 2013, 5 (02) :207-214
[50]   Data-driven frequency-domain iterative learning control with transfer learning [J].
Lee, Yu-Hsiu ;
Chin, Yu-Hsiang ;
Hsueh, Chun-Yuan .
MECHATRONICS, 2025, 108