AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION

Cited by: 0

Authors:

Li, Xiaofei [1]

Girin, Laurent [1, 2, 3]

Horaud, Radu [1]

Affiliations:

[1] INRIA Grenoble Rhone Alpes, Montbonnot St Martin, France

[2] GIPSA Lab, St Martin Dheres, France

[3] Univ Grenoble Alpes, Grenoble, France

Source:

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017

Funding:

European Union Seventh Framework Programme (FP7);

Keywords:

Source separation; convolutive transfer function; l(1)-norm regularization; RELATIVE TRANSFER-FUNCTION; MIXTURES; IDENTIFICATION; APPROXIMATION; SHRINKAGE;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely-used multiplicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with an ℓ1-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.
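The Lasso step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the solver (ISTA with complex soft-thresholding), the function names, the regularization weight `lam`, and the random matrix `H` are all assumptions chosen for illustration. `H` stands in for a known mixing operator at one frequency; in a CTF model it would encode a convolution along STFT frames, whereas here it is simply a random under-determined complex matrix.

```python
import numpy as np

def soft_threshold(z, tau):
    """Complex soft-thresholding: shrink magnitudes by tau, keep phases."""
    mag = np.abs(z)
    scale = np.maximum(1.0 - tau / np.maximum(mag, 1e-12), 0.0)
    return scale * z

def lasso_cost(H, x, s, lam):
    """Mixture fitting cost plus l1 penalty: 0.5*||x - H s||^2 + lam*||s||_1."""
    return 0.5 * np.linalg.norm(x - H @ s) ** 2 + lam * np.sum(np.abs(s))

def ista_lasso(H, x, lam, n_iter=500):
    """Minimize the Lasso cost over complex s with ISTA (illustrative solver)."""
    # Step size 1/L, where L is the squared spectral norm of H.
    step = 1.0 / np.linalg.norm(H, 2) ** 2
    s = np.zeros(H.shape[1], dtype=complex)
    for _ in range(n_iter):
        grad = H.conj().T @ (H @ s - x)  # gradient of the quadratic fitting term
        s = soft_threshold(s - step * grad, step * lam)
    return s

# Hypothetical under-determined problem: 20 observations, 40 unknowns,
# with a sparse ground truth (3 nonzero coefficients).
rng = np.random.default_rng(0)
H = rng.standard_normal((20, 40)) + 1j * rng.standard_normal((20, 40))
s_true = np.zeros(40, dtype=complex)
s_true[[3, 17, 31]] = [2.0 + 1.0j, -1.5j, 1.0 - 2.0j]
x = H @ s_true

s_hat = ista_lasso(H, x, lam=0.1)
print("cost at zero:    ", lasso_cost(H, x, np.zeros(40, dtype=complex), 0.1))
print("cost at estimate:", lasso_cost(H, x, s_hat, 0.1))
```

The soft-thresholding operator is what enforces sparsity: coefficients whose magnitude falls below the threshold are set exactly to zero, which matches the spectral sparsity assumption mentioned in the abstract.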


Pages: 541-545

Page count: 5
