Eliminating the Permutation Ambiguity of Convolutive Blind Source Separation by Using Coupled Frequency Bins

被引：23

作者：

Xie, Kan ^{[1
,2
]}

Zhou, Guoxu ^{[1
,3
]}

Yang, Junjie ^{[1
]}

He, Zhaoshui ^{[1
]}

Xie, Shengli ^{[1
,4
]}

机构：

[1] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China

[2] Guangdong Key Lab IoT Informat Proc, Guangzhou 510006, Peoples R China

[3] Guangdong Univ Technol, Key Lab, Minist Educ, Guangzhou 510006, Peoples R China

[4] Guangdong Univ Technol, State Key Lab Precis Elect Mfg Technol & Equipmen, Guangzhou 510006, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2020年 / 31卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Convolutive blind source separation (CBSS); independent component analysis; permutation ambiguity; tensor decomposition; COMPONENT ANALYSIS; ALGORITHMS; MIXTURES; TENSOR;

D O I：

10.1109/TNNLS.2019.2906833

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Blind source separation (BSS) is a typical unsupervised learning method that extracts latent components from their observations. In the meanwhile, convolutive BSS (CBSS) is particularly challenging as the observations are the mixtures of latent components as well as their delayed versions. CBSS is usually solved in frequency domain since convolutive mixtures in time domain is just instantaneous mixtures in frequency domain, which allows to recover source frequency components independently of each frequency bin by running ordinary BSS, and then concatenate them to form the Fourier transformation of source signals. Because BSS has inherent permutation ambiguity, this category of CBSS methods suffers from a common drawback: it is very difficult to choose the frequency components belonging to a specific source as they are estimated from different frequency bins using BSS. This paper presents a tensor framework that can completely eliminate the permutation ambiguity. By combining each frequency bin with an anchor frequency bin that is chosen arbitrarily in advance, we establish a new virtual BSS model where the corresponding correlation matrices comply with a block tensor decomposition (BTD) model. The essential uniqueness of BTD and the sparse structure of coupled mixing parameters allow the estimation of the mixing matrices free of permutation ambiguity. Extensive simulation results confirmed that the proposed algorithm could achieve higher separation accuracy compared with the state-of-the-art methods.

引用

页码：589 / 599

页数：11

共 39 条

[1] Sequential Independent Component Analysis Density Estimation [J].

Aladjem, Mayer ;

Israeli-Ran, Itamar ;

Bortman, Maria .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) :5084-5097

[2] IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].

ALLEN, JB ;

BERKLEY, DA .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950

[3] Underdetermined Convolutive BSS: Bayes Risk Minimization Based on a Mixture of Super-Gaussian Posterior Approximation [J].

Cho, Janghoon ;

Yoo, Chang D. .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (05) :828-839

[4]

Cichocki A., 2003, ADAPTIVE BLIND SIGNA

[5] DECOMPOSITIONS OF A HIGHER-ORDER TENSOR IN BLOCK TERMS-PART II: DEFINITIONS AND UNIQUENESS [J].

De Lathauwer, Lieven .

SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2008, 30 (03) :1033-1066

[6] DECOMPOSITIONS OF A HIGHER-ORDER TENSOR IN BLOCK TERMS-PART I: LEMMAS FOR PARTITIONED MATRICES [J].

De Lathauwer, Lieven .

SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2008, 30 (03) :1022-1032

[7] A near real-time approach for convolutive blind source separation [J].

Ding, S ;

Huang, J ;

Wei, D ;

Cichocki, A .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2006, 53 (01) :114-128

[8] Spatio-temporal FastICA algorithms for the blind separation of convolutive mixtures [J].

Douglas, Scott C. ;

Gupta, Malay ;

Sawada, Hiroshi ;

Makino, Shoji .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05) :1511-1520

[9] Blind Separation of Quasi-Stationary Sources: Exploiting Convex Geometry in Covariance Domain [J].

Fu, Xiao ;

Ma, Wing-Kin ;

Huang, Kejun ;

Sidiropoulos, Nicholas D. .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (09) :2306-2320

[10] A nonunitary joint block diagonalization algorithm for blind separation of convolutive mixtures of sources [J].

Ghennioui, Hicham ;

Fadaili, El Mostafa ;

Thirion-Moreau, Nadge ;

Adib, Abdellah ;

Moreau, Eric .

IEEE SIGNAL PROCESSING LETTERS, 2007, 14 (11) :860-863

← 1 2 3 4 →