Eliminating the Permutation Ambiguity of Convolutive Blind Source Separation by Using Coupled Frequency Bins

被引:22
作者
Xie, Kan [1 ,2 ]
Zhou, Guoxu [1 ,3 ]
Yang, Junjie [1 ]
He, Zhaoshui [1 ]
Xie, Shengli [1 ,4 ]
机构
[1] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
[2] Guangdong Key Lab IoT Informat Proc, Guangzhou 510006, Peoples R China
[3] Guangdong Univ Technol, Key Lab, Minist Educ, Guangzhou 510006, Peoples R China
[4] Guangdong Univ Technol, State Key Lab Precis Elect Mfg Technol & Equipmen, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutive blind source separation (CBSS); independent component analysis; permutation ambiguity; tensor decomposition; COMPONENT ANALYSIS; ALGORITHMS; MIXTURES; TENSOR;
D O I
10.1109/TNNLS.2019.2906833
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Blind source separation (BSS) is a typical unsupervised learning method that extracts latent components from their observations. In the meanwhile, convolutive BSS (CBSS) is particularly challenging as the observations are the mixtures of latent components as well as their delayed versions. CBSS is usually solved in frequency domain since convolutive mixtures in time domain is just instantaneous mixtures in frequency domain, which allows to recover source frequency components independently of each frequency bin by running ordinary BSS, and then concatenate them to form the Fourier transformation of source signals. Because BSS has inherent permutation ambiguity, this category of CBSS methods suffers from a common drawback: it is very difficult to choose the frequency components belonging to a specific source as they are estimated from different frequency bins using BSS. This paper presents a tensor framework that can completely eliminate the permutation ambiguity. By combining each frequency bin with an anchor frequency bin that is chosen arbitrarily in advance, we establish a new virtual BSS model where the corresponding correlation matrices comply with a block tensor decomposition (BTD) model. The essential uniqueness of BTD and the sparse structure of coupled mixing parameters allow the estimation of the mixing matrices free of permutation ambiguity. Extensive simulation results confirmed that the proposed algorithm could achieve higher separation accuracy compared with the state-of-the-art methods.
引用
收藏
页码:589 / 599
页数:11
相关论文
共 50 条
  • [1] Convolutive audio source separation using robust ICA and an intelligent evolving permutation ambiguity solution
    Mallis, Dimitrios
    Sgouros, Thomas
    Mitianoudis, Nikolaos
    EVOLVING SYSTEMS, 2018, 9 (04) : 315 - 329
  • [2] Convolutive audio source separation using robust ICA and an intelligent evolving permutation ambiguity solution
    Dimitrios Mallis
    Thomas Sgouros
    Nikolaos Mitianoudis
    Evolving Systems, 2018, 9 : 315 - 329
  • [3] Low Complexity Blind Separation Technique to Solve the Permutation Ambiguity of Convolutive Speech Mixtures
    Lima, Pedro F. C.
    Miranda, Ricardo Kehrle
    da Costa, Joao Paulo C. L.
    Zelenovsky, Ricardo
    Yuan, Yizheng
    Del Galdo, Giovanni
    2016 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2016,
  • [4] Nonparametric Bayesian sparse factor analysis for frequency domain blind source separation without permutation ambiguity
    Nagira, Kohei
    Otsuka, Takuma
    Okuno, Hiroshi G.
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,
  • [5] Underdetermined Convolutive Blind Source Separation via Frequency Bin-Wise Clustering and Permutation Alignment
    Sawada, Hiroshi
    Araki, Shoko
    Makino, Shoji
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (03): : 516 - 527
  • [6] Blind source separation of convolutive mixtures of speech in frequency domain
    Makino, S
    Sawada, H
    Mukai, R
    Araki, S
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1640 - 1655
  • [7] A PARTITIONED FREQUENCY DOMAIN ALGORITHM FOR CONVOLUTIVE BLIND SOURCE SEPARATION
    Scarpiniti, Michele
    Picaro, Andrea
    Parisi, Raffaele
    Uncini, Aurelio
    2009 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2009, : 411 - 416
  • [8] Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
    Reju, Vaninirappuputhenpurayil Gopalan
    Koh, Soo Ngee
    Soon, Ing Yann
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 101 - 116
  • [9] A Sparsity-Based Method to Solve Permutation Indeterminacy in Frequency-Domain Convolutive Blind Source Separation
    Sudhakar, Prasad
    Gribonval, Remi
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 338 - 345
  • [10] TENSORIAL CONVOLUTIVE BLIND SOURCE SEPARATION
    Thanh Trung Le
    Abed-Meraim, Karim
    Ravier, Philippe
    Buttelli, Olivier
    Holobar, Ales
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 5575 - 5579