Blind Separation of Convolutive Speech Mixtures Based on Local Sparsity and K-means

被引:0
作者
Huang, Yuyang [1 ]
Chu, Ping [1 ]
Liao, Bin [1 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
来源
28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020) | 2021年
基金
中国国家自然科学基金;
关键词
Blind source separation; convolutive speech mixture; K-means; permutation ambiguity;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, an accurate and efficient blind source separation method based on local sparsity and K-means (LSK-BSS) is proposed. Specifically, the proposed LSK-BSS approach exploits the local sparsity of speech sources in the transformed domain to obtain closed-form solution for per-frequency mixing system estimation. On this basis, through designing superior initial points of clustering, the well-established K-means algorithm is employed to achieve accurate permutation alignment. Simulations with real reverberant speech sources show that the LSK-BSS approach yields competitive efficiency, robustness and effectiveness, in comparison with the state-of-the-arts methods.
引用
收藏
页码:271 / 275
页数:5
相关论文
共 50 条
  • [1] BLIND SEPARATION OF CONVOLUTIVE MIXTURES OF SPEECH SOURCES: EXPLOITING LOCAL SPARSITY
    Fu, Xiao
    Ma, Wing-Kin
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 4315 - 4319
  • [2] Subband-based blind separation for convolutive mixtures of speech
    Araki, S
    Makino, S
    Aichner, R
    Nishikawa, T
    Saruwatari, H
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (12) : 3593 - 3603
  • [3] Blind source separation of convolutive mixtures of speech in frequency domain
    Makino, S
    Sawada, H
    Mukai, R
    Araki, S
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1640 - 1655
  • [4] Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures
    Nion, Dimitri
    Mokios, Kleanthis N.
    Sidiropoulos, Nicholas D.
    Potamianos, Alexandros
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1193 - 1207
  • [5] Subband-based blind signal processing for source separation in convolutive mixtures of speech
    Kokkinakis, Kostas
    Loizou, Philipos C.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 917 - +
  • [6] Blind separation of convolutive mixtures by decorrelation
    Mei, TM
    Yin, FL
    SIGNAL PROCESSING, 2004, 84 (12) : 2297 - 2313
  • [7] Blind separation of convolutive image mixtures
    Shwartz, Sarit
    Schechner, Yoav Y.
    Zibulevsky, Michael
    NEUROCOMPUTING, 2008, 71 (10-12) : 2164 - 2179
  • [8] Blind source separation of convolutive mixtures
    Makino, Shoji
    INDEPENDENT COMPONENT ANALYSES, WAVELETS, UNSUPERVISED SMART SENSORS, AND NEURAL NETWORKS IV, 2006, 6247
  • [9] The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech
    Araki, S
    Mukai, R
    Makino, S
    Nishikawa, T
    Saruwatari, H
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (02): : 109 - 116
  • [10] A SPARSITY BASED CRITERION FOR SOLVING THE PERMUTATION AMBIGUITY IN CONVOLUTIVE BLIND SOURCE SEPARATION
    Mazur, Radoslaw
    Mertins, Alfred
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1996 - 1999