Blind Separation of Convolutive Speech Mixtures Based on Local Sparsity and K-means

被引：0

作者：

Huang, Yuyang ^{[1
]}

Chu, Ping ^{[1
]}

Liao, Bin ^{[1
]}

机构：

[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China

来源：

28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020) | 2021年

基金：

中国国家自然科学基金;

关键词：

Blind source separation; convolutive speech mixture; K-means; permutation ambiguity;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, an accurate and efficient blind source separation method based on local sparsity and K-means (LSK-BSS) is proposed. Specifically, the proposed LSK-BSS approach exploits the local sparsity of speech sources in the transformed domain to obtain closed-form solution for per-frequency mixing system estimation. On this basis, through designing superior initial points of clustering, the well-established K-means algorithm is employed to achieve accurate permutation alignment. Simulations with real reverberant speech sources show that the LSK-BSS approach yields competitive efficiency, robustness and effectiveness, in comparison with the state-of-the-arts methods.

引用

页码：271 / 275

页数：5

共 50 条

[1] BLIND SEPARATION OF CONVOLUTIVE MIXTURES OF SPEECH SOURCES: EXPLOITING LOCAL SPARSITY
Fu, Xiao
Ma, Wing-Kin
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 4315 - 4319
[2] Subband-based blind separation for convolutive mixtures of speech
Araki, S
Makino, S
Aichner, R
Nishikawa, T
Saruwatari, H
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (12) : 3593 - 3603
[3] Blind source separation of convolutive mixtures of speech in frequency domain
Makino, S
Sawada, H
Mukai, R
Araki, S
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1640 - 1655
[4] Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures
Nion, Dimitri
Mokios, Kleanthis N.
Sidiropoulos, Nicholas D.
Potamianos, Alexandros
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1193 - 1207
[5] Subband-based blind signal processing for source separation in convolutive mixtures of speech
Kokkinakis, Kostas
Loizou, Philipos C.
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 917 - +
[6] Blind separation of convolutive mixtures by decorrelation
Mei, TM
Yin, FL
SIGNAL PROCESSING, 2004, 84 (12) : 2297 - 2313
[7] Blind separation of convolutive image mixtures
Shwartz, Sarit
Schechner, Yoav Y.
Zibulevsky, Michael
NEUROCOMPUTING, 2008, 71 (10-12) : 2164 - 2179
[8] Blind source separation of convolutive mixtures
Makino, Shoji
INDEPENDENT COMPONENT ANALYSES, WAVELETS, UNSUPERVISED SMART SENSORS, AND NEURAL NETWORKS IV, 2006, 6247
[9] The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech
Araki, S
Mukai, R
Makino, S
Nishikawa, T
Saruwatari, H
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (02): : 109 - 116
[10] A SPARSITY BASED CRITERION FOR SOLVING THE PERMUTATION AMBIGUITY IN CONVOLUTIVE BLIND SOURCE SEPARATION
Mazur, Radoslaw
Mertins, Alfred
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1996 - 1999

← 1 2 3 4 5 →