Partial separation method for solving permutation problem in frequency domain blind source separation of speech signals

Cited by: 9
Authors
Reju, V. G. [1 ]
Koh, Soo Ngee [1 ]
Soon, Ing Yann [1 ]
Affiliations
[1] Nanyang Technological University, School of Electrical and Electronic Engineering, Singapore 639798, Singapore
Keywords
blind source separation; convolutive mixture; direction of arrival; independent component analysis; permutation problem
DOI
10.1016/j.neucom.2007.08.030
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the well-known permutation problem in frequency-domain blind source separation. The proposed method uses the correlation between two signals in each DFT bin to solve the permutation problem. One of the signals is partially separated by a time-domain blind source separation method, and the other is obtained by the frequency-domain blind source separation method. Two ways of configuring the time-domain and frequency-domain blocks, in parallel or in cascade, have been studied. The cascaded configuration not only achieves better separation performance but also reduces the computational cost compared with the parallel configuration. (C) 2008 Elsevier B.V. All rights reserved.
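The per-bin correlation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which correlation measure is used, so the sketch assumes a Pearson correlation between magnitude envelopes, and the names `envelope_correlation`, `align_permutations`, `fd_out`, and `ref_out` are hypothetical. Both inputs are assumed to be arrays of shape (bins, sources, frames): `fd_out` holds the frequency-domain separation outputs, and `ref_out` holds the DFT-domain version of the partially separated time-domain outputs.

```python
from itertools import permutations

import numpy as np


def envelope_correlation(a, b):
    """Pearson correlation between the magnitude envelopes of two bin signals.

    Assumption: the abstract only says "correlation"; magnitude envelopes
    are used here purely for illustration.
    """
    a = np.abs(a) - np.abs(a).mean()
    b = np.abs(b) - np.abs(b).mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


def align_permutations(fd_out, ref_out):
    """Reorder the sources in every DFT bin to best match the reference.

    fd_out, ref_out: complex arrays of shape (n_bins, n_src, n_frames).
    Returns the reordered copy of fd_out and the permutation chosen per bin.
    """
    n_bins, n_src, _ = fd_out.shape
    aligned = np.empty_like(fd_out)
    chosen = []
    for k in range(n_bins):
        # Exhaustive search over orderings; practical only for few sources.
        best = max(
            permutations(range(n_src)),
            key=lambda p: sum(
                envelope_correlation(fd_out[k, p[i]], ref_out[k, i])
                for i in range(n_src)
            ),
        )
        aligned[k] = fd_out[k, list(best)]
        chosen.append(best)
    return aligned, chosen
```

Because each bin is compared against the same partially separated reference rather than against neighbouring bins, an error in one bin does not propagate to the others in this sketch.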
Pages: 2098-2112
Number of pages: 15