A low-complexity permutation alignment method for frequency-domain blind source separation

被引:9
作者
Kang, Fang [1 ,3 ]
Yang, Feiran [1 ,2 ,3 ]
Yang, Jun [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Noise & Vibrat Res, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Acoust, State Key Lab Acoust, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
基金
国家重点研发计划;
关键词
Blind source separation (BSS); Permutation problem; Local permutation alignment; Global correction; Computational complexity; INDEPENDENT VECTOR ANALYSIS; SPEECH SEPARATION; ROBUST; NETWORKS; ICA;
D O I
10.1016/j.specom.2019.11.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Frequency-domain blind source separation is an effective way to separate the signals from convolutive mixtures. The independence component analysis (ICA) is commonly employed to separate signals in each frequency bin, resulting in the well-known permutation problem. To resolve this problem, we present a low-complexity permutation alignment method based on the inter-frequency dependence of signal power ratio. A bin-wise permutation alignment is first carried out across all the frequency bins by measuring the correlation between the current frequency bin and the previous one, but only the permutation with a high confidence is fixed. The permutation with low confidence is then determined by maximizing the correlation between the current frequency bin and a local centroid, which is calculated from a set of determined frequency bins with high confidence. By so doing, the permutation for most frequency bins is aligned without iterations. Finally, a clustering algorithm with centroids is adopted to achieve the fine global optimization in the fullband with only a few iterations. Experiment results show that the proposed method achieves a comparable performance with the state-of-the-art permutation alignment schemes, but the new method achieves a significant computational saving.
引用
收藏
页码:88 / 94
页数:7
相关论文
共 35 条
[1]  
Anemuller J., 2000, Second International Workshop on Independent Component Analysis and Blind Signal Separation. Proceedings, P215
[2]  
[Anonymous], 2009, NONNEGATIVE MATRIX T
[3]  
Araki S, 2007, INT CONF ACOUST SPEE, P41
[4]  
Araki S, 2015, INT CONF ACOUST SPEE, P116, DOI 10.1109/ICASSP.2015.7177943
[5]  
Asano F, 2001, INT CONF ACOUST SPEE, P2729, DOI 10.1109/ICASSP.2001.940210
[6]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[7]   BLIND SEPARATION OF SOURCES .2. PROBLEMS STATEMENT [J].
COMON, P ;
JUTTEN, C ;
HERAULT, J .
SIGNAL PROCESSING, 1991, 24 (01) :11-20
[8]   Robust binaural speech separation in adverse conditions based on deep neural network with modified spatial features and training target [J].
Dadvar, Paria ;
Geravanchizadeh, Masoud .
SPEECH COMMUNICATION, 2019, 108 :41-52
[9]  
Herault J., 1986, AIP Conference Proceedings, P206, DOI 10.1063/1.36258
[10]  
Hiroe A, 2006, LECT NOTES COMPUT SC, V3889, P601