SIMULTANEOUS CLUSTERING OF MIXING AND SPECTRAL MODEL PARAMETERS FOR BLIND SPARSE SOURCE SEPARATION

被引:11
作者
Araki, Shoko [1 ]
Nakatani, Tomohiro [1 ]
Sawada, Hiroshi [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Seika, Kyoto 6190237, Japan
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
关键词
Time-frequency mask; spectral model; common amplitude modulation (AM); EM algorithm; MIXTURE;
D O I
10.1109/ICASSP.2010.5496283
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a sparse source separation method which clusters the phase difference between the microphone observations and the amplitude modulation (AM) of the source spectrum simultaneously. The phase difference clustering separates the signals in each frequency bin, and the AM clustering corresponds to permutation alignment. Because the proposed method has an inherent ability to align the permutation of frequency components, the proposed method can be applied even when the spatial aliasing problem occurs. Moreover, because the common AM property collects the synchronized frequency components, we can model the microphone observations with a small number of sources. This property enables us to count the number of sources. That is, the proposed method can be applied even if the number of sources is unknown. The experimental results confirm the effectiveness of our proposed method.
引用
收藏
页码:5 / 8
页数:4
相关论文
共 11 条
[1]  
[Anonymous], P IEEE WORKSH APPL S
[2]   Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors [J].
Araki, Shoko ;
Sawada, Hiroshi ;
Mukai, Ryo ;
Makino, Shoji .
SIGNAL PROCESSING, 2007, 87 (08) :1833-1847
[3]   BLIND SPARSE SOURCE SEPARATION FOR UNKNOWN NUMBER OF SOURCES USING GAUSSIAN MIXTURE MODEL FITTING WITH DIRICHLET PRIOR [J].
Araki, Shoko ;
Nakatani, Tomohiro ;
Sawada, Hiroshi ;
Makino, Shoji .
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, :33-36
[4]  
BROWN GJ, 1992, THESIS U SHEFFIELD
[5]  
Hyvärinen A, 2001, INDEPENDENT COMPONENT ANALYSIS: PRINCIPLES AND PRACTICE, P71
[6]   Sparseness-based 2ch BSS using the em algorithm in reverberant environment [J].
Izumi, Yosuke ;
Ono, Nobutaka ;
Sagayama, Shigeki .
2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, :147-150
[7]  
Mandel M., 2006, P NEURAL INFO P SYS
[8]   An approach to blind source separation based on temporal structure of speech signals [J].
Murata, N ;
Ikeda, S ;
Ziehe, A .
NEUROCOMPUTING, 2001, 41 :1-24
[9]  
Nakatani T., ICASSP 2010 UNPUB
[10]  
O'Grady PD, 2004, LECT NOTES COMPUT SC, V3195, P430