Blind extraction of a dominant source from mixtures of many sources using ICA and time-frequency masking

Cited by: 11
Authors
Sawada, H [1]
Araki, S [1]
Mukai, R [1]
Makino, S [1]
Affiliations
[1] NTT Corp, NTT Commun Sci Labs, Kyoto 6190237, Japan
Source
2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS | 2005
Keywords
DOI
10.1109/ISCAS.2005.1465977
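Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification codes
0808; 0809;
Abstract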
This paper presents a method for enhancing a target source of interest while suppressing interfering sources. The target source is assumed to be close to the sensors, to have dominant power at those sensors, and to be non-Gaussian. The enhancement is performed blindly, i.e., without knowing the total number of sources or information about each source, such as its position or active time. We consider the general case where the number of sources exceeds the number of sensors. We employ a two-stage process: independent component analysis (ICA) is first applied in each frequency bin, and time-frequency masking is then used to improve performance further. We propose a new method for selecting the frequency components of the target source, and a new criterion for specifying the time-frequency masks. Experimental results for simulated cocktail-party situations in a room with a reverberation time of 130 ms show the effectiveness and characteristics of the proposed method.
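The second stage of the process described in the abstract can be illustrated with a minimal sketch of a binary time-frequency mask for a single frequency bin. The abstract does not state the paper's mask criterion, so the angle-based rule used here is an assumption chosen for illustration only. The function name `angle_mask`, the target vector `a` (assumed to come from an earlier per-bin ICA stage, which is not shown), and the threshold `theta_max` are all hypothetical.

```python
import numpy as np

def angle_mask(X, a, theta_max):
    """Binary time-frequency mask for one frequency bin (illustrative only).

    X: (n_sensors, n_frames) complex observations in this bin.
    a: (n_sensors,) complex vector assumed to point at the target,
       e.g. taken from the output of a per-bin ICA stage (not shown here).
    theta_max: angle threshold in radians (hypothetical tuning parameter).
    """
    a_unit = a / np.linalg.norm(a)
    norms = np.linalg.norm(X, axis=0)
    norms = np.where(norms > 0, norms, 1.0)  # avoid division by zero
    # Cosine of the angle between each observation vector and the target vector.
    cos_theta = np.abs(a_unit.conj() @ X) / norms
    theta = np.arccos(np.clip(cos_theta, 0.0, 1.0))
    # Keep only the frames whose observation vector lies close to the target.
    return (theta <= theta_max).astype(float)

# Toy data: frame 0 is aligned with the target, frame 1 is orthogonal to it,
# and frame 2 is silent.
a = np.array([1.0, 1.0j])
X = np.stack([a, np.array([1.0, -1.0j]), np.zeros(2, dtype=complex)], axis=1)
mask = angle_mask(X, a, theta_max=0.5)

# Apply the mask to a stand-in for the ICA output in this bin.
y = np.array([2.0 + 0.0j, 3.0 + 0.0j, 0.0 + 0.0j])
y_masked = mask * y
```

In this toy setup only the frame aligned with `a` survives the mask. A real system would repeat this for every frequency bin and then resynthesize the time-domain signal.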
Pages: 5882-5885
Page count: 4