Audio source separation with multiple microphones on time-frequency representations

被引:0
作者
Sawada, Hiroshi [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Seika, Kyoto 6190237, Japan
来源
INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING XI | 2013年 / 8750卷
关键词
Source separation; Short-time Fourier transform; Time-frequency representation; Independent component analysis; Gaussian mixture model; Non-negative matrix factorization; Permutation problem; BLIND SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION; PERMUTATION PROBLEM; MIXTURES;
D O I
10.1117/12.2018632
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
This paper presents various source separation methods that utilize multiple microphones. We classify them into two classes. Methods that fall into the first class apply independent component analysis (ICA) or Gaussian mixture model (GMM) to frequency bin-wise observations, and then solve the permutation problem to reconstruct separated signals. The second type of method extends non-negative matrix factorization (NMF) to a multi-microphone situation, in which NMF bases are clustered according to their spatial properties. We have a unified understanding that all methods analyze a time-frequency representation with an additional microphone axis.
引用
收藏
页数:10
相关论文
共 25 条
[1]   Natural gradient works efficiently in learning [J].
Amari, S .
NEURAL COMPUTATION, 1998, 10 (02) :251-276
[2]  
Aoki M., 2001, Acoustical Science and Technology, V22, P149, DOI 10.1250/ast.22.149
[3]   Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors [J].
Araki, Shoko ;
Sawada, Hiroshi ;
Mukai, Ryo ;
Makino, Shoji .
SIGNAL PROCESSING, 2007, 87 (08) :1833-1847
[4]   SIMULTANEOUS CLUSTERING OF MIXING AND SPECTRAL MODEL PARAMETERS FOR BLIND SPARSE SOURCE SEPARATION [J].
Araki, Shoko ;
Nakatani, Tomohiro ;
Sawada, Hiroshi .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :5-8
[5]  
Cichocki A., 2009, NONNEGATIVE MATRIX T
[6]   Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis [J].
Fevotte, Cedric ;
Bertin, Nancy ;
Durrieu, Jean-Louis .
NEURAL COMPUTATION, 2009, 21 (03) :793-830
[7]  
Hiroe A, 2006, LECT NOTES COMPUT SC, V3889, P601
[8]  
Hyvärinen A, 2001, INDEPENDENT COMPONENT ANALYSIS: PRINCIPLES AND PRACTICE, P71
[9]   Blind source separation exploiting higher-order frequency dependencies [J].
Kim, Taesu ;
Attias, Hagai T. ;
Lee, Soo-Young ;
Lee, Te-Won .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01) :70-79
[10]   Learning the parts of objects by non-negative matrix factorization [J].
Lee, DD ;
Seung, HS .
NATURE, 1999, 401 (6755) :788-791