Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation

被引:16
作者
Lee, Seokjin [1 ]
Park, Sang Ha [1 ]
Sung, Koeng-Mo [1 ]
机构
[1] Seoul Natl Univ, INMC, Seoul 151472, South Korea
关键词
Acoustic signal processing; blind source separation; multichannel audio; nonnegative matrix factorization (NMF); MIXTURES;
D O I
10.1109/LSP.2011.2173192
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this letter, we develop a multichannel blind source separation algorithm based on a beamspace transform and the multichannel nonnegative matrix factorization (NMF) method. The conventional multichannel NMF algorithm performs well with multichannel mixing data, but there is still room for enhancement in multichannel real-world recording data. In this letter, we consider a beamspace-time-frequency domain data model for multichannel NMF method, and enhance the conventional method using a beamspace transform. Our decomposition algorithm is applied to 2-channel and 4-channel unsupervised audio source separation, using a dataset from the international Signal Separation Evaluation Campaign 2010 (SiSEC 2010). Our algorithm shows a better performance than the conventional NMF method in an evaluation results.
引用
收藏
页码:43 / 46
页数:4
相关论文
共 16 条
  • [1] [Anonymous], 2010, SIGNAL SEPARATION EV
  • [2] Arberet S., 2010, 2010 10th International Conference on Information Sciences, Signal Processing and their Applications (ISSPA 2010), P1, DOI 10.1109/ISSPA.2010.5605570
  • [3] FitzGerald D., 2005, P IR SIGN SYST C DUB, P8
  • [4] COMPLEX NMF: A NEW SPARSE REPRESENTATION FOR ACOUSTIC SIGNALS
    Kameoka, Hirokazu
    Ono, Nobutaka
    Kashino, Kunio
    Sagayama, Shigeki
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3437 - +
  • [5] Koh C. L., 2009, BROADBAND ADAPTIVE B
  • [6] Learning the parts of objects by non-negative matrix factorization
    Lee, DD
    Seung, HS
    [J]. NATURE, 1999, 401 (6755) : 788 - 791
  • [7] Liu W., 2010, Wideband Beamforming: Concepts and Tech-niques
  • [8] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
    Ozerov, Alexey
    Fevotte, Cedric
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563
  • [9] Parry RM, 2006, LECT NOTES COMPUT SC, V3889, P666
  • [10] Smaragdis P, 2003, 2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS, P177, DOI 10.1109/ASPAA.2003.1285860