Blind audio source counting and separation of anechoic mixtures using the multichannel complex NMF framework

被引:22
|
作者
Mirzaei, Sayeh [1 ]
Van hamme, Hugo [1 ]
Norouzi, Yaser [2 ]
机构
[1] KULeuven, Dept Elect Engn, Leuven, Belgium
[2] Amirkabir Univ, Dept Elect Engn, Tehran, Iran
关键词
Blind Source Separation (BSS); Complex Non-negative Matrix Factorization (CNMF); Binary masking; Anechoic mixture; NONNEGATIVE MATRIX FACTORIZATION; PERMUTATION PROBLEM; SPARSENESS; ROBUST;
D O I
10.1016/j.sigpro.2015.03.006
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we address the tasks of audio source counting and separation for a stereo anechoic mixture of audio signals. This will be achieved in two stages. In the first stage, a novel approach is introduced for estimating the number of sources as well as the channel mixing coefficients. For this purpose, a 2-D spectrum is evaluated against both the phase and amplitude differences of the two channels. Hence, obtaining the peak locations of the spectrum yields the number of the sources and the corresponding channel coefficients. In the second stage, an extension of a single channel complex matrix factorization method to multichannel is developed to extract the individual source signals. We find primary estimates of the sources via binary masking and then apply the complex factorization to the complex spectrogram of each source. The obtained factors are then utilized as initial values in the complex multichannel factorization model. We also suggest a method for estimating the number of required components for modeling each source. The separation performance improvement over the conventional methods is investigated by calculating BSS evaluation metrics. The comparison is also carried out in terms of source counting and localization with the recently proposed DeMIX-Anechoic method. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:27 / 37
页数:11
相关论文
共 50 条
  • [41] ALPHA-STABLE MULTICHANNEL AUDIO SOURCE SEPARATION
    Leglaive, Simon
    Simsekli, Umut
    Liutkus, Antoine
    Badeau, Roland
    Richard, Gael
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 576 - 580
  • [42] Extended Semantic Initialization for NMF-based Audio Source Separation
    Rohlfing, Christian
    Becker, Julian M.
    2015 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2015, : 95 - 100
  • [43] Joint Multichannel Deconvolution and Blind Source Separation
    Jiang, Ming
    Bobin, Jerome
    Starck, Jean-Luc
    SIAM JOURNAL ON IMAGING SCIENCES, 2017, 10 (04): : 1997 - 2021
  • [44] Blind source separation of multichannel neuromagnetic responses
    Tang, AC
    Pearlmutter, BA
    Zibulevsky, M
    Carter, SA
    NEUROCOMPUTING, 2000, 32 (32-33) : 1115 - 1120
  • [45] Estimation of propagation delays using orientation histograms for anechoic blind source separation.
    Yamashita, J
    Tatsuta, S
    Hirai, Y
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 2175 - 2180
  • [46] Fetal QRS Complex Detection using Semi-Blind Source Separation Framework
    Razavipour, Fatemeh
    Haghpanahi, Masoumeh
    Sameni, Reza
    2013 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), 2013, 40 : 181 - 184
  • [47] A MULTICHANNEL MMSE-BASED FRAMEWORK FOR JOINT BLIND SOURCE SEPARATION AND NOISE REDUCTION
    Souden, Mehrez
    Araki, Shoko
    Kinoshita, Keisuke
    Nakatani, Tomohiro
    Sawada, Hiroshi
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 109 - 112
  • [48] Performance measurement in blind audio source separation
    Vincent, Emmanuel
    Gribonval, Remi
    Févotte, Cedric
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04): : 1462 - 1469
  • [49] An improved technique for blind audio source separation
    Cho, Namgook
    Shiu, Yu
    Kuo, C. -C. Jay
    IIH-MSP: 2006 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, PROCEEDINGS, 2006, : 525 - +
  • [50] Blind Source Separation for Convolutive Audio Mixing
    Rosebell, V. Jerine Rini
    Sugumar, D.
    Shindu
    Sherin
    INFORMATION TECHNOLOGY AND MOBILE COMMUNICATION, 2011, 147 : 473 - 476