Blind audio source counting and separation of anechoic mixtures using the multichannel complex NMF framework

被引:22
|
作者
Mirzaei, Sayeh [1 ]
Van hamme, Hugo [1 ]
Norouzi, Yaser [2 ]
机构
[1] KULeuven, Dept Elect Engn, Leuven, Belgium
[2] Amirkabir Univ, Dept Elect Engn, Tehran, Iran
关键词
Blind Source Separation (BSS); Complex Non-negative Matrix Factorization (CNMF); Binary masking; Anechoic mixture; NONNEGATIVE MATRIX FACTORIZATION; PERMUTATION PROBLEM; SPARSENESS; ROBUST;
D O I
10.1016/j.sigpro.2015.03.006
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we address the tasks of audio source counting and separation for a stereo anechoic mixture of audio signals. This will be achieved in two stages. In the first stage, a novel approach is introduced for estimating the number of sources as well as the channel mixing coefficients. For this purpose, a 2-D spectrum is evaluated against both the phase and amplitude differences of the two channels. Hence, obtaining the peak locations of the spectrum yields the number of the sources and the corresponding channel coefficients. In the second stage, an extension of a single channel complex matrix factorization method to multichannel is developed to extract the individual source signals. We find primary estimates of the sources via binary masking and then apply the complex factorization to the complex spectrogram of each source. The obtained factors are then utilized as initial values in the complex multichannel factorization model. We also suggest a method for estimating the number of required components for modeling each source. The separation performance improvement over the conventional methods is investigated by calculating BSS evaluation metrics. The comparison is also carried out in terms of source counting and localization with the recently proposed DeMIX-Anechoic method. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:27 / 37
页数:11
相关论文
共 50 条
  • [31] Music retiler: Using NMF2D source separation for audio mosaicing
    Aarabi, Hadrien Foroughmand
    Peeters, Geoffroy
    2018 CONFERENCE ON INTERACTION WITH SOUND (AUDIO MOSTLY): SOUND IN IMMERSION AND EMOTION (AM'18), 2018,
  • [32] BLIND AUDIO SOURCE SEPARATION OF STEREO MIXTURES USING BAYESIAN NON-NEGATIVE MATRIX FACTORIZATION
    Mirzaei, S.
    Van Hamme, H.
    Norouzi, Y.
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 621 - 625
  • [33] Audio source separation of convolutive mixtures
    Mitianoudis, N
    Davies, ME
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 489 - 497
  • [34] Blind separation of anechoic under-determined speech mixtures using multiple sensors
    Saab, Rayan
    Yilmaz, Ozgur
    McKeown, Martin J.
    Abugharbieh, Rafeef
    2006 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2006, : 642 - 646
  • [35] Underdetermined Blind Audio Source Separation Using Modal Decomposition
    Abdeldjalil Aïssa-El-Bey
    Karim Abed-Meraim
    Yves Grenier
    EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [36] Underdetermined Blind Audio Source Separation Using Modal Decomposition
    Aissa-El-Bey, Abdeldjalil
    Abed-Meraim, Karim
    Grenier, Yves
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
  • [37] Blind Audio Source Separation Using Wiener Filtering Approach
    Sharma, Pardeep
    Mehra, Rajesh
    Dubey, Naveen
    2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,
  • [38] Multichannel Audio Source Separation With Probabilistic Reverberation Priors
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2453 - 2465
  • [39] MULTICHANNEL AUDIO SOURCE SEPARATION WITH PROBABILISTIC REVERBERATION MODELING
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [40] Multichannel Audio Source Separation With Deep Neural Networks
    Nugraha, Aditya Arie
    Liutkus, Antoine
    Vincent, Emmanuel
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1652 - 1664