Blind audio source counting and separation of anechoic mixtures using the multichannel complex NMF framework

被引:22
|
作者
Mirzaei, Sayeh [1 ]
Van hamme, Hugo [1 ]
Norouzi, Yaser [2 ]
机构
[1] KULeuven, Dept Elect Engn, Leuven, Belgium
[2] Amirkabir Univ, Dept Elect Engn, Tehran, Iran
关键词
Blind Source Separation (BSS); Complex Non-negative Matrix Factorization (CNMF); Binary masking; Anechoic mixture; NONNEGATIVE MATRIX FACTORIZATION; PERMUTATION PROBLEM; SPARSENESS; ROBUST;
D O I
10.1016/j.sigpro.2015.03.006
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we address the tasks of audio source counting and separation for a stereo anechoic mixture of audio signals. This will be achieved in two stages. In the first stage, a novel approach is introduced for estimating the number of sources as well as the channel mixing coefficients. For this purpose, a 2-D spectrum is evaluated against both the phase and amplitude differences of the two channels. Hence, obtaining the peak locations of the spectrum yields the number of the sources and the corresponding channel coefficients. In the second stage, an extension of a single channel complex matrix factorization method to multichannel is developed to extract the individual source signals. We find primary estimates of the sources via binary masking and then apply the complex factorization to the complex spectrogram of each source. The obtained factors are then utilized as initial values in the complex multichannel factorization model. We also suggest a method for estimating the number of required components for modeling each source. The separation performance improvement over the conventional methods is investigated by calculating BSS evaluation metrics. The comparison is also carried out in terms of source counting and localization with the recently proposed DeMIX-Anechoic method. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:27 / 37
页数:11
相关论文
共 50 条
  • [1] A novel Directional Framework for Source Counting and Source Separation in Instantaneous Underdetermined Audio Mixtures
    Sgouros, Thomas
    Mitianoudis, Nikolaos
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2025 - 2035
  • [2] Multichannel Audio Source Separation Exploiting NMF-Based Generic Source Spectral Model in Gaussian Modeling Framework
    Thanh Thi Hien Duong
    Duong, Ngoc Q. K.
    Cong-Phuong Nguyen
    Quoc-Cuong Nguyen
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 547 - 557
  • [3] Improvements in Blind Source Separation of Anechoic Underdetermined Speech Mixtures
    Pires Filho, Jorge Costa
    Petraglia, Mariane Rembold
    2014 INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM (ITS), 2014,
  • [4] Semi-Blind Student's t Source Separation for Multichannel Audio Convolutive Mixtures
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2259 - 2263
  • [5] UNDERDETERMINED AUDIO SOURCE SEPARATION FROM ANECHOIC MIXTURES WITH LONG TIME DELAY
    Cho, Namgook
    Kuo, C. -C Jay
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1557 - +
  • [6] Anechoic Blind Source Separation Using Wigner Marginals
    Omlor, Lars
    Giese, Martin A.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 1111 - 1148
  • [7] MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION IN CONVOLUTIVE MIXTURES. WITH APPLICATION TO BLIND AUDIO SOURCE SEPARATION.
    Ozerov, Alexey
    Fevotte, Cedric
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3137 - +
  • [8] Two-stage blind audio source counting and separation of stereo instantaneous mixtures using Bayesian tensor factorisation
    Mirzaei, Sayeh
    Norouzi, Yaser
    Van Hamme, Hugo
    IET SIGNAL PROCESSING, 2015, 9 (08) : 587 - 595
  • [9] MULTICHANNEL NMF FOR SOURCE SEPARATION WITH AMBISONIC SIGNALS
    Nikunen, Joonas
    Politis, Archontis
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 251 - 255
  • [10] Multichannel blind deconvolution for source separation in convolutive mixtures of speech
    Kokkinakis, K
    Nandi, AK
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 200 - 212