Blind audio source counting and separation of anechoic mixtures using the multichannel complex NMF framework

被引:22
|
作者
Mirzaei, Sayeh [1 ]
Van hamme, Hugo [1 ]
Norouzi, Yaser [2 ]
机构
[1] KULeuven, Dept Elect Engn, Leuven, Belgium
[2] Amirkabir Univ, Dept Elect Engn, Tehran, Iran
关键词
Blind Source Separation (BSS); Complex Non-negative Matrix Factorization (CNMF); Binary masking; Anechoic mixture; NONNEGATIVE MATRIX FACTORIZATION; PERMUTATION PROBLEM; SPARSENESS; ROBUST;
D O I
10.1016/j.sigpro.2015.03.006
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we address the tasks of audio source counting and separation for a stereo anechoic mixture of audio signals. This will be achieved in two stages. In the first stage, a novel approach is introduced for estimating the number of sources as well as the channel mixing coefficients. For this purpose, a 2-D spectrum is evaluated against both the phase and amplitude differences of the two channels. Hence, obtaining the peak locations of the spectrum yields the number of the sources and the corresponding channel coefficients. In the second stage, an extension of a single channel complex matrix factorization method to multichannel is developed to extract the individual source signals. We find primary estimates of the sources via binary masking and then apply the complex factorization to the complex spectrogram of each source. The obtained factors are then utilized as initial values in the complex multichannel factorization model. We also suggest a method for estimating the number of required components for modeling each source. The separation performance improvement over the conventional methods is investigated by calculating BSS evaluation metrics. The comparison is also carried out in terms of source counting and localization with the recently proposed DeMIX-Anechoic method. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:27 / 37
页数:11
相关论文
共 50 条
  • [21] JOINT AUDIO SOURCE LOCALIZATION AND SEPARATION WITH DISTRIBUTED MICROPHONE ARRAYS BASED ON SPATIALLY-REGULARIZED MULTICHANNEL NMF
    Sumura, Yoshiaki
    Di Carlo, Diego
    Nugraha, Aditya Arie
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 145 - 149
  • [22] Minimum-Volume Multichannel Nonnegative Matrix Factorization for Blind Audio Source Separation
    Wang, Jianyu
    Guan, Shanzheng
    Liu, Shupei
    Zhang, Xiao-Lei
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 3089 - 3103
  • [23] Blind source separation and multichannel deconvolution
    De Lathauwer, L
    Comon, P
    SIGNAL PROCESSING, 1999, 73 (1-2) : 1 - 2
  • [24] A Joint Diagonalization Based Efficient Approach to Underdetermined Blind Audio Source Separation Using the Multichannel Wiener Filter
    Ito, Nobutaka
    Ikeshita, Rintaro
    Sawada, Hiroshi
    Nakatani, Tomohiro
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1950 - 1965
  • [25] COMPLEX NMF UNDER PHASE CONSTRAINTS BASED ON SIGNAL MODELING: APPLICATION TO AUDIO SOURCE SEPARATION
    Magron, Paul
    Badeau, Roland
    David, Bertrand
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 46 - 50
  • [26] PHASE RECOVERY IN NMF FOR AUDIO SOURCE SEPARATION: AN INSIGHTFUL BENCHMARK
    Magron, Paul
    Badeau, Roland
    David, Bertrand
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 81 - 85
  • [27] Incremental Approach to NMF Basis Estimation for Audio Source Separation
    Kwon, Kisoo
    Shin, Jong Won
    Choi, Inkyu
    Kim, Hyung Yong
    Kim, Nam Soo
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [28] Multichannel blind source separation using convolution kernel compensation
    Holobar, Ales
    Zazula, Damjan
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (09) : 4487 - 4496
  • [29] Initialization for NMF-Based Audio Source Separation Using Priors on Encoding Vectors
    Byun, Jacuk
    Shin, Jong Won
    CHINA COMMUNICATIONS, 2019, 16 (09) : 177 - 186
  • [30] Initialization for NMF-Based Audio Source Separation Using Priors on Encoding Vectors
    Jaeuk Byun
    Jong Won Shin
    中国通信, 2019, 16 (09) : 177 - 186