SPECTRO-TEMPORAL POST-SMOOTHING IN NMF BASED SINGLE-CHANNEL SOURCE SEPARATION

被引:0
作者
Grais, Emad M. [1 ]
Erdogan, Hakan [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, TR-34956 Istanbul, Turkey
来源
2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2012年
关键词
Single channel source separation; nonnegative matrix factorization; and speech-music separation; NONNEGATIVE MATRIX FACTORIZATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a new, simple, fast, and effective method to enforce temporal smoothness on nonnegative matrix factorization (NMF) solutions by post-smoothing the NMF decomposition results. In NMF based single-channel source separation, NMF is used to decompose the magnitude spectra of the mixed signal as a weighted linear combination of the trained basis vectors. The decomposition results are used to build spectral masks. To get temporal smoothness of the estimated sources, we deal with the spectral masks as 2-D images, and we pass the masks through a smoothing filter. The smoothing direction of the filter is the time direction of the spectral masks. The smoothed masks are used to find estimates for the source signals. Experimental results show that, using the smoothed masks give better separation results than enforcing temporal smoothness prior using regularized NMF.
引用
收藏
页码:584 / 588
页数:5
相关论文
共 48 条
  • [21] Synthesizing the note-specific atoms based on their fundamental frequency, used for single-channel musical source separation
    Azamian, Mohammadali
    Kabir, Ehsanollah
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (13) : 17929 - 17948
  • [22] Single-Channel Speech Separation Based on Deep Clustering with Local Optimization
    Fu, Taotao
    Yu, Ge
    Guo, Lili
    Wang, Yan
    Liang, Ji
    2017 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF SIGNAL PROCESSING (ICFSP), 2017, : 44 - 49
  • [23] Online Noisy Single-Channel Source Separation Using Adaptive Spectrum Amplitude Estimator and Masking
    Tengtrairat, N.
    Woo, W. L.
    Dlay, S. S.
    Gao, Bin
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2016, 64 (07) : 1881 - 1895
  • [24] Single-Channel Source Separation Using EMD-Subband Variable Regularized Sparse Features
    Gao, Bin
    Woo, W. L.
    Dlay, S. S.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 961 - 976
  • [25] Unsupervised single-channel music source separation by average harmonic structure modeling
    Duan, Zhiyao
    Zhang, Yungang
    Zhang, Changshui
    Shi, Zhenwei
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (04): : 766 - 778
  • [26] Perceptually enhanced blind single-channel music source separation by Non-negative Matrix Factorization
    Kirbiz, S.
    Gunsel, B.
    DIGITAL SIGNAL PROCESSING, 2013, 23 (02) : 646 - 658
  • [27] Adaptive Sparsity Non-Negative Matrix Factorization for Single-Channel Source Separation
    Gao, Bin
    Woo, W. L.
    Dlay, S. S.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (05) : 989 - 1001
  • [28] SINGLE-CHANNEL MUSIC SOURCE SEPARATION BY HARMONIC STRUCTURE MODEL AND SUPPORT VECTOR MACHINE
    Fang J.-T.
    Yang C.-W.
    International Journal of Electrical Engineering, 2022, 29 (02): : 43 - 51
  • [29] Informed Single-Channel Speech Separation Using HMM-GMM User-Generated Exemplar Source
    Wang, Qi
    Woo, W. L.
    Dlay, S. S.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 2087 - 2100
  • [30] Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs
    Ozerov, Alexey
    Philippe, Pierrick
    Bimbot, Frederic
    Gribonval, Remi
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05): : 1564 - 1578