Stereophonic Music Separation Based on Non-negative Tensor Factorization with Cepstrum Regularization

Citations: 0
Authors
Seki, Shogo [1]
Toda, Tomoki [2]
Takeda, Kazuya [1]
Affiliations
[1] Nagoya Univ, Grad Sch Informat Sci, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan
[2] Nagoya Univ, Ctr Informat Technol, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan
Keywords
AUDIO SOURCE SEPARATION; MATRIX FACTORIZATION; MIXTURES
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract
This paper presents a novel approach to stereophonic music separation based on Non-negative Tensor Factorization (NTF). Stereophonic music can be roughly divided into two types, recorded music and synthesized music; this paper focuses on synthesized music. Synthesized music signals are often generated as linear combinations of many individual source signals, each weighted by a mixing gain (i.e., a time-invariant amplitude scaling) for each channel. Separating synthesized stereophonic music is therefore an underdetermined source separation problem in which phase components are not helpful. NTF is an effective technique for this problem: it decomposes the amplitude spectrograms of the stereo signal into basis vectors and activations of the individual music source signals, together with their corresponding mixing gains. However, it is inherently difficult to obtain sufficient separation performance because the acoustic cues available for separation are limited. To address this issue, we propose a cepstrum regularization method for NTF-based stereo channel separation. The proposed method makes the separated music source signals follow Gaussian mixture models of the corresponding individual music sources, which are trained in advance on available samples of those sources. An experimental evaluation using real music signals investigates the effectiveness of the proposed method in both supervised and unsupervised separation frameworks. The results demonstrate that the proposed method yields significant improvements in separation performance in both frameworks.
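The decomposition described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the squared Euclidean cost, the multiplicative update rules, and the names `ntf_stereo`, `Q`, `W`, and `H` are assumptions chosen for illustration, and the proposed cepstrum regularization term is omitted. The sketch approximates a (channel, frequency, time) amplitude tensor as X[c, f, t] ≈ Σ_k Q[c, k] W[f, k] H[k, t], where Q holds mixing gains, W basis vectors, and H activations.

```python
import numpy as np


def reconstruct(Q, W, H):
    """Model tensor V[c, f, t] = sum_k Q[c, k] * W[f, k] * H[k, t]."""
    return np.einsum("ck,fk,kt->cft", Q, W, H)


def ntf_stereo(X, n_components, n_iter=200, eps=1e-12, seed=0):
    """Plain NTF with multiplicative updates (squared Euclidean cost).

    X: non-negative array of shape (channels, freq_bins, frames).
    Returns gains Q (channels, K), bases W (freq_bins, K),
    activations H (K, frames). No regularization is applied.
    """
    rng = np.random.default_rng(seed)
    n_ch, n_freq, n_frames = X.shape
    Q = rng.random((n_ch, n_components)) + eps
    W = rng.random((n_freq, n_components)) + eps
    H = rng.random((n_components, n_frames)) + eps
    for _ in range(n_iter):
        # Update mixing gains, holding bases and activations fixed.
        V = reconstruct(Q, W, H)
        Q *= np.einsum("cft,fk,kt->ck", X, W, H) / (
            np.einsum("cft,fk,kt->ck", V, W, H) + eps)
        # Update basis vectors.
        V = reconstruct(Q, W, H)
        W *= np.einsum("cft,ck,kt->fk", X, Q, H) / (
            np.einsum("cft,ck,kt->fk", V, Q, H) + eps)
        # Update activations.
        V = reconstruct(Q, W, H)
        H *= np.einsum("cft,ck,fk->kt", X, Q, W) / (
            np.einsum("cft,ck,fk->kt", V, Q, W) + eps)
    return Q, W, H


def relative_error(X, Q, W, H):
    """Frobenius-norm reconstruction error relative to the norm of X."""
    return np.linalg.norm(X - reconstruct(Q, W, H)) / np.linalg.norm(X)
```

In practice `X` would hold the amplitude spectrograms of the left and right channels stacked along the first axis. Because the updates are multiplicative and the initial values are positive, all three factors stay non-negative throughout.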
Pages: 981 - 985
Number of pages: 5
Related papers
50 in total
  • [11] Non-negative Tensor Factorization for Speech Enhancement
    He, Liang
    Zhang, Weiqiang
    Shi, Mengnan
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2016, 127
  • [12] Non-negative Matrix Factorization with Symmetric Manifold Regularization
    Yang, Shangming
    Liu, Yongguo
    Li, Qiaoqin
    Yang, Wen
    Zhang, Yi
    Wen, Chuanbiao
    NEURAL PROCESSING LETTERS, 2020, 51 (01) : 723 - 748
  • [13] Non-negative Matrix Factorization with Symmetric Manifold Regularization
    Shangming Yang
    Yongguo Liu
    Qiaoqin Li
    Wen Yang
    Yi Zhang
    Chuanbiao Wen
    Neural Processing Letters, 2020, 51 : 723 - 748
  • [14] Single Channel Music and Speech Separation Using Non-negative Matrix Factorization
    Yidirim, Sinan
    Saraclar, Murat
    2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 543 - 546
  • [15] Graph-based non-negative tensor factorization for image classification
    Luo, B. (luobin@ahu.edu.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (10):
  • [16] MULTICHANNEL AUDIO UPMIXING BASED ON NON-NEGATIVE TENSOR FACTORIZATION REPRESENTATION
    Nikunen, J.
    Virtanen, T.
    Vilermo, M.
    2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 33 - 36
  • [17] SOUND SOURCE SEPARATION BASED ON NON-NEGATIVE TENSOR FACTORIZATION INCORPORATING SPATIAL CUE AS PRIOR KNOWLEDGE
    Mitsufuji, Yuki
    Roebel, Axel
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 71 - 75
  • [18] FacetCube: a general framework for non-negative tensor factorization
    Chi, Yun
    Zhu, Shenghuo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 37 (01) : 155 - 179
  • [19] A multiresolution non-negative tensor factorization approach for single channel sound source separation
    Kirbiz, S.
    Gunsel, B.
    SIGNAL PROCESSING, 2014, 105 : 56 - 69
  • [20] FacetCube: a general framework for non-negative tensor factorization
    Yun Chi
    Shenghuo Zhu
    Knowledge and Information Systems, 2013, 37 : 155 - 179