Stereophonic Music Separation Based on Non-negative Tensor Factorization with Cepstrum Regularization

Authors
Seki, Shogo [1]
Toda, Tomoki [2]
Takeda, Kazuya [1]
Affiliations
[1] Nagoya Univ, Grad Sch Informat Sci, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan
[2] Nagoya Univ, Ctr Informat Technol, Chikusa Ku, Furo Cho, Nagoya, Aichi 4648601, Japan
Keywords
AUDIO SOURCE SEPARATION; MATRIX FACTORIZATION; MIXTURES
DOI
Not available
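Chinese Library Classification number
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification code
0808; 0809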
Abstract
This paper presents a novel approach to stereophonic music separation based on Non-negative Tensor Factorization (NTF). Stereophonic music can be roughly divided into two types, recorded music and synthesized music; this paper focuses on the latter. Synthesized music signals are often generated as linear combinations of many individual source signals, each weighted by a mixing gain (i.e., a time-invariant amplitude scaling) for each channel. Separating synthesized stereophonic music is therefore an underdetermined source separation problem in which phase components are not helpful. NTF is an effective technique for this problem: it decomposes the amplitude spectrograms of the stereo-channel music signal into basis vectors and activations of the individual music source signals, together with their corresponding mixing gains. However, it is inherently difficult to obtain sufficient separation performance in this setting because the acoustic cues available for separation are limited. To address this issue, we propose a cepstrum regularization method for NTF-based stereo-channel separation. The proposed method constrains the separated music source signals to follow Gaussian mixture models of the corresponding individual sources, which are trained in advance on available samples. An experimental evaluation using real music signals is conducted to investigate the effectiveness of the proposed method in both supervised and unsupervised separation frameworks. The experimental results demonstrate that the proposed method yields significant improvements in separation performance in both frameworks.
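The decomposition described in the abstract can be illustrated with a minimal sketch. It assumes a plain NTF model in which the stereo amplitude spectrogram X[c, f, t] is approximated by the sum over components k of G[c, k] * W[f, k] * H[k, t], with mixing gains G, basis vectors W, and activations H. The Euclidean cost, the multiplicative update rules, the function name `ntf`, and the synthetic input are all assumptions made for illustration; the abstract does not state which divergence the authors use, and the proposed cepstrum regularization is not implemented here.

```python
import numpy as np

def ntf(X, n_components, n_iter=200, eps=1e-12, seed=0):
    """Plain NTF of a non-negative tensor X with shape (channels, freq, time).

    Illustrative only: Euclidean cost with multiplicative updates,
    no cepstrum regularization. Returns gains G (channels x K),
    bases W (freq x K), activations H (K x time), and the squared
    reconstruction error after each iteration.
    """
    rng = np.random.default_rng(seed)
    C, F, T = X.shape
    # Strictly positive random initialization keeps the updates well defined.
    G = rng.random((C, n_components)) + eps
    W = rng.random((F, n_components)) + eps
    H = rng.random((n_components, T)) + eps

    def model():
        # Xhat[c, f, t] = sum_k G[c, k] * W[f, k] * H[k, t]
        return np.einsum("ck,fk,kt->cft", G, W, H)

    errors = []
    for _ in range(n_iter):
        # Each factor is scaled by (data correlation) / (model correlation),
        # which preserves non-negativity.
        Xhat = model()
        G *= np.einsum("cft,fk,kt->ck", X, W, H) / (
            np.einsum("cft,fk,kt->ck", Xhat, W, H) + eps)
        Xhat = model()
        W *= np.einsum("cft,ck,kt->fk", X, G, H) / (
            np.einsum("cft,ck,kt->fk", Xhat, G, H) + eps)
        Xhat = model()
        H *= np.einsum("cft,ck,fk->kt", X, G, W) / (
            np.einsum("cft,ck,fk->kt", Xhat, G, W) + eps)
        errors.append(float(np.sum((X - model()) ** 2)))
    return G, W, H, errors

# Synthetic stand-in for a stereo amplitude spectrogram (hypothetical data):
# 2 channels, 64 frequency bins, 40 frames, built from 3 non-negative sources.
rng = np.random.default_rng(1)
G_true = rng.random((2, 3))
W_true = rng.random((64, 3))
H_true = rng.random((3, 40))
X = np.einsum("ck,fk,kt->cft", G_true, W_true, H_true)

G, W, H, errors = ntf(X, n_components=3)
print("squared error after first and last iteration:", errors[0], errors[-1])
```

In this sketch each column of G holds the left and right gains of one component, so components with similar gain patterns could be grouped into one source. With only two channels such gain cues are weak, which is consistent with the abstract's motivation for adding a prior on the separated sources.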
Pages: 981-985
Number of pages: 5