Singing Voice Separation Using RPCA with Weighted l1-norm

被引:15
作者
Jeong, Il-Young [1 ]
Lee, Kyogu [1 ]
机构
[1] Seoul Natl Univ, Mus & Audio Res Grp, 1 Gwanak Ro, Seoul 08826, South Korea
来源
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017) | 2017年 / 10169卷
关键词
Singing voice separation; Robust principal component analysis; Weighted l(1)-norm minimization; MONAURAL RECORDINGS; SPARSITY;
D O I
10.1007/978-3-319-53547-0_52
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present an extension of robust principal component analysis (RPCA) with weighted l(1)-norm minimization for singing voice separation. While the conventional RPCA applies a uniform weight between the low-rank and sparse matrices, we use different weighting parameters for each frequency bin in a spectrogram by estimating the variance ratio between the singing voice and accompaniment. In addition, we incorporate the results of vocal activation detection into the formation of the weighting matrix, and use it in the final decomposition framework. From the experimental results using the DSD100 dataset, we found that proposed algorithm yields a meaningful improvement in the separation performance compared to the conventional RPCA.
引用
收藏
页码:553 / 562
页数:10
相关论文
共 50 条
  • [31] Separation of Singing Voice Using Nonnegative Matrix Partial Co-Factorization for Singer Identification
    Hu, Ying
    Liu, Guizhong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (04) : 643 - 653
  • [32] Unsupervised Singing Voice Separation from Music Accompaniment Using Robust Principal Componenet Analysis
    Umap, Priyanka. K.
    Chaudhari, Kirti. B.
    Joshi, Madhuri A.
    2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL INSTRUMENTATION AND CONTROL (ICIC), 2015, : 1433 - 1436
  • [33] SINGING-VOICE SEPARATION FROM MONAURAL RECORDINGS USING ROBUST PRINCIPAL COMPONENT ANALYSIS
    Huang, Po-Sen
    Chen, Scott Deeann
    Smaragdis, Paris
    Hasegawa-Johnson, Mark
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 57 - 60
  • [34] Convergence analysis of sparse LMS algorithms with l1-norm penalty based on white input signal
    Shi, Kun
    Shi, Peng
    SIGNAL PROCESSING, 2010, 90 (12) : 3289 - 3293
  • [35] A Trace Lasso Regularized L1-norm Graph Cut for Highly Correlated Noisy Hyperspectral Image
    Mohanty, Ramanarayan
    Happy, S. L.
    Suthar, Nilesh
    Routray, Aurobinda
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2220 - 2224
  • [36] Reconstruction method for fluorescence molecular tomography based on L1-norm primal accelerated proximal gradient
    Liu, Yuhao
    Jiang, Shixin
    Liu, Jie
    An, Yu
    Zhang, Guanglei
    Gao, Yuan
    Wang, Kun
    Tian, Jie
    JOURNAL OF BIOMEDICAL OPTICS, 2018, 23 (08)
  • [37] Singing Voice Separation for Mono-Channel Music Using Non-negative Matrix Factorization
    Chanrungutai, Angkana
    Ratanamahatana, Chotirat Ann
    2008 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS, PROCEEDINGS, 2008, : 247 - 250
  • [38] A Two-stage Singing Voice Separation Algorithm Using Spectro-temporal Modulation Features
    Yen, Frederick Z.
    Huang, Mao-Chang
    Chi, Tai-Shih
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3321 - 3324
  • [39] Blind monaural singing voice separation using rank-1 constraint robust principal component analysis and vocal activity detection
    Li, Feng
    Akagi, Masato
    NEUROCOMPUTING, 2019, 350 : 44 - 52
  • [40] Fast Saddle-Point Algorithm for Generalized Dantzig Selector and FDR Control with the Ordered l1-Norm
    Lee, Sangkyun
    Brzyski, Damian
    Bogdan, Malgorzata
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 780 - 789