Singing Voice Separation Using RPCA with Weighted l1-norm

被引：15

作者：

Jeong, Il-Young ^{[1
]}

Lee, Kyogu ^{[1
]}

机构：

[1] Seoul Natl Univ, Mus & Audio Res Grp, 1 Gwanak Ro, Seoul 08826, South Korea

来源：

LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017) | 2017年 / 10169卷

关键词：

Singing voice separation; Robust principal component analysis; Weighted l(1)-norm minimization; MONAURAL RECORDINGS; SPARSITY;

D O I：

10.1007/978-3-319-53547-0_52

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we present an extension of robust principal component analysis (RPCA) with weighted l(1)-norm minimization for singing voice separation. While the conventional RPCA applies a uniform weight between the low-rank and sparse matrices, we use different weighting parameters for each frequency bin in a spectrogram by estimating the variance ratio between the singing voice and accompaniment. In addition, we incorporate the results of vocal activation detection into the formation of the weighting matrix, and use it in the final decomposition framework. From the experimental results using the DSD100 dataset, we found that proposed algorithm yields a meaningful improvement in the separation performance compared to the conventional RPCA.

引用

页码：553 / 562

页数：10

共 50 条

[31] Separation of Singing Voice Using Nonnegative Matrix Partial Co-Factorization for Singer Identification
Hu, Ying
Liu, Guizhong
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (04) : 643 - 653
[32] Unsupervised Singing Voice Separation from Music Accompaniment Using Robust Principal Componenet Analysis
Umap, Priyanka. K.
Chaudhari, Kirti. B.
Joshi, Madhuri A.
2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL INSTRUMENTATION AND CONTROL (ICIC), 2015, : 1433 - 1436
[33] SINGING-VOICE SEPARATION FROM MONAURAL RECORDINGS USING ROBUST PRINCIPAL COMPONENT ANALYSIS
Huang, Po-Sen
Chen, Scott Deeann
Smaragdis, Paris
Hasegawa-Johnson, Mark
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 57 - 60
[34] Convergence analysis of sparse LMS algorithms with l1-norm penalty based on white input signal
Shi, Kun
Shi, Peng
SIGNAL PROCESSING, 2010, 90 (12) : 3289 - 3293
[35] A Trace Lasso Regularized L1-norm Graph Cut for Highly Correlated Noisy Hyperspectral Image
Mohanty, Ramanarayan
Happy, S. L.
Suthar, Nilesh
Routray, Aurobinda
2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2220 - 2224
[36] Reconstruction method for fluorescence molecular tomography based on L1-norm primal accelerated proximal gradient
Liu, Yuhao
Jiang, Shixin
Liu, Jie
An, Yu
Zhang, Guanglei
Gao, Yuan
Wang, Kun
Tian, Jie
JOURNAL OF BIOMEDICAL OPTICS, 2018, 23 (08)
[37] Singing Voice Separation for Mono-Channel Music Using Non-negative Matrix Factorization
Chanrungutai, Angkana
Ratanamahatana, Chotirat Ann
2008 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS, PROCEEDINGS, 2008, : 247 - 250
[38] A Two-stage Singing Voice Separation Algorithm Using Spectro-temporal Modulation Features
Yen, Frederick Z.
Huang, Mao-Chang
Chi, Tai-Shih
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3321 - 3324
[39] Blind monaural singing voice separation using rank-1 constraint robust principal component analysis and vocal activity detection
Li, Feng
Akagi, Masato
NEUROCOMPUTING, 2019, 350 : 44 - 52
[40] Fast Saddle-Point Algorithm for Generalized Dantzig Selector and FDR Control with the Ordered l1-Norm
Lee, Sangkyun
Brzyski, Damian
Bogdan, Malgorzata
ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 780 - 789

← 1 2 3 4 5 →