Informed Group-Sparse Representation for Singing Voice Separation

Cited by: 10
Authors
Chan, Tak-Shing T. [1 ]
Yang, Yi-Hsuan [1 ]
Affiliations
[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 11564, Taiwan
Keywords
Group-sparse representation (GSR); informed source separation; low-rank representation (LRR); singing voice separation (SVS); RECORDINGS; REGRESSION; SHRINKAGE; SELECTION; BLIND;
DOI
10.1109/LSP.2017.2647810
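Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract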
Singing voice separation attempts to separate the vocal and instrumental parts of a music recording, which is a fundamental problem in music information retrieval. Recent work on singing voice separation has shown that the low-rank representation and informed separation approaches are both able to improve separation quality. However, low-rank optimizations are computationally inefficient due to the use of singular value decompositions. Therefore, in this letter, we propose a new linear-time algorithm called informed group-sparse representation, and use it to separate the vocals from music using pitch annotations as side information. Experimental results on the iKala dataset confirm the efficacy of our approach, suggesting that the music accompaniment follows a group-sparse structure given a pretrained instrumental dictionary. We also show how our work can be easily extended to accommodate multiple dictionaries using the DSD100 dataset.
Pages: 156-160
Page count: 5