RPCA-DRNN technique for monaural singing voice separation

Cited by: 8

Authors:

Lai, Wen-Hsing [1]

Wang, Siou-Lin [2]

Affiliations:

[1] Natl Kaohsiung Univ Sci & Technol, Dept Comp & Commun Engn, Kaohsiung 824005, Taiwan

[2] Natl Kaohsiung Univ Sci & Technol, Coll Engn, PhD Program Engn Sci & Technol, Kaohsiung 824005, Taiwan

Source:

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING | 2022 / Vol. 2022 / Issue 01

Keywords:

Singing separation; Robust principal component analysis; Deep recurrent neural network; Stacked recurrent neural network; FACTORIZATION; OPTIMIZATION; ENHANCEMENT; CONTINUITY; MODELS;

DOI:

10.1186/s13636-022-00236-9

Chinese Library Classification number:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

In this study, we propose a method for separating a singing voice from the musical accompaniment in a monaural mixture. The method first decomposes the mixture using robust principal component analysis (RPCA), followed by postprocessing with a median filter, morphological operations, and a high-pass filter. A deep recurrent neural network (DRNN), comprising two jointly optimized, parallel stacked recurrent neural networks (sRNNs) with mask layers and trained with limited data and computation, is then applied to the decomposed components. This stage refines the estimated singing voice and background music by correcting singing or accompaniment that was misclassified or left as residual in the initial separation. Experimental results on the MIR-1K, ccMixter, and MUSDB18 datasets, together with a comparison against ten existing techniques, indicate that the proposed method achieves competitive performance in monaural singing voice separation. On MUSDB18, it reaches separation quality comparable to that of the other state-of-the-art technique while using less training data and lower computational cost.
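The RPCA stage named in the abstract splits a matrix into a low-rank part plus a sparse part. In singing voice separation the matrix is commonly a magnitude spectrogram, with the repetitive accompaniment expected in the low-rank part and the vocals in the sparse part. The sketch below is not the authors' implementation: it is a generic inexact augmented Lagrange multiplier solver for principal component pursuit, run on a synthetic matrix rather than audio. The names `rpca_ialm` and `soft_threshold`, the default `lam`, and the step-size constants are illustrative assumptions, and the postprocessing and DRNN stages are omitted.

```python
import numpy as np


def soft_threshold(x, tau):
    """Shrink every entry of x toward zero by tau."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)


def rpca_ialm(M, lam=None, tol=1e-7, max_iter=500):
    """Split M into low-rank L and sparse S with M ~= L + S.

    Hypothetical helper: a generic inexact-ALM solver for
    min ||L||_* + lam * ||S||_1  subject to  L + S = M.
    """
    M = np.asarray(M, dtype=float)
    m, n = M.shape
    if lam is None:
        lam = 1.0 / np.sqrt(max(m, n))  # common default weight (assumption)
    norm_fro = np.linalg.norm(M, "fro")
    norm_two = np.linalg.norm(M, 2)
    # Dual variable initialisation and step sizes (assumed constants).
    Y = M / max(norm_two, np.abs(M).max() / lam)
    mu, rho = 1.25 / norm_two, 1.5
    mu_max = mu * 1e7
    S = np.zeros_like(M)
    for _ in range(max_iter):
        # Low-rank update: singular value thresholding.
        U, sig, Vt = np.linalg.svd(M - S + Y / mu, full_matrices=False)
        L = (U * soft_threshold(sig, 1.0 / mu)) @ Vt
        # Sparse update: entrywise soft thresholding.
        S = soft_threshold(M - L + Y / mu, lam / mu)
        residual = M - L - S
        Y = Y + mu * residual
        mu = min(mu * rho, mu_max)
        if np.linalg.norm(residual, "fro") <= tol * norm_fro:
            break
    return L, S


# Synthetic check: rank-2 matrix plus about 5% sparse outliers.
rng = np.random.default_rng(0)
L_true = rng.standard_normal((80, 2)) @ rng.standard_normal((2, 80))
S_true = np.zeros((80, 80))
outliers = rng.random((80, 80)) < 0.05
S_true[outliers] = rng.uniform(-10.0, 10.0, size=int(outliers.sum()))

L_hat, S_hat = rpca_ialm(L_true + S_true)
rel_err = np.linalg.norm(L_hat - L_true, "fro") / np.linalg.norm(L_true, "fro")
print(f"relative error of recovered low-rank part: {rel_err:.2e}")
```

For audio, `M` would be replaced by the magnitude of a short-time Fourier transform of the mixture, and the phase of the mixture would be reused when converting each part back to a waveform.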

Pages: 21
