RPCA-DRNN technique for monaural singing voice separation

Cited by: 8
Authors
Lai, Wen-Hsing [1]
Wang, Siou-Lin [2]
Affiliations
[1] Natl Kaohsiung Univ Sci & Technol, Dept Comp & Commun Engn, Kaohsiung 824005, Taiwan
[2] Natl Kaohsiung Univ Sci & Technol, Coll Engn, PhD Program Engn Sci & Technol, Kaohsiung 824005, Taiwan
Keywords
Singing separation; Robust principal component analysis; Deep recurrent neural network; Stacked recurrent neural network; FACTORIZATION; OPTIMIZATION; ENHANCEMENT; CONTINUITY; MODELS;
DOI
10.1186/s13636-022-00236-9
Chinese Library Classification (CLC) code
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
In this study, we propose a method for separating a singing voice from musical accompaniment in a monaural musical mixture. The method first decomposes the mixture using robust principal component analysis (RPCA) followed by postprocessing that includes median filtering, morphological operations, and high-pass filtering. A deep recurrent neural network (DRNN), comprising two jointly optimized parallel stacked recurrent neural networks (sRNNs) with mask layers and trained with limited data and computation, is then applied to the decomposed components. It refines the final estimates of the singing voice and background music by correcting misclassified or residual singing and background music left by the initial separation. Experimental results on the MIR-1K, ccMixter, and MUSDB18 datasets, together with a comparison against ten existing techniques, indicate that the proposed method achieves competitive performance in monaural singing voice separation. On MUSDB18, the proposed method reaches comparable separation quality with less training data and lower computational cost than the other state-of-the-art technique.
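The RPCA stage named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the common principal component pursuit formulation solved with an inexact augmented Lagrangian scheme, and it runs on a synthetic matrix rather than a real magnitude spectrogram. In RPCA-based separation, the low-rank part is usually read as the repetitive accompaniment and the sparse part as the singing voice. The function names, the default weight `lam`, the step parameters, and the ratio mask at the end are all assumptions made for illustration; the median filter, morphology, high-pass filter, and sRNN stages are not shown.

```python
import numpy as np

def soft_threshold(x, tau):
    """Elementwise shrinkage: the proximal operator of the l1 norm."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)

def rpca_ialm(M, lam=None, tol=1e-7, max_iter=500):
    """Split M into low-rank L plus sparse S (hypothetical helper, inexact ALM)."""
    m, n = M.shape
    if lam is None:
        lam = 1.0 / np.sqrt(max(m, n))      # common default weight (assumption)
    norm_M = np.linalg.norm(M, "fro")
    spectral = np.linalg.norm(M, 2)         # largest singular value of M
    Y = M / max(spectral, np.abs(M).max() / lam)  # dual variable initialisation
    mu = 1.25 / spectral                    # penalty parameter (assumption)
    mu_max = mu * 1e7
    rho = 1.5                               # growth factor for mu (assumption)
    L = np.zeros_like(M)
    S = np.zeros_like(M)
    for _ in range(max_iter):
        # Low-rank update: singular value thresholding.
        U, sig, Vt = np.linalg.svd(M - S + Y / mu, full_matrices=False)
        L = (U * soft_threshold(sig, 1.0 / mu)) @ Vt
        # Sparse update: elementwise shrinkage.
        S = soft_threshold(M - L + Y / mu, lam / mu)
        # Dual ascent on the constraint M = L + S.
        R = M - L - S
        Y = Y + mu * R
        mu = min(mu * rho, mu_max)
        if np.linalg.norm(R, "fro") <= tol * norm_M:
            break
    return L, S

def soft_mask(target, other, eps=1e-12):
    """Ratio mask in [0, 1] favouring `target` over `other` (illustrative only)."""
    return np.abs(target) / (np.abs(target) + np.abs(other) + eps)

# Synthetic stand-in for a spectrogram: rank-2 matrix plus 5% sparse outliers.
rng = np.random.default_rng(0)
L_true = rng.standard_normal((80, 2)) @ rng.standard_normal((2, 80))
S_true = (rng.random((80, 80)) < 0.05) * rng.uniform(-5.0, 5.0, (80, 80))
M = L_true + S_true

L_hat, S_hat = rpca_ialm(M)
voice_mask = soft_mask(S_hat, L_hat)        # would weight the sparse (vocal) part
```

On real audio, `M` would typically be the magnitude of a short-time Fourier transform of the mixture, and a mask of this kind would be applied to the complex spectrogram before inversion.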
Pages: 21
Related papers
50 items in total
  • [1] RPCA-DRNN technique for monaural singing voice separation
    Wen-Hsing Lai
    Siou-Lin Wang
    EURASIP Journal on Audio, Speech, and Music Processing, 2022
  • [2] A Skip Attention Mechanism for Monaural Singing Voice Separation
    Yuan, Weitao
    Wang, Shengbei
    Li, Xiangrui
    Unoki, Masashi
    Wang, Wenwu
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (10) : 1481 - 1485
  • [3] Enhanced feature network for monaural singing voice separation
    Yuan, Weitao
    He, Boxin
    Wang, Shengbei
    Wang, Jianming
    Unoki, Masashi
    SPEECH COMMUNICATION, 2019, 106 : 1 - 6
  • [4] Separation of singing voice from music accompaniment for monaural recordings
    Li, Yipeng
    Wang, DeLiang
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04) : 1475 - 1487
  • [5] PROXIMAL DEEP RECURRENT NEURAL NETWORK FOR MONAURAL SINGING VOICE SEPARATION
    Yuan, Weitao
    Wang, Shengbei
    Li, Xiangrui
    Unoki, Masashi
    Wang, Wenwu
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019 : 286 - 290
  • [6] Monaural singing voice separation based on high-resolution network
    Zhang Y.
    Niu Z.
    Niu B.
    Chang Y.
    Niu, Zhixian (niuniurose63@163.com), 1600, Beijing University of Aeronautics and Astronautics (BUAA) (46): : 1555 - 1563
  • [7] Singing Voice Separation Using RPCA with Weighted l1-norm
    Jeong, Il-Young
    Lee, Kyogu
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017), 2017, 10169 : 553 - 562
  • [8] Evolving Multi-Resolution Pooling CNN for Monaural Singing Voice Separation
    Yuan, Weitao
    Dong, Bofei
    Wang, Shengbei
    Unoki, Masashi
    Wang, Wenwu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 (29) : 807 - 822
  • [9] Singing-Voice Separation From Monaural Recordings using Empirical Wavelet Transform
    Nerkar, Anuja R.
    Joshi, Madhuri A.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2016, : 795 - 800
  • [10] Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation
    Pyykkonen, Pyry
    Mimilakis, Styliannos, I
    Drossos, Konstantinos
    Virtanen, Tuomas
    2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020