ADVERSARIAL SEMI-SUPERVISED AUDIO SOURCE SEPARATION APPLIED TO SINGING VOICE EXTRACTION

Cited by: 0
Authors
Stoller, Daniel [1]
Ewert, Sebastian [2]
Dixon, Simon [1]
Affiliations
[1] Queen Mary Univ London, London, England
[2] Spotify, Luxembourg, Luxembourg
Source
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Source separation; Deep neural networks; Adversarial training; Semi-supervised learning
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The state of the art in music source separation employs neural networks trained in a supervised fashion on multi-track databases to estimate the sources from a given mixture. With only a few datasets available, extensive data augmentation is often used to combat overfitting. Mixing random tracks, however, can even reduce separation performance, as instruments in real music are strongly correlated. The key concept in our approach is that source estimates of an optimal separator should be indistinguishable from real source signals. Based on this idea, we drive the separator towards outputs deemed realistic by discriminator networks that are trained to tell real source samples apart from separator outputs. This way, we can also use unpaired source and mixture recordings without the drawbacks of creating unrealistic music mixtures. Our framework is widely applicable as it does not assume a specific network architecture or number of sources. To our knowledge, this is the first adoption of adversarial training for music source separation. In a prototype experiment for singing voice separation, separation performance increases with our approach compared to purely supervised training.
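The training idea in the abstract can be illustrated with a minimal sketch of the two objectives involved. This is not the authors' implementation: it assumes a standard binary cross-entropy GAN loss, a mean-squared-error supervised term, and a hypothetical weighting factor `alpha`, none of which are stated in the abstract. The function names are likewise illustrative. Discriminator scores are passed in as plain arrays of logits so that the example stays independent of any network architecture.

```python
import numpy as np

EPS = 1e-12  # guards against log(0)


def sigmoid(x):
    """Map discriminator logits to probabilities of 'real'."""
    return 1.0 / (1.0 + np.exp(-np.asarray(x, dtype=float)))


def discriminator_loss(real_logits, fake_logits):
    """Cross-entropy loss for a discriminator (assumed loss form).

    real_logits: scores for real source recordings (label 1).
    fake_logits: scores for separator outputs (label 0).
    """
    p_real = sigmoid(real_logits)
    p_fake = sigmoid(fake_logits)
    loss_real = -np.mean(np.log(p_real + EPS))
    loss_fake = -np.mean(np.log(1.0 - p_fake + EPS))
    return float(loss_real + loss_fake)


def separator_loss(fake_logits, estimates=None, targets=None, alpha=0.1):
    """Separator objective: supervised term plus adversarial term.

    For paired data, `estimates` and `targets` give an MSE term.
    For unpaired mixtures, pass neither; only the adversarial term,
    which rewards outputs the discriminator rates as real, remains.
    `alpha` is a hypothetical weighting, not taken from the paper.
    """
    supervised = 0.0
    if estimates is not None and targets is not None:
        diff = np.asarray(estimates, dtype=float) - np.asarray(targets, dtype=float)
        supervised = np.mean(diff ** 2)
    adversarial = -np.mean(np.log(sigmoid(fake_logits) + EPS))
    return float(supervised + alpha * adversarial)
```

In this sketch the unpaired case simply drops the supervised term, which is one way to read the abstract's point that unpaired source and mixture recordings can be used without constructing artificial mixtures.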
Pages: 2391-2395
Page count: 5