ADVERSARIAL SEMI-SUPERVISED AUDIO SOURCE SEPARATION APPLIED TO SINGING VOICE EXTRACTION

被引：0

作者：

Stoller, Daniel ^{[1
]}

Ewert, Sebastian ^{[2
]}

Dixon, Simon ^{[1
]}

机构：

[1] Queen Mary Univ London, London, England

[2] Spotify, Luxembourg, Luxembourg

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

基金：

英国工程与自然科学研究理事会;

关键词：

Source separation; Deep neural networks; Adversarial training; Semi-supervised learning;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The state of the art in music source separation employs neural networks trained in a supervised fashion on multi-track databases to estimate the sources from a given mixture. With only few datasets available, often extensive data augmentation is used to combat overfitting. Mixing random tracks, however, can even reduce separation performance as instruments in real music are strongly correlated. The key concept in our approach is that source estimates of an optimal separator should be indistinguishable from real source signals. Based on this idea, we drive the separator towards outputs deemed as realistic by discriminator networks that are trained to tell apart real from separator samples. This way, we can also use unpaired source and mixture recordings without the drawbacks of creating unrealistic music mixtures. Our framework is widely applicable as it does not assume a specific network architecture or number of sources. To our knowledge, this is the first adoption of adversarial training for music source separation. In a prototype experiment for singing voice separation, separation performance increases with our approach compared to purely supervised training.

引用

页码：2391 / 2395

页数：5

共 50 条

[21] Audio-visual domain adaptation using conditional semi-supervised Generative Adversarial Networks
Athanasiadis, Christos
Hortal, Enrique
Asteriadis, Stylianos
NEUROCOMPUTING, 2020, 397 : 331 - 344
[22] METRIC LEARNING FOR SEMI-SUPERVISED SPARSE SOURCE SEPARATION WITH SPECTRAL EXAMPLES
Bobin, Jerome
Acero, Fabio
Picquenot, Adrien
2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), 2019, : 450 - 454
[23] Semi-Supervised Learning Based on Generative Adversarial Network and Its Applied to Lithology Recognition
Li, Guohe
Qiao, Yinghan
Zheng, Yifeng
Li, Ying
Wu, Weijiang
IEEE ACCESS, 2019, 7 : 67428 - 67437
[24] ADVERSARIAL ATTACKS ON AUDIO SOURCE SEPARATION
Takahashi, Naoya
Inoue, Shota
Mitsufuji, Yuki
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 521 - 525
[25] Latent Space Virtual Adversarial Training for Supervised and Semi-Supervised Learning
Osada, Genki
Ahsan, Budrul
Prasad Bora, Revoti
Nishide, Takashi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (03) : 667 - 678
[26] Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning
Miyato, Takeru
Maeda, Shin-Ichi
Koyama, Masanori
Ishii, Shin
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (08) : 1979 - 1993
[27] Virtual Adversarial Training for Semi-supervised Verification Tasks
Noroozi, Vahid
Bahaadini, Sara
Zheng, Lei
Xie, Sihong
Yu, Philip S.
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
[28] Adversarial Variational Embedding for Robust Semi-supervised Learning
Zhang, Xiang
Yao, Lina
Yuan, Feng
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 139 - 147
[29] Generative adversarial network for semi-supervised image captioning
Liang, Xu
Li, Chen
Tian, Lihua
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
[30] Semi-supervised Seizure Prediction with Generative Adversarial Networks
Nhan Duy Truong
Zhou, Luping
Kavehei, Omid
2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 2369 - 2372

← 1 2 3 4 5 →