27 entries in total
[1]
One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization [J]. INTERSPEECH 2019, 2019: 664-668
[2]
Chou JC, 2018, INTERSPEECH, P501
[3]
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[4]
Hasegawa-Johnson Mark, 2019, ARXIV190505879
[5]
Voice Conversion from Unaligned Corpora using Variational Autoencoding Wasserstein Generative Adversarial Networks [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017: 3364-3368
[6]
Huang WC, 2018, 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), P51, DOI 10.1109/ISCSLP.2018.8706604
[7]
Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 1510-1519
[8]
Kameoka H, 2018, IEEE W SP LANG TECH, P266, DOI 10.1109/SLT.2018.8639535
[9]
Kaneko T., 2017, arXiv
[10]
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion [J]. INTERSPEECH 2019, 2019: 679-683