共 29 条
[1]
Aaron~ van den Oord Yazhe Li, 2018, PR MACH LEARN RES, P3918
[2]
[Anonymous], 2017, Char2wav: End-to-end speech synthesis
[3]
Arik SÖ, 2017, ADV NEUR IN, V30
[4]
End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning
[J].
INTERSPEECH 2019,
2019,
:2075-2079
[5]
Chung YA, 2019, INT CONF ACOUST SPEE, P6940, DOI 10.1109/ICASSP.2019.8683862
[6]
Voice Conversion from Unaligned Corpora using Variational Autoencoding Wasserstein Generative Adversarial Networks
[J].
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION,
2017,
:3364-3368
[7]
Kalchbrenner N., 2018, P INT C MACH LEARN, P2410, DOI DOI 10.48550/ARXIV.1802.08435
[8]
Kaneko T, 2019, INT CONF ACOUST SPEE, P6820, DOI [10.1109/icassp.2019.8682897, 10.1109/ICASSP.2019.8682897]
[9]
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech
[J].
INTERSPEECH 2020,
2020,
:4387-4391
[10]
Kingma D. P., 2013, ARXIV13126114