共 34 条
[1]
Ardila R., 2019, ARXIV191206670
[2]
Arik SÖ, 2018, ADV NEUR IN, V31
[3]
Ba Jimmy Lei, 2016, arXiv, DOI DOI 10.48550/ARXIV.1607.06450
[4]
Cai W., 2018, P OD SPEAK LANG REC, P74, DOI DOI 10.21437/ODYSSEY.2018-11
[5]
Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding
[J].
INTERSPEECH 2020,
2020,
:2007-2011
[7]
Chung JS, 2018, INTERSPEECH, P1086
[8]
Cooper E, 2020, INT CONF ACOUST SPEE, P6184, DOI [10.1109/ICASSP40776.2020.9054535, 10.1109/icassp40776.2020.9054535]
[9]
Dauphin YN, 2017, PR MACH LEARN RES, V70
[10]
Golge E., 2019, GRADUAL TRAINING TAC