26 entries in total
[1] Expressive, Variable, and Controllable Duration Modelling in TTS [J]. INTERSPEECH 2022, 2022: 4546-4550
[3] Clark R. A., 1999, INT C PHON SCI
[4] Parallel Tacotron: Non-Autoregressive and Controllable TTS [J]. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), 2021: 5709-5713
[5] Remap, Warp and Attend: Non-Parallel Many-to-Many Accent Conversion with Normalizing Flows [J]. 2022 IEEE Spoken Language Technology Workshop, SLT, 2022: 984-990
[6] Hodari Z., 2019, PROC 10 ISCA SPEECH, P239, DOI 10.21437/SSW.2019-43
[7] Jeong M., 2021, P INTERSPEECH
[8] Universal Neural Vocoding with Parallel WaveNet [J]. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), 2021: 6044-6048
[10] Kim Jaehyeon, 2020, ADV NEURAL INFORM PR, V33, P8067