共 177 条
[11]
EMOTION CONTROLLABLE SPEECH SYNTHESIS USING EMOTION-UNLABELED DATASET WITH THE ASSISTANCE OF CROSS-DOMAIN SPEECH EMOTION RECOGNITION
[J].
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021),
2021,
:5734-5738
[12]
FINE-GRAINED STYLE CONTROL IN TRANSFORMER-BASED TEXT-TO-SPEECH SYNTHESIS
[J].
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2022,
:7907-7911
[13]
Cheng PY, 2020, PR MACH LEARN RES, V119
[14]
Gated Recurrent Attention for Multi-Style Speech Synthesis
[J].
APPLIED SCIENCES-BASEL,
2020, 10 (15)
[15]
Choi H, 2019, INT CONF ACOUST SPEE, P6950, DOI 10.1109/ICASSP.2019.8683682
[16]
ON-THE-FLY DATA AUGMENTATION FOR TEXT-TO-SPEECH STYLE TRANSFER
[J].
2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU),
2021,
:634-641
[17]
Clark K, 2020, Arxiv, DOI arXiv:2003.10555
[18]
INTERACTIVE MULTI-LEVEL PROSODY CONTROL FOR EXPRESSIVE SPEECH SYNTHESIS
[J].
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2022,
:8312-8316
[19]
cstr, Voice cloning toolkit
[20]
cstr, The blizzard challenge