共 33 条
[1]
Agostinelli Andrea, 2023, Musiclm: Generating music from text
[2]
Brown T., 2020, P NEURIPS VANC CAN D
[3]
Bruce G., 1995, P EUR, P1169
[4]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[5]
CONVERSATIONAL END-TO-END TTS FOR VOICE AGENTS
[J].
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT),
2021,
:403-409
[6]
Guo Z., 2022, Prompttts: Controllable text-to-speech with text descriptions
[8]
Ho J., 2020, P NEURIPS VANC CAN D
[9]
Evaluating Intention Communication by TTS using Explicit Definitions of Illocutionary Act Performance
[J].
INTERSPEECH 2019,
2019,
:1536-1540
[10]
Kim S., 2014, P APSIPA ASC SIEM RE