共 22 条
[1]
[Anonymous], 2017, LJ SPEECH DATASET
[2]
Bae J., 2020, ARXIV PREPRINT ARXIV
[3]
WHISPERED AND LOMBARD NEURAL SPEECH SYNTHESIS
[J].
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT),
2021,
:454-461
[4]
LOW-RESOURCE EXPRESSIVE TEXT-TO-SPEECH USING DATA AUGMENTATION
[J].
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021),
2021,
:6593-6597
[5]
Karlapati Sri, 2020, ARXIV PREPRINT ARXIV
[6]
Kingma DP, 2014, ADV NEUR IN, V27
[8]
Liu DR, 2018, IEEE W SP LANG TECH, P640, DOI 10.1109/SLT.2018.8639672
[9]
Montreal Forced Aligner: trainable text-speech alignment using Kaldi
[J].
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION,
2017,
:498-502
[10]
Paul Dipjyoti, 2020, ARXIV PREPRINT ARXIV