65 entries in total
[2]
Baevski A., 2020, PROC NEURIPS, V33
[3]
FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis [J]. INTERSPEECH 2021, 2021: 116-120
[4]
Battenberg E, 2019, Arxiv, DOI arXiv:1906.03402
[5]
Battenberg E, 2020, INT CONF ACOUST SPEE, P6194, DOI 10.1109/ICASSP40776.2020.9054106
[6]
A Neural Parametric Singing Synthesizer Modeling Timbre and Expression from Natural Songs [J]. APPLIED SCIENCES-BASEL, 2017, 7 (12)
[7]
Boersma P., 1993, P I PHONETIC SCI IFA, P97
[8]
Chalamandaris A., 2009, PROC 4 C HUMAN LANGU, P35
[9]
Chen LW, 2022, Arxiv, DOI arXiv:2110.06306
[10]
Speech BERT Embedding for Improving Prosody in Neural TTS [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 6563-6567