6 entries in total
[1]
Baby A., 2016, Resources for Indian language, V09
[2]
WhisperX: Time-Accurate Speech Transcription of Long-Form Audio [J]. INTERSPEECH 2023, 2023: 4489-4493
[3]
Garg S., 2021, 2021 NATL C COMMUN, P1
[4]
Jalili Sabet M., P 2020 C EMPIRICAL
[5]
FastPitch: Parallel Text-to-Speech with Pitch Prediction [J]. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), 2021: 6588-6592
[6]
Tiedemann J, 2020, P 22 ANNUAL CONFE