36 entries in total
[21]
CONVNEXT-TTS AND CONVNEXT-VC: CONVNEXT-BASED FAST END-TO-END SEQUENCE-TO-SEQUENCE TEXT-TO-SPEECH AND VOICE CONVERSION
[J].
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024),
2024,
:12456-12460
[22]
An End-to-End Chinese and Japanese Bilingual Speech Recognition Systems with Shared Character Decomposition
[J].
NEURAL INFORMATION PROCESSING, ICONIP 2022, PT VI,
2023, 1793
:493-503
[23]
Text Only Domain Adaptation with Phoneme Guided Data Splicing for End-to-End Speech Recognition
[J].
INTERSPEECH 2023,
2023,
:3347-3351
[25]
IMPROVING END-TO-END SPEECH TRANSLATION MODEL WITH BERT-BASED CONTEXTUAL INFORMATION
[J].
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2022,
:6227-6231
[26]
Investigating Radical-based End-to-End Speech Recognition Systems for Chinese Dialects and Japanese
[J].
INTERSPEECH 2019,
2019,
:2200-2204
[27]
Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
[J].
INTERSPEECH 2024,
2024,
:2795-2799
[29]
End-to-end speech recognition modeling from de-identified data
[J].
INTERSPEECH 2022,
2022,
:1382-1386
[30]
A UNIVERSAL BERT-BASED FRONT-END MODEL FOR MANDARIN TEXT-TO-SPEECH SYNTHESIS
[J].
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021),
2021,
:6074-6078