共 50 条
- [11] END-TO-END TEXT-TO-SPEECH USING LATENT DURATION BASED ON VQ-VAE 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5694 - 5698
- [12] EXPLICIT ALIGNMENT OF TEXT AND SPEECH ENCODINGS FOR ATTENTION-BASED END-TO-END SPEECH RECOGNITION 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 913 - 919
- [13] Optimization for Low-Resource Speaker Adaptation in End-to-End Text-to-Speech 2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 1060 - 1061
- [14] Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech INTERSPEECH 2023, 2023, : 3023 - 3027
- [15] Phonetic and Prosodic Information Estimation from Texts for Genuine Japanese End-to-End Text-to-Speech INTERSPEECH 2021, 2021, : 126 - 130
- [16] SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech INTERSPEECH 2022, 2022, : 1 - 5
- [17] Reinforce-Aligner: Reinforcement Alignment Search for Robust End-to-End Text-to-Speech INTERSPEECH 2021, 2021, : 3635 - 3639
- [18] ESPNET-TTS: UNIFIED, REPRODUCIBLE, AND INTEGRATABLE OPEN SOURCE END-TO-END TEXT-TO-SPEECH TOOLKIT 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7654 - 7658
- [19] You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation 2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 439 - 444
- [20] BLSTM-CRF Based End-to-End Prosodic Boundary Prediction with Context Sensitive Embeddings in A Text-to-Speech Front-End 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 47 - 51