共 14 条
- [1] Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech INTERSPEECH 2023, 2023, : 4299 - 4303
- [4] ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis 2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 230 - 234
- [5] Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis INTERSPEECH 2022, 2022, : 2573 - 2577
- [6] INVESTIGATING ON INCORPORATING PRETRAINED AND LEARNABLE SPEAKER REPRESENTATIONS FOR MULTI-SPEAKER MULTI-STYLE TEXT-TO-SPEECH 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 8588 - 8592
- [7] EXACT PROSODY CLONING IN ZERO-SHOT MULTISPEAKER TEXT-TO-SPEECH 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 962 - 969
- [8] Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations INTERSPEECH 2023, 2023, : 4454 - 4458
- [9] Hierarchical Timbre-Cadence Speaker Encoder for Zero-shot Speech Synthesis INTERSPEECH 2023, 2023, : 4334 - 4338