共 50 条
- [21] StyleFusion TTS: Multimodal Style-Control and Enhanced Feature Fusion for Zero-Shot Text-to-Speech Synthesis PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 263 - 277
- [23] MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2471 - 2486
- [24] Speech Enhancement with Zero-Shot Model Selection 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 491 - 495
- [25] INJECTING TEXT IN SELF-SUPERVISED SPEECH PRETRAINING 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 251 - 258
- [26] MIIPHER: A ROBUST SPEECH RESTORATION MODEL INTEGRATING SELF-SUPERVISED SPEECH AND TEXT REPRESENTATIONS 2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA, 2023,
- [28] Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis INTERSPEECH 2022, 2022, : 2573 - 2577
- [30] AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios INTERSPEECH 2022, 2022, : 2568 - 2572