共 50 条
- [42] Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion INTERSPEECH 2022, 2022, : 2563 - 2567
- [43] DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion INTERSPEECH 2022, 2022, : 2593 - 2597
- [45] Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration INTERSPEECH 2021, 2021, : 3600 - 3604
- [46] StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 7328 - 7338
- [47] LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance INTERSPEECH 2024, 2024, : 2770 - 2774
- [48] SIG-VC: A SPEAKER INFORMATION GUIDED ZERO-SHOT VOICE CONVERSION SYSTEM FOR BOTH HUMAN BEINGS AND MACHINES 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6567 - 6571
- [49] Flow-VAE VC: End-to-End Flow Framework with Contrastive Loss for Zero-shot Voice Conversion INTERSPEECH 2023, 2023, : 2293 - 2297
- [50] Enhancing Zero-Shot Many to Many Voice Conversion via Self-Attention VAE with Structurally Regularized Layers 2022 5TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE FOR INDUSTRIES, AI4I, 2022, : 59 - 63