共 50 条
[31]
GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis
[J].
INTERSPEECH 2021,
2021,
:2202-2206
[32]
MULTI-SPEAKER EMOTIONAL ACOUSTIC MODELING FOR CNN-BASED SPEECH SYNTHESIS
[J].
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2019,
:6950-6954
[33]
An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis
[J].
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021),
2021, 192
:756-765
[34]
A set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese
[J].
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING,
2002, 10 (07)
:481-494
[35]
Optimization for Low-Resource Speaker Adaptation in End-to-End Text-to-Speech
[J].
2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC,
2024,
:1060-1061
[36]
1000 African Voices: Advancing inclusive multi-speaker multi-accent speech synthesis
[J].
INTERSPEECH 2024,
2024,
:1855-1859
[38]
BOOTSTRAPPING NON-PARALLEL VOICE CONVERSION FROM SPEAKER-ADAPTIVE TEXT-TO-SPEECH
[J].
2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019),
2019,
:200-207
[39]
The paradigm for creating multi-lingual text-to-speech voice databases
[J].
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS,
2006, 4274
:736-+
[40]
Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
[J].
INTERSPEECH 2022,
2022,
:2573-2577