50 items in total
[41]
An Improved Method for Predicting Fundamental Frequency Contour in Mandarin Text-to-Speech System with a Small Corpus
[J].
TENCON 2010: 2010 IEEE REGION 10 CONFERENCE,
2010,
:751-754
[42]
Expressive paragraph text-to-speech synthesis with multi-step variational autoencoder
[J].
INTERSPEECH 2024,
2024,
:1815-1819
[43]
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS
[J].
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN),
2022,
[44]
Cross-lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space
[J].
INTERSPEECH 2020,
2020,
:2947-2951
[45]
MULTI-RATE ATTENTION ARCHITECTURE FOR FAST STREAMABLE TEXT-TO-SPEECH SPECTRUM MODELING
[J].
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021),
2021,
:5689-5693
[46]
MULTI-BAND MELGAN: FASTER WAVEFORM GENERATION FOR HIGH-QUALITY TEXT-TO-SPEECH
[J].
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT),
2021,
:492-498
[47]
Open-source Multi-speaker Speech Corpora for Building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu Speech Synthesis Systems
[J].
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020),
2020,
:6494-6503
[49]
PROMPTTTS++: CONTROLLING SPEAKER IDENTITY IN PROMPT-BASED TEXT-TO-SPEECH USING NATURAL LANGUAGE DESCRIPTIONS
[J].
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024),
2024,
:12672-12676
[50]
A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech
[J].
INTERSPEECH 2022,
2022,
:1931-1935