50 entries in total
[31]
Speech Synthesis Adaption Method Based on Phoneme-Level Speaker Embedding Under Small Data
[J].
Jisuanji Xuebao/Chinese Journal of Computers,
2022, 45 (05)
:1003-1017
[32]
Focusing on Attention: Prosody Transfer and Adaptative Optimization Strategy for Multi-Speaker End-to-End Speech Synthesis
[J].
2020 IEEE International Conference on Acoustics, Speech, and Signal Processing,
2020,
:6709-6713
[33]
Waveform-Based Speaker Representations for Speech Synthesis
[J].
19th Annual Conference of the International Speech Communication Association (INTERSPEECH 2018), Vols 1-6: Speech Research for Emerging Markets in Multilingual Societies,
2018,
:897-901
[34]
Linear Networks Based Speaker Adaptation for Speech Synthesis
[J].
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
2018,
:5319-5323
[36]
A study of speaker adaptation for DNN-based speech synthesis
[J].
16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015), Vols 1-5,
2015,
:879-883
[37]
A Method for Emotional Speech Synthesis Based on Speaker Adaptive Training
[J].
2018 11th International Symposium on Chinese Spoken Language Processing (ISCSLP),
2018,
:31-35
[39]
Speaker-Aware Training of Attention-Based End-to-End Speech Recognition Using Neural Speaker Embeddings
[J].
2020 IEEE International Conference on Acoustics, Speech, and Signal Processing,
2020,
:7064-7068
[40]
Multi-Scale Speaker Vectors for Zero-Shot Speech Synthesis
[J].
2022 IEEE 46th Annual Computers, Software, and Applications Conference (COMPSAC 2022),
2022,
:496-501