Cross-lingual speaker adaptation using domain adaptation and speaker consistency loss for text-to-speech synthesis

被引：0

作者：

Xin, Detai ^{[1
]}

Saito, Yuki ^{[1
]}

Takamichi, Shinnosuke ^{[1
]}

Koriyama, Tomoki ^{[1
]}

Saruwatari, Hiroshi ^{[1
]}

机构：

[1] Graduate School of Information Science and Technology, The University of Tokyo, Japan

来源：

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | 2021年 / 5卷

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Adaptation methods - Cross-lingual - Cross-lingual speaker adaptations - Domain adaptation - Fine tuning - Source language - Speaker adaptation - Speaker verification - Speech models - Text to speech

引用

页码：3376 / 3380

共 50 条

[21] UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS USING TWO-PASS DECISION TREE CONSTRUCTION
Gibson, Matthew
Hirsimaki, Teemu
Karhila, Reima
Kurimo, Mikko
Byrne, William
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4642 - 4645
[22] Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation
Liang, Hui
Dines, John
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1836 - +
[23] Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
Zhou, Yixuan
Song, Changhe
Li, Xiang
Zhang, Luwen
Wu, Zhiyong
Bian, Yanyao
Su, Dan
Meng, Helen
INTERSPEECH 2022, 2022, : 2573 - 2577
[24] Investigations on speaker adaptation using a continuous vocoder within recurrent neural network based text-to-speech synthesis
Ali Raheem Mandeel
Mohammed Salah Al-Radhi
Tamás Gábor Csapó
Multimedia Tools and Applications, 2023, 82 : 15635 - 15649
[25] Investigations on speaker adaptation using a continuous vocoder within recurrent neural network based text-to-speech synthesis
Mandeel, Ali Raheem
Al-Radhi, Mohammed Salah
Csapo, Tamas Gabor
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (10) : 15635 - 15649
[26] Multi-Lingual Multi-Speaker Text-to-Speech Synthesis for Voice Cloning with Online Speaker Enrollment
Liu, Zhaoyu
Mak, Brian
INTERSPEECH 2020, 2020, : 2932 - 2936
[27] Unsupervised Intralingual and Cross-Lingual Speaker Adaptation for HMM-Based Speech Synthesis Using Two-Pass Decision Tree Construction
Gibson, Matthew
Byrne, William
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 895 - 904
[28] SpeakerNet for Cross-lingual Text-Independent Speaker Verification
Habib, Hafsa
Tauseef, Huma
Fahiem, Muhammad Abuzar
Farhan, Saima
Usman, Ghousia
ARCHIVES OF ACOUSTICS, 2020, 45 (04) : 573 - 583
[29] Domain Adaptation for Text Dependent Speaker Verification
Aronowitz, Hagai
Rendel, Asaf
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1337 - 1341
[30] Using speaker adaptive training to realize Mandarin-Tibetan cross-lingual speech synthesis
Yang, Hongwu
Oura, Keiichiro
Wang, Haiyan
Gan, Zhenye
Tokuda, Keiichi
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (22) : 9927 - 9942

← 1 2 3 4 5 →