Cross-lingual speaker adaptation using domain adaptation and speaker consistency loss for text-to-speech synthesis

被引:0
|
作者
Xin, Detai [1 ]
Saito, Yuki [1 ]
Takamichi, Shinnosuke [1 ]
Koriyama, Tomoki [1 ]
Saruwatari, Hiroshi [1 ]
机构
[1] Graduate School of Information Science and Technology, The University of Tokyo, Japan
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Adaptation methods - Cross-lingual - Cross-lingual speaker adaptations - Domain adaptation - Fine tuning - Source language - Speaker adaptation - Speaker verification - Speech models - Text to speech
引用
收藏
页码:3376 / 3380
相关论文
共 50 条
  • [21] UNSUPERVISED CROSS-LINGUAL SPEAKER ADAPTATION FOR HMM-BASED SPEECH SYNTHESIS USING TWO-PASS DECISION TREE CONSTRUCTION
    Gibson, Matthew
    Hirsimaki, Teemu
    Karhila, Reima
    Kurimo, Mikko
    Byrne, William
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4642 - 4645
  • [22] Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation
    Liang, Hui
    Dines, John
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1836 - +
  • [23] Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
    Zhou, Yixuan
    Song, Changhe
    Li, Xiang
    Zhang, Luwen
    Wu, Zhiyong
    Bian, Yanyao
    Su, Dan
    Meng, Helen
    INTERSPEECH 2022, 2022, : 2573 - 2577
  • [24] Investigations on speaker adaptation using a continuous vocoder within recurrent neural network based text-to-speech synthesis
    Ali Raheem Mandeel
    Mohammed Salah Al-Radhi
    Tamás Gábor Csapó
    Multimedia Tools and Applications, 2023, 82 : 15635 - 15649
  • [25] Investigations on speaker adaptation using a continuous vocoder within recurrent neural network based text-to-speech synthesis
    Mandeel, Ali Raheem
    Al-Radhi, Mohammed Salah
    Csapo, Tamas Gabor
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (10) : 15635 - 15649
  • [26] Multi-Lingual Multi-Speaker Text-to-Speech Synthesis for Voice Cloning with Online Speaker Enrollment
    Liu, Zhaoyu
    Mak, Brian
    INTERSPEECH 2020, 2020, : 2932 - 2936
  • [27] Unsupervised Intralingual and Cross-Lingual Speaker Adaptation for HMM-Based Speech Synthesis Using Two-Pass Decision Tree Construction
    Gibson, Matthew
    Byrne, William
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 895 - 904
  • [28] SpeakerNet for Cross-lingual Text-Independent Speaker Verification
    Habib, Hafsa
    Tauseef, Huma
    Fahiem, Muhammad Abuzar
    Farhan, Saima
    Usman, Ghousia
    ARCHIVES OF ACOUSTICS, 2020, 45 (04) : 573 - 583
  • [29] Domain Adaptation for Text Dependent Speaker Verification
    Aronowitz, Hagai
    Rendel, Asaf
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1337 - 1341
  • [30] Using speaker adaptive training to realize Mandarin-Tibetan cross-lingual speech synthesis
    Yang, Hongwu
    Oura, Keiichiro
    Wang, Haiyan
    Gan, Zhenye
    Tokuda, Keiichi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (22) : 9927 - 9942