CROSS-LINGUAL TEXT-TO-SPEECH VIA HIERARCHICAL STYLE TRANSFER

被引:0
|
作者
Lee, Sang-Hoon [1 ]
Choi, Ha-Yeong [1 ]
Lee, Seong-Whan [1 ]
机构
[1] Korea Univ, Dept Artificial Intelligence, Seoul, South Korea
关键词
Cross-lingual TTS; Multi-lingual TTS;
D O I
10.1109/ICASSPW62465.2024.10627450
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents LIMITLESS, a cross-lingual text-to-speech via hierarchical style transfer that can transfer the prosody and voice style, respectively. Building upon HierSpeech++, we utilize the 2-stage hierarchical speech synthesis frameworks with text-to-vector (TTV) and vector-to-speech. We simply modify the TTV by adding the language embedding of each language on the text representation and use the hierarchical speech synthesizer without modification. We train the TTV model with 7 languages and 14 speakers from the Indic languages dataset which was released for LIMMITS 2024 and fine-tuned the TTV model with target speakers for Track 1 and 2. The results show that our framework can transfer voice style robustly in terms of speaker similarity.
引用
收藏
页码:25 / 26
页数:2
相关论文
共 50 条
  • [31] LNACont: Language-normalized Affine Coupling Layer with contrastive learning for Cross-lingual Multi-speaker Text-to-speech
    Hwang, Sungwoong
    Kim, Changhwan
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 391 - 395
  • [32] Hybrid Knowledge Transfer for Improved Cross-Lingual Event Detection via Hierarchical Sample Selection
    Guzman-Nateras, Luis F.
    Dernoncourt, Franck
    Nguyen, Thien Huu
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5414 - 5427
  • [33] Cross-lingual Distillation for Text Classification
    Xu, Ruochen
    Yang, Yiming
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1415 - 1425
  • [34] PromptStyle: Controllable Style Transfer for Text-to-Speech with Natural Language Descriptions
    Liu, Guanghou
    Zhang, Yongmao
    Lei, Yi
    Chen, Yunlin
    Wang, Rui
    Li, Zhifei
    Xie, Lei
    INTERSPEECH 2023, 2023, : 4888 - 4892
  • [35] Knowledge Translator: Cross-Lingual Course Video Text Style Transform via Imposed Sequential Attention Networks
    Zhang, Jingyi
    Zhao, Bocheng
    Zhang, Wenxing
    Miao, Qiguang
    ELECTRONICS, 2025, 14 (06):
  • [36] CL2CM: Improving Cross-Lingual Cross-Modal Retrieval via Cross-Lingual Knowledge Transfer
    Wang, Yabing
    Wang, Fan
    Dong, Jianfeng
    Luo, Hao
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5651 - 5659
  • [37] CROSS-LINGUAL TRANSFER LEARNING FOR LOW-RESOURCE SPEECH TRANSLATION
    Khurana, Sameer
    Dawalatabad, Nauman
    Laurent, Antoine
    Vicente, Luis
    Gimeno, Pablo
    Mingote, Victoria
    Glass, James
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 670 - 674
  • [38] CROSS-LINGUAL TRANSFER FOR SPEECH PROCESSING USING ACOUSTIC LANGUAGE SIMILARITY
    Wu, Peter
    Shi, Jiatong
    Zhong, Yifan
    Watanabe, Shinji
    Black, Alan W.
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 1050 - 1057
  • [39] Cross-lingual Text Classification via Model Translation with Limited Dictionaries
    Xu, Ruochen
    Yang, Yiming
    Liu, Hanxiao
    Hsi, Andrew
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 95 - 104
  • [40] Cross-lingual Opinion Analysis via Negative Transfer Detection
    Gui, Lin
    Xu, Ruifeng
    Lu, Qin
    Xu, Jun
    Xu, Jian
    Liu, Bin
    Wang, Xiaolong
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 860 - 865