Hola-TTS: A Cross-Lingual Zero-Shot Text-to-Speech System for Chinese, English, Japanese, and Korean

Cited by: 0
Authors
Ding, Hongwu [1 ]
Zhou, Yiquan [2 ]
Wang, Wenyu [2 ]
Xu, JiaCheng [1 ]
Mei, Jiaqi [1 ]
Affiliations
[1] School of Computer Science and Technology, Anhui University, Hefei, China
[2] School of Software Engineering, Xi’an Jiaotong University, Xi’an, China
Keywords
Compilation and indexing terms; Copyright 2025 Elsevier Inc;
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Artificial intelligence - Computational linguistics - Speech enhancement
Pages: 601-605
Related papers
50 in total
  • [31] EXACT PROSODY CLONING IN ZERO-SHOT MULTISPEAKER TEXT-TO-SPEECH
    Lux, Florian
    Koch, Julia
    Vu, Ngoc Thang
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 962 - 969
  • [32] CROSS-LINGUAL TEXT-TO-SPEECH VIA HIERARCHICAL STYLE TRANSFER
    Lee, Sang-Hoon
    Choi, Ha-Yeong
    Lee, Seong-Whan
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 25 - 26
  • [33] A joint learning approach with knowledge injection for zero-shot cross-lingual hate speech detection
    Pamungkas, Endang Wahyu
    Basile, Valerio
    Patti, Viviana
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
  • [34] Don't Use English Dev: On the Zero-Shot Cross-Lingual Evaluation of Contextual Embeddings
    Keung, Phillip
    Lu, Yichao
    Salazar, Julian
    Bhardwaj, Vikas
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 549 - 554
  • [35] StyleFusion TTS: Multimodal Style-Control and Enhanced Feature Fusion for Zero-Shot Text-to-Speech Synthesis
    Chen, Zhiyong
    Li, Xinnuo
    Ai, Zhiqi
    Xu, Shugong
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XI, 2025, 15041 : 263 - 277
  • [36] English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too
    Phang, Jason
    Calixto, Iacer
    Htut, Phu Mon
    Pruksachatkun, Yada
    Liu, Haokun
    Vania, Clara
    Kann, Katharina
    Bowman, Samuel R.
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 557 - 575
  • [37] Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
    Tang, Chuanxin
    Luo, Chong
    Zhao, Zhiyuan
    Yin, Dacheng
    Zhao, Yucheng
    Zeng, Wenjun
    INTERSPEECH 2021, 2021, : 3600 - 3604
  • [38] Combining Cross-lingual and Cross-task Supervision for Zero-Shot Learning
    Pikuliak, Matus
    Simko, Marian
    TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 162 - 170
  • [39] Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing
    Shi, Freda
    Gimpel, Kevin
    Livescu, Karen
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6547 - 6563
  • [40] Curriculum meta-learning for zero-shot cross-lingual transfer
    Doan, Toan
    Le, Bac
    KNOWLEDGE-BASED SYSTEMS, 2024, 301