Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies

被引:27
|
作者
Tihelka, Daniel [1 ]
Hanzlicek, Zdenek [1 ]
Juzova, Marketa [2 ]
Vit, Jakub [2 ]
Matousek, Jindrich [1 ,2 ]
Gruber, Martin [1 ]
机构
[1] Univ West Bohemia, Fac Appl Sci, New Technol Informat Soc, Plzen, Czech Republic
[2] Univ West Bohemia, Fac Appl Sci, Dept Cybernet, Plzen, Czech Republic
来源
TEXT, SPEECH, AND DIALOGUE (TSD 2018) | 2018年 / 11107卷
关键词
Speech synthesis; Unit selection; Statistical-parametric synthesis; DNN; WaveNet; Hybrid synthesis; Personalized speech synthesis; Voice banking; VOICE CONSERVATION; ANNOTATION;
D O I
10.1007/978-3-030-00794-2_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper provides a survey of the current state of ARTIC - the modern Czech concatenative corpus-based text-to-speech system. Through more than a decade of research & development in the field of speech technologies and applications, the system was enriched with new languages (and, as a consequence, language-dependent NLP methods), and its speech generation capabilities were significantly improved when new progressive speech generation modules (SPS, DNN, HSS) were (and are still being to) designed and incorporated into it. Also, ARTIC has to deal with various requirements on data used to generate speech from, ranging in size, quality and domain of the output speech, while there always was the requirement to achieve the highest quality in terms of both naturalness and intelligibility. Thus, the paper summarizes some of the most significant achievements and demanding tasks which had to be tackled by the system, illustrating the universality and flexibility of this Czech TTS system.
引用
收藏
页码:369 / 378
页数:10
相关论文
共 50 条
  • [1] Dealing with prosody in a text-to-speech system
    Goldsmith J.
    International Journal of Speech Technology, 1999, 3 (1) : 51 - 63
  • [2] TTTS: TURKISH TEXT-TO-SPEECH SYSTEM
    Gormez, Zeliha
    Orhan, Zeynep
    PROCEEDINGS OF THE 12TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS , PTS 1-3: NEW ASPECTS OF COMPUTERS, 2008, : 977 - +
  • [3] Design and Implementation of a Diacritic Arabic Text-To-Speech System
    Amrouche, Aissa
    Falek, Leila
    Teffahi, Hocine
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (04) : 488 - 494
  • [4] Using Audio Books for Training a Text-to-Speech System
    Chalamandaris, Aimilios
    Tsiakoulis, Pirros
    Karabetsos, Sotiris
    Raptis, Spryos
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 3076 - 3080
  • [5] Text-To-Speech Intelligibility across Speech Rates
    Syrdal, Ann K.
    Bunnell, H. Timothy
    Hertz, Susan R.
    Mishra, Taniya
    Spiegel, Murray
    Bickley, Corine
    Rekart, Deborah
    Makashay, Matthew J.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 622 - 625
  • [6] Indonesian Text-To-Speech System Using Syllable Concatenation: Speech Optimization
    Mengko, Richard
    Ayuningtyas, Aulia
    PROCEEDINGS OF 2013 3RD INTERNATIONAL CONFERENCE ON INSTRUMENTATION, COMMUNICATIONS, INFORMATION TECHNOLOGY, AND BIOMEDICAL ENGINEERING (ICICI-BME), 2013, : 412 - 415
  • [7] A Prosodic Text-to-Speech System for Yoruba Language
    Akinwonmi, Akintoba Emmanuel
    Alese, Boniface Kayode
    2013 8TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2013, : 630 - 635
  • [8] Towards a Modern Text-to-Speech System for Latvian
    Dargis, Roberts
    Auzina, Ilze
    HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, BALTIC HLT 2018, 2018, 307 : 26 - 29
  • [9] The German Text-to-Speech Synthesis System MARY: A Tool for Research, Development and Teaching
    Marc Schröder
    Jürgen Trouvain
    International Journal of Speech Technology, 2003, 6 (4) : 365 - 377
  • [10] Towards Universal Text-to-Speech
    Yang, Jingzhou
    He, Lei
    INTERSPEECH 2020, 2020, : 3171 - 3175