Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies

被引:27
作者
Tihelka, Daniel [1 ]
Hanzlicek, Zdenek [1 ]
Juzova, Marketa [2 ]
Vit, Jakub [2 ]
Matousek, Jindrich [1 ,2 ]
Gruber, Martin [1 ]
机构
[1] Univ West Bohemia, Fac Appl Sci, New Technol Informat Soc, Plzen, Czech Republic
[2] Univ West Bohemia, Fac Appl Sci, Dept Cybernet, Plzen, Czech Republic
来源
TEXT, SPEECH, AND DIALOGUE (TSD 2018) | 2018年 / 11107卷
关键词
Speech synthesis; Unit selection; Statistical-parametric synthesis; DNN; WaveNet; Hybrid synthesis; Personalized speech synthesis; Voice banking; VOICE CONSERVATION; ANNOTATION;
D O I
10.1007/978-3-030-00794-2_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper provides a survey of the current state of ARTIC - the modern Czech concatenative corpus-based text-to-speech system. Through more than a decade of research & development in the field of speech technologies and applications, the system was enriched with new languages (and, as a consequence, language-dependent NLP methods), and its speech generation capabilities were significantly improved when new progressive speech generation modules (SPS, DNN, HSS) were (and are still being to) designed and incorporated into it. Also, ARTIC has to deal with various requirements on data used to generate speech from, ranging in size, quality and domain of the output speech, while there always was the requirement to achieve the highest quality in terms of both naturalness and intelligibility. Thus, the paper summarizes some of the most significant achievements and demanding tasks which had to be tackled by the system, illustrating the universality and flexibility of this Czech TTS system.
引用
收藏
页码:369 / 378
页数:10
相关论文
共 40 条
[1]  
[Anonymous], 2010, P SSW7 ISCA KYOT
[2]  
[Anonymous], 2015, P MLSLP
[3]  
[Anonymous], 2009, Text-to-speech synthesis
[4]  
[Anonymous], 2017, CORR
[5]  
Hanzlicek Z., 2018, LNAI, V11107, P445
[6]   Optimal Number of States in HMM-Based Speech Synthesis [J].
Hanzlicek, Zdenek .
TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 :353-361
[7]   Voice Conservation: Towards Creating a Speech-Aid System for Total Laryngectomees [J].
Hanzlicek, Zdenek ;
Romportl, Jan ;
Matousek, Jindrich .
BEYOND ARTIFICIAL INTELLIGENCE: CONTEMPLATIONS, EXPECTATIONS, APPLICATIONS, 2013, 4 :203-212
[8]  
Hanzlícek Z, 2013, LECT NOTES COMPUT SC, V8082, P249, DOI 10.1007/978-3-642-40585-3_32
[9]  
Hanzlícek Z, 2011, LECT NOTES ARTIF INT, V6836, P107, DOI 10.1007/978-3-642-23538-2_14
[10]  
Hanzlícek Z, 2010, LECT NOTES ARTIF INT, V6231, P291, DOI 10.1007/978-3-642-15760-8_37