Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies

被引：27

作者：

Tihelka, Daniel ^{[1
]}

Hanzlicek, Zdenek ^{[1
]}

Juzova, Marketa ^{[2
]}

Vit, Jakub ^{[2
]}

Matousek, Jindrich ^{[1
,2
]}

Gruber, Martin ^{[1
]}

机构：

[1] Univ West Bohemia, Fac Appl Sci, New Technol Informat Soc, Plzen, Czech Republic

[2] Univ West Bohemia, Fac Appl Sci, Dept Cybernet, Plzen, Czech Republic

来源：

TEXT, SPEECH, AND DIALOGUE (TSD 2018) | 2018年 / 11107卷

关键词：

Speech synthesis; Unit selection; Statistical-parametric synthesis; DNN; WaveNet; Hybrid synthesis; Personalized speech synthesis; Voice banking; VOICE CONSERVATION; ANNOTATION;

D O I：

10.1007/978-3-030-00794-2_40

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper provides a survey of the current state of ARTIC - the modern Czech concatenative corpus-based text-to-speech system. Through more than a decade of research & development in the field of speech technologies and applications, the system was enriched with new languages (and, as a consequence, language-dependent NLP methods), and its speech generation capabilities were significantly improved when new progressive speech generation modules (SPS, DNN, HSS) were (and are still being to) designed and incorporated into it. Also, ARTIC has to deal with various requirements on data used to generate speech from, ranging in size, quality and domain of the output speech, while there always was the requirement to achieve the highest quality in terms of both naturalness and intelligibility. Thus, the paper summarizes some of the most significant achievements and demanding tasks which had to be tackled by the system, illustrating the universality and flexibility of this Czech TTS system.

引用

页码：369 / 378

页数：10

共 40 条

[1]

[Anonymous], 2010, P SSW7 ISCA KYOT

[2]

[Anonymous], 2015, P MLSLP

[3]

[Anonymous], 2009, Text-to-speech synthesis

[4]

[Anonymous], 2017, CORR

[5]

Hanzlicek Z., 2018, LNAI, V11107, P445

[6] Optimal Number of States in HMM-Based Speech Synthesis [J].

Hanzlicek, Zdenek .

TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 :353-361

[7] Voice Conservation: Towards Creating a Speech-Aid System for Total Laryngectomees [J].

Hanzlicek, Zdenek ;

Romportl, Jan ;

Matousek, Jindrich .

BEYOND ARTIFICIAL INTELLIGENCE: CONTEMPLATIONS, EXPECTATIONS, APPLICATIONS, 2013, 4 :203-212

[8]

Hanzlícek Z, 2013, LECT NOTES COMPUT SC, V8082, P249, DOI 10.1007/978-3-642-40585-3_32

[9]

Hanzlícek Z, 2011, LECT NOTES ARTIF INT, V6836, P107, DOI 10.1007/978-3-642-23538-2_14

[10]

Hanzlícek Z, 2010, LECT NOTES ARTIF INT, V6231, P291, DOI 10.1007/978-3-642-15760-8_37

← 1 2 3 4 →