On the Impact of Labialization Contexts on Unit Selection Speech Synthesis

被引：0

作者：

Tihelka, Daniel ^{[1
]}

Hanzlicek, Zdenek ^{[1
]}

Machac, Pavel ^{[2
]}

Skarnitzl, Radek ^{[2
]}

Matousek, Jindrich ^{[1
]}

机构：

[1] Univ W Bohemia, Dept Cybernet, Plzen, Czech Republic

[2] Charles Univ Prague, Inst Phonet, Prague, Czech Republic

来源：

2012 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT) | 2012年

关键词：

coarticulatory labialization; speech synthesis; unit selection;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a study on coarticulatory labialization and the significance of its respecting/violation during selection and concatenation of speech units in the unit selection speech synthesis. The aim of this study is to improve the overall speech quality, especially to increase the perceptual inconspicuousness between concatenated units. The labialization importance was verified by two listening tests-for phonetic laymen and specialists. To suppress the influence of other factors, both tests contained utterances with specially selected phones in specific contexts with respected and violated labialization. The preference for items with correct labialization was evident, which confirms the benefit of considering coarticulatory labialization in a unit selection speech synthesis.

引用

页码：187 / 192

页数：6

共 50 条

[41] Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech
Barra-Chicote, Roberto
Yamagishi, Junichi
King, Simon
Manuel Montero, Juan
Macias-Guarasa, Javier
SPEECH COMMUNICATION, 2010, 52 (05) : 394 - 404
[42] Unit Selection Speech Synthesis Using Frame-Sized Speech Segments and Neural Network Based Acoustic Models
Zhen-Hua Ling
Zhi-Ping Zhou
Journal of Signal Processing Systems, 2018, 90 : 1053 - 1062
[43] Unit Selection Speech Synthesis Using Frame-Sized Speech Segments and Neural Network Based Acoustic Models
Ling, Zhen-Hua
Zhou, Zhi-Ping
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 90 (07): : 1053 - 1062
[44] Learning and Modeling Unit Embeddings Using Deep Neural Networks for Unit-Selection-Based Mandarin Speech Synthesis
Zhou, Xiao
Ling, Zhen-Hua
Dai, Li-Rong
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (03)
[45] A classifier-based target cost for unit selection speech synthesis trained on perceptual data
Strom, Volker
King, Simon
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 150 - 153
[46] One-Class Classification for Spectral Join Cost Calculation in Unit Selection Speech Synthesis
Karabetsos, Sotiris
Tsiakoulis, Pirros
Chalamandaris, Aimilios
Raptis, Spyros
IEEE SIGNAL PROCESSING LETTERS, 2010, 17 (08) : 746 - 749
[47] Application of Genetic Algorithm in unit selection for Malay speech synthesis system
Lim, Yee Chea
Tan, Tian Swee
Hussain, Sheikh
Salleh, Shaikh
Ling, Dandy Kwong
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (05) : 5376 - 5383
[48] Concatenative speech synthesis based on the plural unit selection and fusion method
Mizutani, T
Kagoshima, T
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (11): : 2565 - 2572
[49] An Overview of the ILSP Unit Selection Text-to-Speech Synthesis System
Tsiakoulis, Pirros
Karabetsos, Sotiris
Chalamandaris, Aimilios
Raptis, Spyros
ARTIFICIAL INTELLIGENCE: METHODS AND APPLICATIONS, 2014, 8445 : 370 - 383
[50] Continuity Metric for Unit Selection based Text-to-Speech Synthesis
Lakkavalli, Vikram Ramesh
Arulmozhi, P.
Ramakrishnan, A. G.
2010 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2010,

← 1 2 3 4 5 →