On the Impact of Labialization Contexts on Unit Selection Speech Synthesis

被引:0
作者
Tihelka, Daniel [1 ]
Hanzlicek, Zdenek [1 ]
Machac, Pavel [2 ]
Skarnitzl, Radek [2 ]
Matousek, Jindrich [1 ]
机构
[1] Univ W Bohemia, Dept Cybernet, Plzen, Czech Republic
[2] Charles Univ Prague, Inst Phonet, Prague, Czech Republic
来源
2012 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT) | 2012年
关键词
coarticulatory labialization; speech synthesis; unit selection;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a study on coarticulatory labialization and the significance of its respecting/violation during selection and concatenation of speech units in the unit selection speech synthesis. The aim of this study is to improve the overall speech quality, especially to increase the perceptual inconspicuousness between concatenated units. The labialization importance was verified by two listening tests-for phonetic laymen and specialists. To suppress the influence of other factors, both tests contained utterances with specially selected phones in specific contexts with respected and violated labialization. The preference for items with correct labialization was evident, which confirms the benefit of considering coarticulatory labialization in a unit selection speech synthesis.
引用
收藏
页码:187 / 192
页数:6
相关论文
共 50 条
  • [41] Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech
    Barra-Chicote, Roberto
    Yamagishi, Junichi
    King, Simon
    Manuel Montero, Juan
    Macias-Guarasa, Javier
    SPEECH COMMUNICATION, 2010, 52 (05) : 394 - 404
  • [42] Unit Selection Speech Synthesis Using Frame-Sized Speech Segments and Neural Network Based Acoustic Models
    Zhen-Hua Ling
    Zhi-Ping Zhou
    Journal of Signal Processing Systems, 2018, 90 : 1053 - 1062
  • [43] Unit Selection Speech Synthesis Using Frame-Sized Speech Segments and Neural Network Based Acoustic Models
    Ling, Zhen-Hua
    Zhou, Zhi-Ping
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 90 (07): : 1053 - 1062
  • [44] Learning and Modeling Unit Embeddings Using Deep Neural Networks for Unit-Selection-Based Mandarin Speech Synthesis
    Zhou, Xiao
    Ling, Zhen-Hua
    Dai, Li-Rong
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (03)
  • [45] A classifier-based target cost for unit selection speech synthesis trained on perceptual data
    Strom, Volker
    King, Simon
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 150 - 153
  • [46] One-Class Classification for Spectral Join Cost Calculation in Unit Selection Speech Synthesis
    Karabetsos, Sotiris
    Tsiakoulis, Pirros
    Chalamandaris, Aimilios
    Raptis, Spyros
    IEEE SIGNAL PROCESSING LETTERS, 2010, 17 (08) : 746 - 749
  • [47] Application of Genetic Algorithm in unit selection for Malay speech synthesis system
    Lim, Yee Chea
    Tan, Tian Swee
    Hussain, Sheikh
    Salleh, Shaikh
    Ling, Dandy Kwong
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (05) : 5376 - 5383
  • [48] Concatenative speech synthesis based on the plural unit selection and fusion method
    Mizutani, T
    Kagoshima, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (11): : 2565 - 2572
  • [49] An Overview of the ILSP Unit Selection Text-to-Speech Synthesis System
    Tsiakoulis, Pirros
    Karabetsos, Sotiris
    Chalamandaris, Aimilios
    Raptis, Spyros
    ARTIFICIAL INTELLIGENCE: METHODS AND APPLICATIONS, 2014, 8445 : 370 - 383
  • [50] Continuity Metric for Unit Selection based Text-to-Speech Synthesis
    Lakkavalli, Vikram Ramesh
    Arulmozhi, P.
    Ramakrishnan, A. G.
    2010 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2010,