Predicting utterance pitch targets in Yoruba for tone realisation in speech synthesis

Cited by: 3
Authors
Van Niekerk, Daniel R. [1, 2]
Barnard, Etienne [1]
Affiliations
[1] North West Univ, Vanderbijlpark, South Africa
[2] CSIR, Meraka Inst, Human Language Technol Res Grp, ZA-0001 Pretoria, South Africa
Keywords
Yoruba; Tone language; Speech synthesis; Fundamental frequency; Universality; Intonation
DOI
10.1016/j.specom.2013.01.009
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Pitch is a fundamental acoustic feature of speech and as such needs to be determined during the process of speech synthesis. While a range of communicative functions are attributed to pitch variation in the speech of all languages, it plays a vital role in distinguishing the meaning of lexical items in tone languages. As a number of factors are assumed to affect the realisation of pitch, it is important to know which mechanisms are systematically responsible for pitch realisation in order to model these effectively and thus develop robust speech synthesis systems in under-resourced environments. To this end, features influencing syllable pitch targets in continuous utterances in Yoruba are investigated in a small speech corpus of four speakers. It is found that the previous syllable pitch level is strongly correlated with pitch changes between syllables, and a number of approaches and features are evaluated in this context. The resulting models can be used to predict utterance pitch targets for speech synthesisers (whether concatenative or statistical parametric systems), and may also prove useful in speech-recognition systems. (C) 2013 Elsevier B.V. All rights reserved.
Pages: 229-242
Page count: 14