Phonetic alignment for speech synthesis in under-resourced languages

Citations: 0
Authors
van Niekerk, D. R. [1]
Barnard, E.
Affiliations
[1] CSIR, Human Language Technol Res Grp, Meraka Inst, Pretoria, South Africa
Source
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009
Keywords
speech synthesis; phonetic speech segmentation; resource scarce language;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The rapid development of concatenative speech synthesis systems in resource scarce languages requires an efficient and accurate solution with regard to automated phonetic alignment. However, in this context corpora are often minimally designed due to a lack of resources and expertise necessary for large scale development. Under these circumstances many techniques toward accurate segmentation are not feasible and it is unclear which approaches should be followed. In this paper we investigate this problem by evaluating alignment approaches and demonstrating how these approaches can be applied to limit manual interaction while achieving acceptable alignment accuracy with minimal ideal resources.
Pages: 856 / +
Page count: 2
Related papers
16 items in total
[1]  
Adell J, 2005, INT CONF ACOUST SPEE, P309
[2]  
Black Alan W., 2007, BUILDING SYNTHETIC V
[3]   Multisyn: Open-domain unit selection for the Festival speech synthesis system [J].
Clark, Robert A. J. ;
Richmond, Korin ;
King, Simon .
SPEECH COMMUNICATION, 2007, 49 (04) :317-330
[4]  
Garofolo JS, 1993, TIMIT acoustic-phonetic continuous speech corpus, DOI 10.35111/17GK-BN40
[5]  
Kim Y.-J., 2002, P INT C SPOK LANG PR, P145
[6]  
KOMINEK J, 2003, EUROSPEECH, P313
[7]  
Louw J.A., 2006, S AFRICAN J AFRICAN, V2, P1
[8]  
Makashay M.J., 2000, Proc. ICSLP, V2, P431
[9]   Phonetic alignment: speech synthesis-based vs. Viterbi-based [J].
Malfrère, F ;
Deroo, O ;
Dutoit, T ;
Ris, C .
SPEECH COMMUNICATION, 2003, 40 (04) :503-515
[10]  
MALFRERE F, 1997, EUROSPEECH97, P2631