Integration of Yoruba language into MaryTTS

Cited by: 3
Authors
Aoga J.O.R. [1]
Dagba T.K. [2]
Fanou C.C. [3]
Affiliations
[1] Polytechnic School of Abomey-Calavi (EPAC), University of Abomey-Calavi, Abomey-Calavi
[2] National School of Applied Economics and Management (ENEAM), University of Abomey-Calavi, Abomey-Calavi
[3] School of Administration (ENAM), University of Abomey-Calavi, Abomey-Calavi
Keywords
MaryTTS; MOS; TTS; Unit-selection
DOI
10.1007/s10772-016-9334-8
Abstract
This paper presents the integration of the Yoruba language into MaryTTS and the evaluation of the resulting text-to-speech (TTS) system, which uses the unit-selection method. The Yoruba sound system contains 54 elements: 12 vowels, each with 3 tones (36 tonal vowels), and 18 consonants. The design of our system is based on extracting relevant features from the corpus and using them to select the unit sequences that give the best synthesized voice output. The evaluation based on the mean opinion score (MOS) gave a score of 2.9 out of 5, which is equivalent to good quality. Integration into MaryTTS consisted of adding a Yoruba synthesis component to MaryTTS and making the Yoruba language available for use and sharing in research. Future work will address challenges such as objective evaluation of the output audio stream, expressiveness (which will require prosodic modification), and hidden Markov model synthesis for comparison purposes, so as to improve efficiency. © 2016, Springer Science+Business Media New York.
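As a rough illustration of the unit-selection idea mentioned in the abstract, the sketch below follows the generic formulation of Hunt and Black (1996): each target position has several candidate units, and a dynamic-programming search picks the sequence that minimizes the sum of target costs and join costs. It is not the authors' implementation and does not use the MaryTTS API. The function name `select_units`, the representation of each unit as a single number (for example a pitch value), and the absolute-difference cost functions are all assumptions made for the example.

```python
from typing import Callable, List, Sequence, Tuple


def select_units(
    targets: Sequence[float],
    candidates: Sequence[Sequence[float]],
    target_cost: Callable[[float, float], float],
    join_cost: Callable[[float, float], float],
) -> Tuple[List[int], float]:
    """Return (index of the chosen candidate per position, total cost)."""
    if not targets:
        return [], 0.0
    # best[i][j]: cheapest cost of any path ending at candidate j of position i
    best = [[target_cost(targets[0], c) for c in candidates[0]]]
    # back[i][j]: index of the predecessor candidate on that cheapest path
    back: List[List[int]] = [[-1] * len(candidates[0])]
    for i in range(1, len(targets)):
        row, ptr = [], []
        for c in candidates[i]:
            # Cost of reaching c from each candidate at the previous position
            costs = [best[i - 1][k] + join_cost(p, c)
                     for k, p in enumerate(candidates[i - 1])]
            k_min = min(range(len(costs)), key=costs.__getitem__)
            row.append(costs[k_min] + target_cost(targets[i], c))
            ptr.append(k_min)
        best.append(row)
        back.append(ptr)
    # Trace back from the cheapest candidate at the final position
    j = min(range(len(best[-1])), key=best[-1].__getitem__)
    total = best[-1][j]
    path = [j]
    for i in range(len(targets) - 1, 0, -1):
        j = back[i][j]
        path.append(j)
    path.reverse()
    return path, total


# Toy example: hypothetical units described by one numeric feature each.
targets = [100, 110, 120]
candidates = [[90, 105], [100, 130], [118, 125]]
distance = lambda a, b: abs(a - b)  # assumed cost, used for both terms
path, total = select_units(targets, candidates, distance, distance)
print(path, total)
```

In a real unit-selection voice the costs would be weighted combinations of several linguistic and acoustic features extracted from the corpus; the single-feature distance here only keeps the search itself easy to follow.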
Pages: 151-158
Number of pages: 7
Related papers
16 in total
[1]  
Adedjouma A.S., Aoga J.O.R., Igue A.M., Part-of-speech tagging of Yoruba standard, language of Niger-Congo family, Research Journal of Computer and IT Science, 1, pp. 2-5, (2013)
[2]  
Afolabi A., Omidiora E., Arulogun T., Development of text to speech system for Yoruba language, Innovative Systems Design and Engineering, Special Issue of 2nd International Conference on Engineering and Technology Research, 4, pp. 1-8, (2013)
[3]  
Akinlabi A., Yorùbá sound system, Understanding Yorùbá Life and Culture, Africa World Press Inc., (2004)
[4]  
Awoyemi T., Emergent trends in English usage: language and the internet, International Journal of English Literature and Culture, 1, pp. 33-40, (2013)
[5]  
Barker D., Patterson J.K., Ritucci L.N., Folding wagon/cart, US Patent 6,845,991, Google Patents, (2005)
[6]  
Boersma P., Praat, a system for doing phonetics by computer, Glot International, 5, pp. 341-345, (2002)
[7]  
Dagba T.K., Boco C., A text to speech system for Fon language using multisyn algorithm, 18th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems (KES2014), Procedia Computer Science, 35, Elsevier, (2014)
[8]  
Gakuru M., Iraki F., Tucker R., Shalanova K., Ngugi K., Development of a Kiswahili text to speech system, Proceedings of the Interspeech, pp. 1481-1484, (2005)
[9]  
Hunt A.J., Black A.W., Unit selection in a concatenative speech synthesis system using a large speech database, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1, pp. 373-376, (1996)
[10]  
Igue A.M., Grammaire Yorùbá de base abrégée, Center for Advanced Studies of African Society (CASAS), monograph, 238, pp. 1-49, (2009)