Developing an Open-Source Corpus of Yoruba Speech

被引:14
作者
Gutkin, Alexander [1 ]
Demirsahin, Isin [1 ]
Kjartansson, Oddur [1 ]
Rivera, Clara [1 ]
Tnbastin, Kola [2 ]
机构
[1] Google Res, London, England
[2] British Lib, London, England
来源
INTERSPEECH 2020 | 2020年
关键词
speech corpora; open-source; West Africa;
D O I
10.21437/Interspeech.2020-1096
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper introduces an open-source speech dataset for Yoruba - one of the largest low-resource West African languages spoken by at least 22 million people. Yoruba is one of the official languages of Nigeria, Benin and Togo, and is spoken in other neighboring African countries and beyond. The corpus consists of over four hours of 48 kHz recordings from 36 male and female volunteers and the corresponding transcriptions that include disfluency annotation. The transcriptions have full diacritization, which is vital for pronunciation and lexical disambiguation. The annotated speech dataset described in this paper is primarily intended for use in text-to-speech systems, serve as adaptation data in automatic speech recognition and speech-to-speech translation, and provide insights in West African corpus linguistics. We demonstrate the use of this corpus in a simple statistical parametric speech synthesis (SPSS) scenario evaluating it against the related languages from the CMU Wilderness dataset and the Yoruba Lagos-NWU corpus.
引用
收藏
页码:404 / 408
页数:5
相关论文
共 47 条
[11]  
Black AW, 2019, INT CONF ACOUST SPEE, P5971, DOI 10.1109/ICASSP.2019.8683536
[12]  
Black AW, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P1211
[13]  
Black AW, 2006, INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, P1762
[14]  
Blench R., 2019, AN ATLAS OF NIGERIAN LANGUAGES
[15]  
Clements GN, 2008, CAMB APPROACH LANG, P36
[16]  
Creative Commons, 2019, ATTR SHAREALIKE 4 0
[17]  
Crowther Samuel., 1852, A Grammar of the Yoruba Language
[18]   Design of a Yoruba Language Speech Corpus for the Purposes of Text-to-Speech (TTS) Synthesis [J].
Dagba, Theophile K. ;
Aoga, John O. R. ;
Fanou, Codjo C. .
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT I, 2016, 9621 :161-169
[19]  
Fagbolu O., 2015, International Journal of Innovative Science, Engineering and Technology, V2, P2348
[20]  
Fajobi E., 2005, Yorb Creativity: Fiction, Language and Songs, P183