Developing an Open-Source Corpus of Yoruba Speech

被引:14
作者
Gutkin, Alexander [1 ]
Demirsahin, Isin [1 ]
Kjartansson, Oddur [1 ]
Rivera, Clara [1 ]
Tnbastin, Kola [2 ]
机构
[1] Google Res, London, England
[2] British Lib, London, England
来源
INTERSPEECH 2020 | 2020年
关键词
speech corpora; open-source; West Africa;
D O I
10.21437/Interspeech.2020-1096
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper introduces an open-source speech dataset for Yoruba - one of the largest low-resource West African languages spoken by at least 22 million people. Yoruba is one of the official languages of Nigeria, Benin and Togo, and is spoken in other neighboring African countries and beyond. The corpus consists of over four hours of 48 kHz recordings from 36 male and female volunteers and the corresponding transcriptions that include disfluency annotation. The transcriptions have full diacritization, which is vital for pronunciation and lexical disambiguation. The annotated speech dataset described in this paper is primarily intended for use in text-to-speech systems, serve as adaptation data in automatic speech recognition and speech-to-speech translation, and provide insights in West African corpus linguistics. We demonstrate the use of this corpus in a simple statistical parametric speech synthesis (SPSS) scenario evaluating it against the related languages from the CMU Wilderness dataset and the Yoruba Lagos-NWU corpus.
引用
收藏
页码:404 / 408
页数:5
相关论文
共 47 条
[1]  
Adediran B, 1994, FRONTIER STATES W YO
[2]   Development of Standard YorA(1)ba speech-to-text system using HTK [J].
Adetunmbi, O. A. ;
Obe, O. O. ;
Iyanda, J. N. .
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (04) :929-944
[3]  
Agic E, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P3204
[4]  
Alabi JO, 2020, Arxiv, DOI arXiv:1912.02481
[5]  
[Anonymous], 2006, ATLAS REGIONAL INTEG
[6]  
[Anonymous], 2018, BBC News
[7]  
[Anonymous], 2012, PROC SLT 2012
[8]   Integration of Yoruba language into MaryTTS [J].
Aoga J.O.R. ;
Dagba T.K. ;
Fanou C.C. .
International Journal of Speech Technology, 2016, 19 (01) :151-158
[9]  
Ayogu I. I., 2018, ASIAN J RES COMPUTER, P1
[10]  
Bamgbose A., 1966, W AFRICAN LANGUAGE M, V5