ALGERIAN ARABIC SPEECH DATABASE (ALGASD): CORPUS DESIGN AND AUTOMATIC SPEECH RECOGNITION APPLICATION

被引:0
作者
Droua-Hamdani, Ghania [1 ]
Selouani, Sid Ahmed [2 ]
Boudraa, Malika [3 ]
机构
[1] CRSTDLA, Speech Proc Lab, Algiers, Algeria
[2] Univ Moncton, LARIHS Lab, Moncton, NB E1A 3E9, Canada
[3] USTHB Univ, Speech Commun Lab, Algiers, Algeria
来源
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING | 2010年 / 35卷 / 2C期
关键词
speech corpus; Algerian speakers; modern standard Arabic; automatic speech recognition; hidden Markov models;
D O I
暂无
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents the Algerian Arabic Speech Database (ALGASD), a Modern Standard Arabic (MSA) speech corpus composed of utterances pronounced by 300 Algerian native speakers selected from eleven regions of Algeria. One of the objectives of this corpus design is to be representative of the regional accents of MSA spoken in Algeria. Useful information related to the speakers, such as gender, age, and education level, is provided. This paper also reports the results of the Automatic Speech Recognition (ASR) application of the corpus and outlines an original global monophone recognition model designed to handle linguistic variability. The global phone recognition rate for this ASR reference system is satisfactory and may constitute a useful baseline ASR system dedicated to MSA.
引用
收藏
页码:157 / 166
页数:10
相关论文
共 21 条
  • [1] ALGHAMDI M, 2008, J SAUD U COMPUTER H, V1429, P43
  • [2] Alotaibi YA, 2010, LECT NOTES COMPUT SC, V6134, P122, DOI 10.1007/978-3-642-13681-8_15
  • [3] Experiments on Automatic Recognition of Nonnative Arabic Speech
    Alotaibi, Yousef Ajami
    Selouani, Sid-Ahmed
    O'Shaughnessy, Douglas
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2008, 2008 (1)
  • [4] BONNOT JF, 1979, TRAVAUX I PHONETIQUE, V11, P109
  • [5] Boudraa M., 1998, ACUSTICA ACTA ACUSTI, V86
  • [6] Caubet D., 2000, ESTUDIOS DIALECTOLOG, P73
  • [7] CHOUKRI K, 2004, P 4 INT C LANG RES E
  • [8] DROUAHAMADANI G, 2009, INT VAR AR V IVA09 S
  • [9] DROUAHAMDANI G, 2010, TUT RES WORKSH EXP L
  • [10] ELMAHDY M, 2010, INT C SPEECH LANG PR