Study on pharyngeal and uvular consonants in foreign accented Arabic for ASR

被引:13
作者
Alotaibi, Yousef Ajami [1 ]
Muhammad, Ghulam [1 ]
机构
[1] King Saud Univ, Dept Comp Engn, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
关键词
Arabic; Foreign accents; HMMs; Pharyngeal; Uvular; Speech recognition; HIDDEN MARKOV-MODELS; SPEECH;
D O I
10.1016/j.csl.2009.04.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the unique pharyngeal and uvular consonants of Arabic from the point of view of automatic speech recognition (ASR). Comparisons of the recognition error rates for these phonemes are analyzed in five experiments that involve different combinations of native and non-native Arabic speakers. The most three confusing consonants for every investigated consonant are discussed. All experiments use the Hidden Markov Model Toolkit (HTK) and the Language Data Consortium (LDC) WestPoint Modern Standard Arabic (MSA) database. Results confirm that these Arabic distinct consonants are a major source of difficulty for Arabic ASR. While the recognition rate for certain of these unique consonants such as /h/ can drop below 35% when uttered by non-native speakers, there is advantage to include non-native speakers in ASR. Besides, regional differences in pronunciation of MSA by native Arabic speakers require the attention of Arabic ASR research. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:219 / 231
页数:13
相关论文
共 29 条
[1]  
AIZABIBI M, 1990, ACOUSTICPHONETIC APP
[2]   The effect of teaching English phonotactics on the lexical segmentation of English as a foreign language [J].
Al-jasser, Faisal .
SYSTEM, 2008, 36 (01) :94-106
[3]  
Al-Muhtaseb Husni, 2000, 3 WORKSH COMP INF SC, P73
[4]  
Alghamdi M., 2001, ARABIC PHONETICS
[5]  
Alghamdi Mansour, 2004, ANAL SYNTHESIS PERCE
[6]  
Alkhouli M, 1990, ALASWAAT ALAGHAWAIYA
[7]   Experiments on Automatic Recognition of Nonnative Arabic Speech [J].
Alotaibi, Yousef Ajami ;
Selouani, Sid-Ahmed ;
O'Shaughnessy, Douglas .
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2008, 2008 (1)
[8]  
Awais MM, 2007, LECT NOTES COMPUT SC, V4681, P897
[9]  
BISHER KM, 1990, ARABIC PHONETICS
[10]  
BORDEN GJ, 1990, SPEECH SCI PRIMER DA