Speech Recognition of Moroccan Dialect Using Hidden Markov Models

被引:11
作者
Mouaz, Bezoui [1 ]
Abderrahim, Beni Hssane [1 ]
Abdelmajid, Elmoutaouakkil [1 ]
机构
[1] Chouaib Doukkali Univ, Dept Comp Sci, El Jadida, Morocco
来源
10TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2019) / THE 2ND INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40 2019) / AFFILIATED WORKSHOPS | 2019年 / 151卷
关键词
ASR; DA; MSA; HMM; MFCC;
D O I
10.1016/j.procs.2019.04.138
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper addresses the development of an Automatic Speech Recognition (ASR) system for the Moroccan Dialect. Dialectal Arabic (DA) refers to the day-to-day vernaculars spoken in the Arab world. In fact, Moroccan Dialect is very different from the Modern Standard Arabic (MSA) because it is highly influenced by the French Language. It is observed throughout all Arab countries that standard Arabic widely written and used for official speech, news papers, public administration and school but not used in everyday conversation and dialect is widely spoken in everyday life but almost never written. we propose to use the Mel Frequency Cepstral Coefficient (MFCC) features to specify the best speaker identification system. The extracted speech features are quantized to a number of centroids using vector quantization algorithm. These centroids constitute the codebook of that speaker. MFCC's are calculated in training phase and again in testing phase. Speakers uttered same words once in a training session and once in a testing session later. The Euclidean distance between the MFCC's of each speaker in training phase to the centroids of individual speaker in testing phase is measured and the speaker is identified according to the minimum Euclidean distance. The code is developed in the MATLAB environment and performs the identification satisfactorily. (C) 2019 The Authors. Published by Elsevier B.V.
引用
收藏
页码:985 / 991
页数:7
相关论文
共 14 条
  • [1] Afify M, 2006, INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, P277
  • [2] Ali A., 2014, INT WORKSH SPOK LANG
  • [3] Automatic Dialect Detection in Arabic Broadcast Speech
    Ali, Ahmed
    Dehak, Najim
    Cardinal, Patrick
    Khurana, Sameer
    Yella, Sree Harsha
    Glass, James
    Bell, Peter
    Renals, Steve
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2934 - 2938
  • [4] [Anonymous], 2005, P INT CIT
  • [5] Connel S., 1996, THESIS
  • [6] Elmahdy M., 2012, P ESOLEC
  • [7] Elmahdy M, 2014, LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P3057
  • [8] Harrat Salima, 2016, IJACSA INT J ADV COM, V7
  • [9] Menacer Mohamed Amine, 2017, 3 INT C AR COMP LING
  • [10] Rabiner L., 1993, Fundamentals of Speech Recognition