EMARATI SPEAKER IDENTIFICATION

被引:0
作者
Shahin, Ismail [1 ]
Ba-Hutair, Mohammed Nasser [1 ]
机构
[1] Univ Sharjah, Dept Elect & Comp Engn, Sharjah, U Arab Emirates
来源
2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) | 2014年
关键词
Emarati speech database; Gaussian mixture models; hidden Markov models; speaker identification; vector quantization; RECOGNITION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this work we focus on Emarati speaker identification systems in neutral talking environments based on each of Vector Quantization (VQ), Gaussian Mixture Models (GMMs), and Hidden Markov Models (HMMs) as classifiers. These systems have been tested on our collected Emarati speech database which is composed of 25 male and 25 female Emarati speakers using MelFrequency Cepstral Coefficients (MFCCs). Our results yield an average text-dependent Emarati speaker identification performance of 100.00%, %99.81, and 99.69% based on VQ, GMMs, and HMMs, respectively. For text-independent systems, the average Emarati speaker identification performance based on VQ, GMMs, and HMMs is 94.48%, 86.55%, and 74.83%, respectively. The achieved results based on VQ are close to those obtained in subjective assessment by human listeners.
引用
收藏
页码:488 / 493
页数:6
相关论文
共 15 条
  • [1] ALDAHRI SS, 2008, SIGNAL PROCESSING IN, P198
  • [2] [Anonymous], 2010, PROC 5 INT C DESIGN
  • [3] [Anonymous], 1990, Hidden markov models for speech recognition
  • [4] ISOLATED WORD RECOGNITION USING MARKOV-CHAIN MODELS
    DAI, JN
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (06): : 458 - 463
  • [5] Modulation Spectral Features for Robust Far-Field Speaker Identification
    Falk, Tiago H.
    Chan, Wai-Yip
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 90 - 100
  • [6] Speaker Recognition Using Neural Networks and Conventional Classifiers
    Farrell, Kevin R.
    Mammone, Richard J.
    Assaleh, Khaled T.
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 194 - 205
  • [7] SPEAKER-DEPENDENT-FEATURE EXTRACTION, RECOGNITION AND PROCESSING TECHNIQUES
    FURUI, S
    [J]. SPEECH COMMUNICATION, 1991, 10 (5-6) : 505 - 520
  • [8] Kandali A. B., 2008, TENCON 2008 2008 IEE, P1
  • [9] Maesa A., 2012, Journal of Information Security, V3, P335
  • [10] ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS
    REYNOLDS, DA
    ROSE, RC
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01): : 72 - 83