Gaussian Mixture Model Based Classification of Stuttering Dysfluencies

Cited by: 9
Authors
Mahesha, P. [1]
Vinod, D. S. [2]
Affiliations
[1] SJ Coll Engn, Dept Comp Sci & Engn, Mysore, Karnataka, India
[2] SJ Coll Engn, Dept Informat Sci & Engn, Mysore, Karnataka, India
Keywords
Dysfluency; EM algorithm; GMM; MFCC; stuttering
DOI
10.1515/jisys-2014-0140
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The classification of dysfluencies is an important step in the objective measurement of stuttering. This work investigates the applicability of automatic speaker recognition (ASR) methods to stuttering dysfluency recognition. The system designed for this task relies on the Gaussian mixture model (GMM), which is the most widely used probabilistic modeling technique in ASR. The GMM parameters are estimated from Mel-frequency cepstral coefficients (MFCCs). This statistical speaker-modeling technique represents the fundamental characteristic sounds of the speech signal. Using this model, we build a dysfluency recognizer that can recognize dysfluencies irrespective of the speaker and of what is being said. The performance of the system is evaluated for different types of dysfluencies, such as syllable repetition, word repetition, prolongation, and interjection, using speech samples from the University College London Archive of Stuttered Speech (UCLASS).
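As a rough illustration of the approach the abstract describes, the sketch below fits one diagonal-covariance GMM per dysfluency class with the EM algorithm (listed in the keywords) and labels a segment by the model with the highest average log-likelihood. It is not the authors' implementation: the function names, the number of mixture components, the variance floor, and the synthetic 3-dimensional frames standing in for MFCC vectors are all assumptions made for the example.

```python
import math
import random

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at frame x."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))

def logsumexp(vals):
    top = max(vals)
    return top + math.log(sum(math.exp(v - top) for v in vals))

def component_logs(x, gmm):
    """Per-component log(weight * density) for one frame."""
    return [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]

def fit_gmm(frames, k, iters=30, seed=0, floor=1e-3):
    """Fit a k-component diagonal GMM with EM; returns [(weight, mean, var), ...]."""
    rng = random.Random(seed)
    dim = len(frames[0])
    # Initialise means from randomly chosen frames, with unit variances.
    gmm = [(1.0 / k, list(m), [1.0] * dim) for m in rng.sample(frames, k)]
    for _ in range(iters):
        # E-step: responsibility of each component for each frame.
        resp = []
        for x in frames:
            logs = component_logs(x, gmm)
            norm = logsumexp(logs)
            resp.append([math.exp(v - norm) for v in logs])
        # M-step: re-estimate weights, means, and (floored) variances.
        updated = []
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-12
            mean = [sum(r[j] * x[d] for r, x in zip(resp, frames)) / nj
                    for d in range(dim)]
            var = [max(floor, sum(r[j] * (x[d] - mean[d]) ** 2
                                  for r, x in zip(resp, frames)) / nj)
                   for d in range(dim)]
            updated.append((max(nj / len(frames), 1e-12), mean, var))
        gmm = updated
    return gmm

def score(frames, gmm):
    """Average per-frame log-likelihood of a segment under one GMM."""
    return sum(logsumexp(component_logs(x, gmm)) for x in frames) / len(frames)

def classify(frames, models):
    """Pick the class whose GMM gives the segment the highest score."""
    return max(models, key=lambda label: score(frames, models[label]))

def synthetic_frames(center, n, rng):
    """Hypothetical stand-in for MFCC frames: Gaussian noise around a center."""
    return [[rng.gauss(c, 1.0) for c in center] for _ in range(n)]

rng = random.Random(1)
centers = {"repetition": [0.0, 0.0, 0.0], "prolongation": [4.0, 4.0, 4.0]}
models = {label: fit_gmm(synthetic_frames(c, 200, rng), k=2)
          for label, c in centers.items()}
test_segment = synthetic_frames(centers["prolongation"], 50, rng)
predicted = classify(test_segment, models)
```

In a real system the frames would be MFCC vectors extracted from labelled UCLASS segments, and the number of components per class would be tuned on held-out data; the class names and cluster centers here only serve to make the example run.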
Pages: 387-399
Page count: 13
Related papers
50 in total
  • [1] LP-Hillbert Transform Based MFCC for Effective Discrimination of Stuttering Dysfluencies
    Mahesha, P.
    Vinod, D. S.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 2561 - 2565
  • [2] ANN AND SVM BASED RECOGNITION OF THE DYSFLUENCIES OF SPEAKERS WITH STUTTERING
    Palfy, Juraj
    MENDEL 2011 - 17TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING, 2011, : 440 - 447
  • [3] Real-time Traffic Status Classification Based on Gaussian Mixture Model
    Liu, Xiong
    Pan, Li
    Sun, Xiaoliang
    2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC 2016), 2016, : 573 - 578
  • [4] Discriminative Model Selection for Gaussian Mixture Models for Classification
    Liu, Xiao-Hua
    Liu, Cheng-Lin
    2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 62 - 66
  • [5] Research on the Algorithm of Image Classification Based on Gaussian Mixture Model
    Meng, Z.
    Yao, G. Q.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL APPLICATIONS (CISIA 2015), 2015, 18 : 659 - 662
  • [6] Classification of stressed speech using Gaussian mixture model
    Patro, H
    Raja, GS
    Dandapat, S
    INDICON 2005 Proceedings, 2005, : 342 - 346
  • [7] Gaussian Mixture Model with Semantic Distance for Image Classification
    Wu, Wei
    Gao, Guanglai
    Nie, Jianyun
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 1687 - 1691
  • [8] Contextual classification for smart machining based on unsupervised machine learning by Gaussian mixture model
    Wang, Zhiqiang
    Ritou, Mathieu
    Da Cunha, Catherine
    Furet, Benoit
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2020, 33 (10-11) : 1042 - 1054
  • [9] Discriminative Training of Subspace Gaussian Mixture Model for Pattern Classification
    Liu, Xiao-Hua
    Liu, Cheng-Lin
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, 2010, 6215 : 213 - 221
  • [10] Classification of speech dysfluencies with MFCC and LPCC features
    Ai, Ooi Chia
    Hariharan, M.
    Yaacob, Sazali
    Chee, Lim Sin
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) : 2157 - 2165