Gaussian Mixture Model Based Classification of Stuttering Dysfluencies

被引:9
作者
Mahesha, P. [1 ]
Vinod, D. S. [2 ]
机构
[1] SJ Coll Engn, Dept Comp Sci & Engn, Mysore, Karnataka, India
[2] SJ Coll Engn, Dept Informat Sci & Engn, Mysore, Karnataka, India
关键词
Dysfluency; EM algorithm; GMM; MFCC; stuttering;
D O I
10.1515/jisys-2014-0140
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The classification of dysfluencies is one of the important steps in objective measurement of stuttering disorder. In this work, the focus is on investigating the applicability of automatic speaker recognition (ASR) method for stuttering dysfluency recognition. The system designed for this particular task relies on the Gaussian mixture model (GMM), which is the most widely used probabilistic modeling technique in ASR. The GMM parameters are estimated from Mel frequency cepstral coefficients (MFCCs). This statistical speaker-modeling technique represents the fundamental characteristic sounds of speech signal. Using this model, we build a dysfluency recognizer that is capable of recognizing dysfluencies irrespective of a person as well as what is being said. The performance of the system is evaluated for different types of dysfluencies such as syllable repetition, word repetition, prolongation, and interjection using speech samples from the University College London Archive of Stuttered Speech (UCLASS).
引用
收藏
页码:387 / 399
页数:13
相关论文
共 50 条
[41]   A two-stage mechanism for registration and classification of ECG using Gaussian mixture model [J].
Martis, Roshan Joy ;
Chakraborty, Chandan ;
Ray, Ajoy K. .
PATTERN RECOGNITION, 2009, 42 (11) :2979-2988
[42]   A Comprehensive Analysis on Breast Cancer Classification with Radial Basis Function and Gaussian Mixture Model [J].
Rajaguru, Harikumar ;
Prabhakar, Sunil Kumar .
16TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING, 2017, 61 :21-27
[43]   A Voice Morphing Model Based on the Gaussian Mixture Model and Generative Topographic Mapping [J].
Rassam, Murad A. ;
Almekhlafi, Rasha ;
Alosaily, Eman ;
Hassan, Haneen ;
Hassan, Reem ;
Saeed, Eman ;
Alqershi, Elham .
EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 :396-406
[44]   Multiscale Sample Entropy-Based Feature Extraction with Gaussian Mixture Model for Detection and Classification of Blue Whale Vocalization [J].
Babalola, Oluwaseyi Paul ;
Ogundile, Olayinka Olaolu ;
Balyan, Vipin .
ENTROPY, 2025, 27 (04)
[45]   MODEL SELECTION FOR GAUSSIAN MIXTURE MODELS [J].
Huang, Tao ;
Peng, Heng ;
Zhang, Kun .
STATISTICA SINICA, 2017, 27 (01) :147-169
[46]   Real Life Emotion Classification using Spectral Features and Gaussian Mixture Models [J].
Koolagudi, Shashidhar G. ;
Barthwal, Anurag ;
Devliyal, Swati ;
Rao, K. Sreenivasa .
INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 :3892-3899
[47]   Real Life Emotion Classification from Speech Using Gaussian Mixture Models [J].
Koolagudi, Shashidhar G. ;
Barthwal, Anurag ;
Devliyal, Swati ;
Rao, K. Sreenivasa .
CONTEMPORARY COMPUTING, 2012, 306 :250-+
[48]   Detection of Unmanned Aerial Vehicle Signal Based on Gaussian Mixture Model [J].
Zhao, Caidan ;
Shi, Mingxian ;
Cai, Zhibiao ;
Chen, Caiyun .
2017 12TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND EDUCATION (ICCSE 2017), 2017, :289-293
[49]   Speaker recognition based on dynamic time warping and Gaussian mixture model [J].
Zhang, Nannan ;
Yao, Yanru .
PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, :1174-1177
[50]   OPTICAL-TO-SAR IMAGE REGISTRATION BASED ON GAUSSIAN MIXTURE MODEL [J].
Wang, Hanyun ;
Wang, Cheng ;
Li, Peng ;
Chen, Ziyi ;
Cheng, Ming ;
Luo, Lun ;
Liu, Yinsheng .
XXII ISPRS CONGRESS, TECHNICAL COMMISSION I, 2012, 39-B1 :179-183