Support vector machine-based stuttering dysfluency classification using GMM supervectors

被引:22
作者
Mahesha, P. [1 ]
Vinod, D. S. [1 ]
机构
[1] SJ Coll Engn, Dept Comp Sci & Engn, Mysore, Karnataka, India
关键词
GMM; Gaussian mixture model; SVM; support vector machine; stuttering dysfluency; cepstral;
D O I
10.1504/IJGUC.2015.070680
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is generally acknowledged that recognition and classification of dysfluencies are an important criterion in the objective and accurate assessment of stuttered speech. For this reason, there is a growing interest in the application of Automatic Speech Recognition (ASR) technology to automate the dysfluency recognition. In this perspective, several studies have been carried out on the classification of dysfluencies by means of acoustic analysis, parametric and nonparametric feature extraction and statistical methods. This work is focused on introducing and evaluating Support Vector Machine (SVM) based dysfluency recognition system using a Gaussian Mixture Model (GMM) supervector. The experimental evaluation of the proposed system reveals that an SVM-based GMM supervector is effective for dysfluency classification. We have obtained substantial improvements in the performance by considering cepstral and their delta features.
引用
收藏
页码:143 / 149
页数:7
相关论文
共 30 条
[1]  
[Anonymous], 2010, J COMPUT
[2]  
BLOODSTEIN O, 1995, HDB STUTTERING
[3]   Support vector machines using GMM supervectors for speaker verification [J].
Campbell, WM ;
Sturim, DE ;
Reynolds, DA .
IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (05) :308-311
[4]   MFCC based Recognition of Repetitions and Prolongations in Stuttered Speech using k-NN and LDA [J].
Chee, Lim Sin ;
Ai, Ooi Chia ;
Hariharan, M. ;
Yaacob, Sazali .
2009 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT: SCORED 2009, PROCEEDINGS, 2009, :146-149
[5]   COMPARISON OF PROCEDURES FOR SCALING SEVERITY OF STUTTERING [J].
CULLINAN, WL ;
PRATHER, EM ;
WILLIAMS, DE .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1963, 6 (02) :187-194
[6]   Intelligent processing of stuttered speech [J].
Czyzewski, A ;
Kaczmarek, A ;
Kostek, B .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2003, 21 (02) :143-171
[7]   Fast approximation of Kullback-Leibler distance for dependence trees and hidden Markov models [J].
Do, MN .
IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (04) :115-118
[8]  
Feng Liu, 2004, THESIS
[9]   Dimensionality reduction of a pathological voice quality assessment system based on Gaussian mixture models and short-term cepstral parameters [J].
Godino-Llorente, Juan Ignacio ;
Gomez-Vilda, Pedro ;
Blanco-Velasco, Manuel .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2006, 53 (10) :1943-1953
[10]   Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter .2. ANN recognition of repetitions and prolongations with supplied word segment markers [J].
Howell, P ;
Sackin, S ;
Glenn, K .
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 1997, 40 (05) :1085-1096