Feature analysis of pathological speech signals using local discriminant bases technique

被引:22
作者
Umapathy, K
Krishnan, S [1 ]
机构
[1] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON, Canada
[2] Univ Western Ontario, Dept Elect & Comp Engn, London, ON, Canada
关键词
local discriminant bases; dissimilarity measures; wavelet packets; linear discriminant analysis; pathological speech signals;
D O I
10.1007/BF02344726
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Speech is an integral part of the human communication system. Various pathological conditions affect the vocal functions, inducing speech disorders. Acoustic parameters of speech are commonly used for the assessment of speech disorders and for monitoring the progress of the patient over the course of therapy. In the last two decades, signal-processing techniques have been successfully applied in screening speech disorders. In the paper, a novel approach is proposed to classify pathological speech signals using a local discriminant bases (LDB) algorithm and wavelet packet decompositions. The focus of the paper was to demonstrate the significance of identifying the signal subspaces that contribute to the discriminatory characteristics of normal and pathological speech signals in a computationally efficient way. Features were extracted from target subspaces for classification, and time-frequency decomposition was used to eliminate the need for segmentation of the speech signals. The technique was tested with a database of 212 speech signals (51 normal and 161 pathological) using the Daubechies wavelet (db4). Classification accuracies up to 96% were achieved for a two-group classification as normal and pathological speech signals, and 74% was achieved for a four-group classification as male normal, female normal, male pathological and female pathological signals.
引用
收藏
页码:457 / 464
页数:8
相关论文
共 36 条
[1]  
AGBINYA JI, 1996, P 1996 IEEE REG 10 T, V2, P514
[2]  
[Anonymous], 1997, A Wavelet Tour of Signal Processing
[3]   Frequency domain linear prediction for temporal features [J].
Athineos, M ;
Ellis, DPW .
ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, :261-266
[4]  
Baken RJ, 2000, CLIN MEASUREMENT SPE
[5]  
CHRISTIAN B, 2002, P IEEE SENSORS, V2, P1654
[6]   ENTROPY-BASED ALGORITHMS FOR BEST BASIS SELECTION [J].
COIFMAN, RR ;
WICKERHAUSER, MV .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (02) :713-718
[7]   ACOUSTIC CORRELATES OF VOCAL QUALITY [J].
ESKENAZI, L ;
CHILDERS, DG ;
HICKS, DM .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1990, 33 (02) :298-306
[8]  
Fukunaga K., 1990, INTRO STAT PATTERN R
[9]   Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors [J].
Godino-Llorente, JI ;
Gómez-Vilda, P .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2004, 51 (02) :380-384
[10]   PERCEPTUAL AND ACOUSTIC CORRELATES OF ABNORMAL VOICE QUALITIES [J].
HAMMARBERG, B ;
FRITZELL, B ;
GAUFFIN, J ;
SUNDBERG, J ;
WEDIN, L .
ACTA OTO-LARYNGOLOGICA, 1980, 90 (5-6) :441-451