Robust classification of stop consonants using auditory-based speech processing

被引:0
作者
Ali, AMA [1 ]
Van der Spiegel, J [1 ]
Mueller, P [1 ]
机构
[1] Texas Instruments Inc, Warren, NJ 07059 USA
来源
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM | 2001年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this work, a feature-based system for the automatic classification of stop consonants, in speaker independent continuous speech, is reported. The system uses a new auditory-based speech processing front-end that is based on the biologically rooted property of average localized synchrony detection (ALSD). It incorporates new algorithms for the extraction and manipulation of the acoustic-phonetic features that proved, statistically, to be rich in their information content. The experiments are performed on stop consonants extracted from the TIMIT database with additive white Gaussian noise at various signal-to-noise ratios. The obtained classification accuracy compares favorably with previous work. The results also showed a consistent improvement of 3% in the place detection over the Generalized Synchrony Detector (GSD) system under identical circumstances on clean and noisy speech. This illustrates the superior ability of the ALSD to suppress the spurious peaks and produce a consistent and robust formant (peak) representation.
引用
收藏
页码:81 / 84
页数:4
相关论文
共 15 条
[1]  
Ali A. M. A., 1998, J ACOUST SOC AM, V103, P2777
[2]  
ALI AMA, 2000, P ICASSP 2000, P1623
[3]  
ALI AMA, 1999, THESIS U PENN
[4]  
Bush M. A., 1983, P ICASSP
[5]   SPEAKER-INDEPENDENT CONSONANT CLASSIFICATION IN CONTINUOUS SPEECH WITH DISTINCTIVE FEATURES AND NEURAL NETWORKS [J].
DEMORI, R ;
FLAMMIA, G .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1993, 94 (06) :3091-3103
[6]   STOP-CONSONANT RECOGNITION - RELEASE BURSTS AND FORMANT TRANSITIONS AS FUNCTIONALLY EQUIVALENT, CONTEXT-DEPENDENT CUES [J].
DORMAN, MF ;
STUDDERTKENNEDY, M ;
RAPHAEL, LJ .
PERCEPTION & PSYCHOPHYSICS, 1977, 22 (02) :109-122
[7]   Auditory Models and Human Performance in Tasks Related to Speech Coding and Speech Recognition [J].
Ghitza, Oded .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01) :115-132
[8]   A COMPARISON OF SIGNAL-PROCESSING FRONT-ENDS FOR AUTOMATIC WORD RECOGNITION [J].
JANKOWSKI, CR ;
VO, HDH ;
LIPPMANN, RP .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (04) :286-293
[9]   Time-Varying Feature Selection and Classification of Unvoiced Stop Consonants [J].
Nathan, Krishna S. ;
Silverman, Harvey F. .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03) :395-405
[10]  
OHSHIMA Y, 1993, THESIS CARNEGIE MELL