Vowel Onset Point Detection Using Source, Spectral Peaks, and Modulation Spectrum Energies

Cited by: 101
Authors
Prasanna, S. R. Mahadeva [1]
Reddy, B. V. Sandeep [2]
Krishnamoorthy, P. [1]
Affiliations
[1] Indian Inst Technol Guwahati, Dept Elect & Commun Engn, Gauhati 781039, Assam, India
[2] Automat Speech Recognit ASR Syst Applicat Technol, Noida 201301, India
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2009 / Vol. 17 / No. 04
Keywords
Modulation spectrum and combining; source; spectral peaks; vowel onset point (VOP); SPEECH RECOGNITION; LINEAR PREDICTION; MODEL;
DOI
10.1109/TASL.2008.2010884
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206; 082403;
Abstract
The vowel onset point (VOP) is the instant at which the onset of a vowel takes place during speech production. Significant changes occur in the energies of the excitation source, spectral peaks, and modulation spectrum at the VOP. This paper demonstrates the independent use of each of these three energies in detecting VOPs. Since each of these energies represents a different aspect of speech production, they may contain complementary information about the VOP. The individual evidences are therefore combined for detecting the VOPs. The error rates, measured as the ratio of missing and spurious VOPs to the total number of VOPs and evaluated on sentences taken from the TIMIT database, are 6.92%, 8.8%, 6.13%, and 4.0% for source, spectral peaks, modulation spectrum, and combined information, respectively. The combined method thus lowers the error rate by 2.13 percentage points compared to the best-performing individual VOP detection method.
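The error measure described in the abstract (missing plus spurious detections, divided by the total number of reference VOPs) can be illustrated with a minimal Python sketch. This is not the authors' evaluation code: the function name `vop_error_rate`, the one-to-one matching rule, and the 40 ms tolerance window are assumptions made only for illustration.

```python
def vop_error_rate(reference, detected, tolerance=0.04):
    """Return (missing, spurious, error_rate_percent).

    reference, detected: VOP times in seconds.
    tolerance: assumed matching window in seconds (hypothetical value,
    not taken from the record above).
    """
    ref = sorted(reference)
    det = sorted(detected)
    if not ref:
        raise ValueError("need at least one reference VOP")

    i = j = matched = 0
    # Walk both sorted lists, pairing each reference VOP with at most
    # one detection that lies within the tolerance window.
    while i < len(ref) and j < len(det):
        if abs(det[j] - ref[i]) <= tolerance:
            matched += 1
            i += 1
            j += 1
        elif det[j] < ref[i]:
            j += 1  # detection with no nearby reference: spurious
        else:
            i += 1  # reference with no nearby detection: missing

    missing = len(ref) - matched
    spurious = len(det) - matched
    rate = 100.0 * (missing + spurious) / len(ref)
    return missing, spurious, rate


if __name__ == "__main__":
    reference = [0.10, 0.50, 0.90, 1.30]
    detected = [0.11, 0.52, 0.70, 1.31]
    print(vop_error_rate(reference, detected))
```

In this toy example one reference VOP (0.90 s) goes undetected and one detection (0.70 s) matches nothing, so two errors are counted against four reference VOPs.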
Pages: 556-565
Number of pages: 10