Speech formant frequency and bandwidth tracking using multiband energy demodulation

被引:83
作者
Potamianos, A [1 ]
Maragos, P [1 ]
机构
[1] GEORGIA INST TECHNOL,SCH ELECT & COMP ENGN,ATLANTA,GA 30332
关键词
D O I
10.1121/1.414997
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, the amplitude and frequency (AM-FM) modulation model and a multiband demodulation analysis scheme are applied to formant frequency and bandwidth tracking of speech signals. Filtering by a bank of Gabor bandpass filters is performed to isolate each speech resonance in the signal. Next, the amplitude envelope (AM) and instantaneous frequency (FM) are estimated for each band using the energy separation algorithm (ESA). Short-time formant frequency and bandwidth estimates are obtained from the instantaneous amplitude and frequency signals; two frequency estimates are proposed and their relative merits are discussed. The short-time estimates are used to compute the formant locations and bandwidths. Performance and computational issues of the algorithm are discussed. Overall, multiband demodulation analysis (MDA) is shown to be a useful tool for extracting information from the speech resonances in the time-frequency plane. (C) 1996 Acoustical Society of America.
引用
收藏
页码:3795 / 3806
页数:12
相关论文
共 27 条
[1]  
[Anonymous], SPEECH COMMUN
[2]   ESTIMATING AND INTERPRETING THE INSTANTANEOUS FREQUENCY OF A SIGNAL .1. FUNDAMENTALS [J].
BOASHASH, B .
PROCEEDINGS OF THE IEEE, 1992, 80 (04) :520-538
[3]   AM-FM ENERGY DETECTION AND SEPARATION IN NOISE USING MULTIBAND ENERGY OPERATORS [J].
BOVIK, AC ;
MARAGOS, P ;
QUATIERI, TF .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (12) :3245-3265
[4]  
COHEN L, 1992, TIME FREQUENCY SIGNA
[5]   FORMANT ESTIMATION ALGORITHM BASED ON POLE FOCUSING OFFERING IMPROVED NOISE TOLERANCE AND FEATURE RESOLUTION [J].
DUNCAN, G ;
JACK, MA .
IEE PROCEEDINGS-F RADAR AND SIGNAL PROCESSING, 1988, 135 (01) :18-32
[6]  
FRIEDMAN DH, 1985, P INT C AC SPEECH SI, P1121
[7]  
Gabor D., 1946, Journal of the Institution of Electrical Engineers-Part III: Radio and Communication Engineering, V93, P429, DOI DOI 10.1049/JI-3-2.1946.0074
[8]   A System for Finding Speech Formants and Modulations via Energy Separation [J].
Hanson, Helen M. ;
Maragos, Petros ;
Potamianos, Alexandros .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03) :436-443
[9]  
KAISER JF, 1990, IEEE DSP WORKSH NEW
[10]   FORMANT TRACKING USING HIDDEN MARKOV-MODELS AND VECTOR QUANTIZATION [J].
KOPEC, GE .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (04) :709-729