An iterative algorithm for decomposition of speech signals into periodic and aperiodic components

Cited by: 59
Authors
Yegnanarayana, B [1]
d'Alessandro, C [1]
Darsinos, V [1]
Affiliations
[1] Indian Inst Technol, Dept Comp Sci & Engn, Madras 600036, Tamil Nadu, India
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1998 / Vol. 6 / No. 1
Keywords
periodic and aperiodic decomposition; spectral extrapolation; spectral modeling; speech analysis/synthesis; voice source analysis
DOI
10.1109/89.650304
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The speech signal may be considered as the output of a time-varying vocal tract system excited with quasiperiodic and/or random sequences of pulses. The quasiperiodic part may be considered as the deterministic or periodic component and the random part as the stochastic or aperiodic component of the excitation. In this paper, we discuss issues involved in identifying and separating the periodic and aperiodic components of the source. The decomposition is performed on an approximation to the excitation signal, instead of decomposing the speech signal directly. The linear prediction residual signal is used as an approximation to the excitation signal of the vocal tract system. Speech is first analyzed to determine the voiced and unvoiced parts of the signal. Decomposition of the voiced part into periodic and aperiodic components is then accomplished by first identifying the frequency regions of harmonic and noise components in the spectral domain. The signal corresponding to the noise regions is used as a first approximation to the aperiodic component. An iterative algorithm is proposed which reconstructs the aperiodic component in the harmonic regions. The periodic component is obtained by subtracting the reconstructed aperiodic component signal from the residual signal. The individual components of the residual are then used to excite the derived all-pole model of the vocal tract system to obtain the corresponding components of the speech signal. Experiments were conducted using synthetic speech. They demonstrated the ability of the algorithm to decompose a synthetic speech signal made of a mixture of periodic and aperiodic components. Application to natural speech is also discussed.
Pages: 1-11
Number of pages: 11
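The decomposition step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the details of the iteration, so the code below assumes a Papoulis-Gerchberg-style spectral extrapolation (consistent with the "spectral extrapolation" keyword): the spectrum in the noise regions is held fixed, and the estimate is forced to be zero outside the analysis frame in the time domain. The function names, the fixed half-width of the harmonic regions, and the assumption of a known fundamental frequency `f0` are all hypothetical choices made for illustration, not the authors' procedure.

```python
import numpy as np

def noise_region_mask(n_fft, fs, f0, half_width_hz):
    """Boolean mask over FFT bins: True in noise regions, False near harmonics.

    Assumption: harmonic regions are bands of fixed half-width around
    multiples of a known f0 (the paper identifies them from the signal).
    """
    freqs = np.abs(np.fft.fftfreq(n_fft, d=1.0 / fs))
    # Distance of each bin to the nearest multiple of f0 (including DC).
    dist = np.abs(freqs - f0 * np.round(freqs / f0))
    return dist > half_width_hz

def decompose_residual(residual, noise_mask, n_fft, n_iter=50):
    """Split an LP-residual frame into (periodic, aperiodic) parts.

    Assumed iteration: alternate between a finite-duration constraint in
    time and restoring the known noise-region bins in frequency.
    """
    n = len(residual)
    if n_fft < n or noise_mask.shape != (n_fft,):
        raise ValueError("n_fft must be >= frame length and match the mask")
    spectrum = np.fft.fft(residual, n_fft)          # zero-padded DFT
    known = np.where(noise_mask, spectrum, 0.0)     # first approximation
    estimate = known.copy()
    for _ in range(n_iter):
        x = np.fft.ifft(estimate)
        x[n:] = 0.0                                 # finite-duration constraint
        estimate = np.fft.fft(x)
        estimate[noise_mask] = known[noise_mask]    # keep measured noise bins
    aperiodic = np.fft.ifft(estimate).real[:n]
    periodic = residual - aperiodic                 # subtraction, as in the abstract
    return periodic, aperiodic

# Usage on a synthetic residual: pulse train (f0 = 100 Hz) plus white noise.
rng = np.random.default_rng(0)
fs, f0, n = 8000, 100.0, 256
frame = (np.arange(n) % int(fs / f0) == 0).astype(float)
frame += 0.1 * rng.standard_normal(n)
mask = noise_region_mask(512, fs, f0, half_width_hz=20.0)
periodic, aperiodic = decompose_residual(frame, mask, n_fft=512)
```

In a full analysis/synthesis system, each component would then be passed through the all-pole vocal tract filter obtained from linear prediction to recover the corresponding speech components; that stage is omitted here.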