An iterative algorithm for decomposition of speech signals into periodic and aperiodic components

Cited by: 59

Authors:

Yegnanarayana, B [1]

d'Alessandro, C [1]

Darsinos, V [1]

Affiliations:

[1] Indian Inst Technol, Dept Comp Sci & Engn, Madras 600036, Tamil Nadu, India

Source:

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1998 / Vol. 6 / No. 1

Keywords:

periodic and aperiodic decomposition; spectral extrapolation; spectral modeling; speech analysis/synthesis; voice source analysis;

DOI:

10.1109/89.650304

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

The speech signal may be considered as the output of a time-varying vocal tract system excited with quasiperiodic and/or random sequences of pulses. The quasiperiodic part may be considered as the deterministic or periodic component, and the random part as the stochastic or aperiodic component, of the excitation. In this paper, we discuss issues involved in identifying and separating the periodic and aperiodic components of the source. The decomposition is performed on an approximation to the excitation signal, instead of decomposing the speech signal directly. The linear prediction residual signal is used as an approximation to the excitation signal of the vocal tract system. Speech is first analyzed to determine the voiced and unvoiced parts of the signal. Decomposition of the voiced part into periodic and aperiodic components is then accomplished by first identifying the frequency regions of harmonic and noise components in the spectral domain. The signal corresponding to the noise regions is used as a first approximation to the aperiodic component. An iterative algorithm is proposed which reconstructs the aperiodic component in the harmonic regions. The periodic component is obtained by subtracting the reconstructed aperiodic component signal from the residual signal. The individual components of the residual are then used to excite the derived all-pole model of the vocal tract system to obtain the corresponding components of the speech signal. Experiments were conducted using synthetic speech. They demonstrated the ability of the algorithm to decompose a synthetic speech signal made of a mixture of periodic and aperiodic components. Application to natural speech is also discussed.
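The iterative reconstruction step described in the abstract can be illustrated with a minimal sketch. The abstract does not state the constraints applied in each iteration, so this example assumes a Papoulis-Gerchberg style spectral extrapolation (suggested by the "spectral extrapolation" keyword): the estimate alternates between a finite-duration constraint in the time domain and re-imposing the known noise-region bins in the frequency domain. The function names `noise_mask_from_f0` and `decompose_residual`, the fixed-width harmonic regions, and the iteration count are all hypothetical choices for illustration, not the authors' published procedure.

```python
import numpy as np

def noise_mask_from_f0(f0, fs, n_fft, half_width_hz):
    """Hypothetical helper: mark DFT bins farther than half_width_hz
    from every multiple of f0 as noise-region bins (True)."""
    freqs = np.abs(np.fft.fftfreq(n_fft, d=1.0 / fs))
    # Distance from each bin frequency to the nearest multiple of f0.
    dist = np.abs(freqs - f0 * np.round(freqs / f0))
    return dist > half_width_hz

def decompose_residual(residual, noise_mask, n_fft, n_iter=20):
    """Split one frame of an LP residual into (periodic, aperiodic) parts.

    Assumed scheme: start from the noise-region bins only, then iterate
    between a finite-duration constraint and the known noise-region bins.
    """
    n = len(residual)
    spectrum = np.fft.fft(residual, n_fft)
    # First approximation: keep noise-region bins, zero the harmonic regions.
    estimate = np.where(noise_mask, spectrum, 0.0)
    for _ in range(n_iter):
        x = np.fft.ifft(estimate)
        x[n:] = 0.0                                  # finite-duration constraint
        estimate = np.fft.fft(x)
        estimate[noise_mask] = spectrum[noise_mask]  # re-impose known bins
    aperiodic = np.fft.ifft(estimate).real[:n]
    periodic = residual - aperiodic                  # subtraction, as in the abstract
    return periodic, aperiodic

# Usage on a toy frame: one harmonic of f0 plus weak white noise.
rng = np.random.default_rng(0)
fs, f0, n, n_fft = 8000, 100.0, 256, 512
t = np.arange(n) / fs
frame = np.cos(2 * np.pi * 3 * f0 * t) + 0.1 * rng.standard_normal(n)
mask = noise_mask_from_f0(f0, fs, n_fft, half_width_hz=20.0)
periodic, aperiodic = decompose_residual(frame, mask, n_fft)
```

In a complete pipeline each component would then be passed through the all-pole filter obtained from linear prediction to produce the corresponding speech components; that step is omitted here. The DFT length is chosen larger than the frame so that the time-domain constraint has samples to zero out.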


Pages: 1-11

Page count: 11
