A CEPSTRUM-BASED TECHNIQUE FOR DETERMINING A HARMONICS-TO-NOISE RATIO IN SPEECH SIGNALS

Cited by: 207
Authors
DEKROM, G
Source
JOURNAL OF SPEECH AND HEARING RESEARCH | 1993 / Vol. 36 / No. 02
Keywords
HARMONICS-TO-NOISE RATIO (HNR); CEPSTRUM; VOICE QUALITY ASSESSMENT; SYNTHETIC VOICE SIGNALS
DOI
10.1044/jshr.3602.254
Chinese Library Classification (CLC) number
H [Language and Writing]
Discipline classification code
05
Abstract
A new method to calculate a spectral harmonics-to-noise ratio (HNR) in speech signals is presented. The method involves discrimination between harmonic and noise energy in the magnitude spectrum by means of a comb-liftering operation in the cepstrum domain. Sensitivity of HNR to (a) additive noise and (b) jitter was tested with synthetic vowel-like signals, generated at 10 fundamental frequencies. All jitter and noise signals were analyzed at three window lengths in order to investigate the effect of the length of the analysis frame on the estimated HNR values. Results of a multiple linear regression analysis with noise or jitter, F0, and window length as predictors for HNR indicate a major effect of both noise and jitter on HNR, in that HNR decreases almost linearly with increasing noise levels or increasing jitter. The influence of F0 and window length on HNR is small for the jittered signals, but HNR increases considerably with increasing F0 or window length for the noise signals. We conclude that the method seems to be a valid technique for determining the amount of spectral noise, because it is almost linearly sensitive to both noise and jitter for a large part of the noise or jitter continuum. The strong negative relation between HNR and jitter illustrates that spectral noise measures cannot simply be taken as indicators of the actual amount of noise in the time signal. Instead, HNR integrates several aspects of the acoustic stability of the signal. As such, HNR may be a useful parameter in the analysis of voice quality, although it cannot be directly interpreted in terms of underlying glottal events or perceptual characteristics.
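The comb-liftering idea described in the abstract can be illustrated with a minimal Python sketch. It is not the published algorithm: the function name `cepstral_hnr_sketch`, the Hann window, the F0 search range, the lifter half-width, and the final energy ratio are all assumptions made for illustration, and any further corrections the paper may apply to the noise estimate are not reproduced. The sketch computes the real cepstrum of one frame, zeroes the rahmonic peaks, transforms back to obtain a smoothed noise-floor estimate, and reports the ratio of total spectral energy to that estimate in dB.

```python
import numpy as np

def cepstral_hnr_sketch(frame, fs, f0_min=60.0, f0_max=400.0, half_width=3):
    """Rough cepstrum-based HNR estimate in dB for one frame (illustrative only).

    All parameter defaults are assumptions, not values taken from the paper.
    The frame must be longer than 2 * fs / f0_min samples.
    """
    n = len(frame)
    windowed = frame * np.hanning(n)

    # Log-magnitude spectrum and its real cepstrum.
    mag = np.abs(np.fft.fft(windowed))
    log_mag = np.log(mag + 1e-12)
    cep = np.fft.ifft(log_mag).real

    # Locate the first rahmonic inside the assumed pitch-period range.
    q_lo = int(fs / f0_max)
    q_hi = int(fs / f0_min)
    q0 = q_lo + int(np.argmax(cep[q_lo:q_hi]))

    # Comb-lifter: zero a small band around every rahmonic k * q0,
    # together with its mirror image, so the cepstrum stays symmetric.
    liftered = cep.copy()
    for k in range(1, (n // 2) // q0 + 1):
        centre = k * q0
        lo = max(centre - half_width, 1)
        hi = min(centre + half_width, n // 2)
        liftered[lo:hi + 1] = 0.0
        liftered[n - hi:n - lo + 1] = 0.0

    # Back to the spectral domain: a smoothed log-magnitude noise estimate.
    noise_log_mag = np.fft.fft(liftered).real

    # Energy ratio over positive frequencies (DC excluded).
    bins = slice(1, n // 2)
    total_energy = np.sum(mag[bins] ** 2)
    noise_energy = np.sum(np.exp(2.0 * noise_log_mag[bins]))
    return 10.0 * np.log10(total_energy / noise_energy)
```

Applied to a synthetic harmonic signal with increasing amounts of additive white noise, the returned value should fall as the noise level rises, which is the qualitative behavior the abstract reports. The absolute values depend on the assumed window, half-width, and frame length, so they should not be compared with figures from the paper.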
Pages: 254-266
Number of pages: 13