Perceptual phase quantization of speech

被引:14
作者
Kim, DS [1 ]
机构
[1] Samsung Adv Inst Technol, Human & Comp Interact Lab, Kyonggi Do 449712, South Korea
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2003年 / 11卷 / 04期
关键词
JND of phase; perception; phase quantization; speech coding;
D O I
10.1109/TSA.2003.814409
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It is essential to incorporate perceptual characteristics of human hearing in modern speech/audio coding systems. However, the focus has been confined only to the magnitude information of speech, and little attention has been paid to phase information. In this paper, a quantitative. study on the characteristics of human phase perception is presented and a novel method is proposed for the quantization of phase information in speech/audio signals. First, the just-noticeable difference (JND) of phase for each harmonic in flat-spectrum periodic tones is measured for several different fundamental frequencies. Then a mathematical model of JND is established based on the measured data, to form a weighting function for phase quantization. Since the proposed weighting function is derived from psychoacoustic measurements, it provides a novel quantization method by which more bits are assigned to perceptually important phase components at the sacrifice of less important ones, resulting in perceptually closer quantized signal to the original one. Experimental results on five vowel speech signals demonstrate that the proposed weighting function is very effective for the quantization of phase information.
引用
收藏
页码:355 / 364
页数:10
相关论文
共 24 条
[1]   PREDICTIVE CODING OF SPEECH SIGNALS AND SUBJECTIVE ERROR CRITERIA [J].
ATAL, BS ;
SCHROEDER, MR .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (03) :247-254
[2]   PHASE EFFECTS IN A 3-COMPONENT SIGNAL [J].
BUUNEN, TJF ;
FESTEN, JM ;
BILSEN, FA ;
VANDENBR.G .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 55 (02) :297-303
[3]  
BUUNEN TJF, 1976, THESIS U DELFT DELFT
[4]  
Cho YD, 1998, INT CONF ACOUST SPEE, P601, DOI 10.1109/ICASSP.1998.675336
[5]  
De Boer E, 1961, ACUSTICA, V11, P182
[6]   DERIVATION OF AUDITORY FILTER SHAPES FROM NOTCHED-NOISE DATA [J].
GLASBERG, BR ;
MOORE, BCJ .
HEARING RESEARCH, 1990, 47 (1-2) :103-138
[7]   AUDITORY SPECTRAL FILTERING AND MONAURAL PHASE PERCEPTION [J].
GOLDSTEIN, JL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1967, 41 (02) :458-+
[8]  
GOTTESMAN O, 2000, THESIS U CALIFORNIA
[9]   MULTIBAND EXCITATION VOCODER [J].
GRIFFIN, DW ;
LIM, JS .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (08) :1223-1235
[10]   MONAURAL PHASE EFFECTS FOR 2-TONE SIGNALS [J].
HALL, JL ;
SCHROEDER, MR .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 51 (06) :1882-+