On the perceptually irrelevant phase information in sinusoidal representation of speech

Cited by: 12
Authors
Kim, DS [1]
Affiliations
[1] Samsung Adv Inst Technol, Human & Comp Interact Lab, Kyonggi Do 449712, South Korea
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2001 / Vol. 9 / No. 08
Keywords
phase perception; speech coding; speech quality;
DOI
10.1109/89.966093
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
For efficient quantization of speech representations, it is essential to incorporate perceptual characteristics of human hearing. However, the focus has been confined to the magnitude information of speech, and little attention has been paid to phase information. This paper presents a novel approach, termed perceptually irrelevant phase elimination (PIPE), to identify the phase information in acoustic signals that is irrelevant to perceived quality. The proposed method, inspired by the observation that the relative phase relationship within a critical band is perceptually important, is derived not only for stationary Fourier signals but also for harmonic signals. For harmonic signals, a "critical phase frequency" is defined, below which phase information is perceptually irrelevant. The PIPE algorithm is incorporated into the harmonic analysis/synthesis of speech, and subjective test results demonstrate the effectiveness of the proposed method.
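The idea of a frequency limit below which harmonic phases stop mattering can be illustrated with a rough sketch. This is not the paper's definition of the critical phase frequency, which the abstract does not spell out. It assumes, purely for illustration, that a harmonic's phase can be discarded when neighbouring harmonics are spaced more than one auditory-filter bandwidth apart, and it approximates that bandwidth with the Glasberg and Moore (1990) ERB formula. The function names and the 4 kHz upper limit are hypothetical.

```python
def erb_hz(f_hz: float) -> float:
    """Equivalent rectangular bandwidth (Glasberg & Moore, 1990), in Hz."""
    return 24.7 * (4.37 * f_hz / 1000.0 + 1.0)


def resolved_limit_hz(f0_hz: float) -> float:
    """Frequency at which erb_hz(f) equals the harmonic spacing f0_hz.

    Illustrative assumption: below this frequency adjacent harmonics are
    more than one ERB apart. Returns 0.0 when f0_hz is smaller than the
    minimum ERB (24.7 Hz), i.e. no harmonic is treated as resolved.
    """
    f = (f0_hz / 24.7 - 1.0) * 1000.0 / 4.37
    return max(f, 0.0)


def phase_free_harmonics(f0_hz: float, f_max_hz: float = 4000.0) -> list[int]:
    """Indices k of harmonics k * f0_hz lying at or below the limit."""
    limit = min(resolved_limit_hz(f0_hz), f_max_hz)
    n = int(limit // f0_hz)
    return list(range(1, n + 1))


if __name__ == "__main__":
    for f0 in (100.0, 200.0, 300.0):
        print(f0, round(resolved_limit_hz(f0), 1), phase_free_harmonics(f0))
```

Under these assumptions a higher fundamental frequency pushes the limit upward, so more harmonics of a high-pitched voice would be candidates for phase elimination than those of a low-pitched one.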
Pages: 900-905
Number of pages: 6
Related papers
17 items in total
[1]   PREDICTIVE CODING OF SPEECH SIGNALS AND SUBJECTIVE ERROR CRITERIA [J].
ATAL, BS ;
SCHROEDER, MR .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (03) :247-254
[2]  
ATAL BS, 1984, P INT C COMM AMST, P1610
[3]  
BILSEN FA, 1973, ACUSTICA, V28, P60
[4]  
Cho YD, 1998, INT CONF ACOUST SPEE, P601, DOI 10.1109/ICASSP.1998.675336
[5]  
De Boer E, 1961, ACUSTICA, V11, P182
[6]   DERIVATION OF AUDITORY FILTER SHAPES FROM NOTCHED-NOISE DATA [J].
GLASBERG, BR ;
MOORE, BCJ .
HEARING RESEARCH, 1990, 47 (1-2) :103-138
[7]   MULTIBAND EXCITATION VOCODER [J].
GRIFFIN, DW ;
LIM, JS .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (08) :1223-1235
[9]   TRANSFORMED UP-DOWN METHODS IN PSYCHOACOUSTICS [J].
LEVITT, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 49 (02) :467-&
[10]  
MCAULAY RJ, 1995, SPEECH CODING SYNTHE, P121