On the usefulness of STFT phase spectrum in human listening tests

被引:102
作者
Paliwal, KK [1 ]
Alsteris, LD [1 ]
机构
[1] Griffith Univ, Sch Microelect Engn, Nathan, Qld 4111, Australia
关键词
short-time Fourier transform; phase spectrum; magnitude spectrum; speech perception; overlap-add procedure; automatic speech recognition;
D O I
10.1016/j.specom.2004.08.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The short-time Fourier transform (STFT) of a speech signal has two components: the magnitude spectrum and the phase spectrum. In this paper, the relative importance of short-time magnitude and phase spectra for speech perception is investigated. Human perception experiments are conducted to measure intelligibility of speech stimuli synthesized either from magnitude spectra or phase spectra. It is traditionally believed that the magnitude spectrum plays a dominant role for small window durations (20-40 ms); while the phase spectrum is more important for large window durations (>1 s). It is shown in this paper that even for small window durations, the phase spectrum can contribute to speech intelligibility as much as the magnitude spectrum if the analysis-modification-synthesis parameters are properly selected. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:153 / 170
页数:18
相关论文
共 46 条
[11]   SIGNAL RECONSTRUCTION FROM PHASE OR MAGNITUDE [J].
HAYES, MH ;
LIM, JS ;
OPPENHEIM, AV .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (06) :672-680
[12]   SOME RESULTS ON THE TIME-FREQUENCY SAMPLING OF THE SHORT-TIME FOURIER-TRANSFORM MAGNITUDE [J].
IZRAELEVITZ, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (06) :1611-1613
[13]  
Kim DS, 2000, INT CONF ACOUST SPEE, P1383, DOI 10.1109/ICASSP.2000.861838
[14]   ENHANCEMENT AND BANDWIDTH COMPRESSION OF NOISY SPEECH [J].
LIM, JS ;
OPPENHEIM, AV .
PROCEEDINGS OF THE IEEE, 1979, 67 (12) :1586-1604
[15]   Effects of phase on the perception of intervocalic stop consonants [J].
Liu, L ;
He, JL ;
Palm, G .
SPEECH COMMUNICATION, 1997, 22 (04) :403-417
[16]   PHASE EFFECTS IN MONAURAL PERCEPTION [J].
MATHES, RC ;
MILLER, RL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1947, 19 (05) :780-797
[17]   RECONSTRUCTION OF SIGNALS FROM PHASE - EFFICIENT ALGORITHMS, SEGMENTATION, AND GENERALIZATIONS [J].
MERCHANT, GA ;
PARKS, TW .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1983, 31 (05) :1135-1147
[18]   SIGNAL RECONSTRUCTION FROM SHORT-TIME FOURIER-TRANSFORM MAGNITUDE [J].
NAWAB, SH ;
QUATIERI, TF ;
LIM, JS .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1983, 31 (04) :986-998
[19]  
Ohm GS., 1843, Annalen der Physik, V135, P513, DOI DOI 10.1002/ANDP.18431350802
[20]   THE IMPORTANCE OF PHASE IN SIGNALS [J].
OPPENHEIM, AV ;
LIM, JS .
PROCEEDINGS OF THE IEEE, 1981, 69 (05) :529-541