On the usefulness of STFT phase spectrum in human listening tests

Cited by: 102
Authors
Paliwal, KK [1]
Alsteris, LD [1]
Affiliations
[1] Griffith Univ, Sch Microelect Engn, Nathan, Qld 4111, Australia
Keywords
short-time Fourier transform; phase spectrum; magnitude spectrum; speech perception; overlap-add procedure; automatic speech recognition
DOI
10.1016/j.specom.2004.08.001
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
The short-time Fourier transform (STFT) of a speech signal has two components: the magnitude spectrum and the phase spectrum. In this paper, the relative importance of short-time magnitude and phase spectra for speech perception is investigated. Human perception experiments are conducted to measure the intelligibility of speech stimuli synthesized from either magnitude spectra or phase spectra. It is traditionally believed that the magnitude spectrum plays a dominant role for small window durations (20-40 ms), while the phase spectrum is more important for large window durations (>1 s). It is shown in this paper that even for small window durations, the phase spectrum can contribute to speech intelligibility as much as the magnitude spectrum if the analysis-modification-synthesis parameters are properly selected. © 2004 Elsevier B.V. All rights reserved.
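The analysis-modification-synthesis idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names, the Hamming window, the 32 ms window length, the 1/8 frame shift, the synthetic test signal, and the choice of unit magnitude for a phase-only stimulus and uniformly random phase for a magnitude-only stimulus are all assumptions made here for illustration.

```python
import numpy as np

def stft(x, win, hop):
    """Slice x into overlapping frames, apply the window, FFT each frame."""
    n = len(win)
    n_frames = 1 + (len(x) - n) // hop
    frames = np.stack([x[i * hop : i * hop + n] * win for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def overlap_add(spec, win, hop):
    """Inverse FFT each frame, overlap-add, and divide by the summed window."""
    n = len(win)
    frames = np.fft.irfft(spec, n=n, axis=1)
    length = hop * (len(frames) - 1) + n
    y = np.zeros(length)
    norm = np.zeros(length)
    for i, frame in enumerate(frames):
        y[i * hop : i * hop + n] += frame
        norm[i * hop : i * hop + n] += win
    return y / np.maximum(norm, 1e-8)

def phase_only(spec):
    """Assumed phase-only modification: keep phase, set all magnitudes to 1."""
    return np.exp(1j * np.angle(spec))

def magnitude_only(spec, rng):
    """Assumed magnitude-only modification: keep magnitude, randomize phase."""
    random_phase = rng.uniform(-np.pi, np.pi, size=spec.shape)
    return np.abs(spec) * np.exp(1j * random_phase)

# Synthetic stand-in for a speech signal (hypothetical test input).
rng = np.random.default_rng(0)
fs = 8000                                   # sampling rate in Hz (assumed)
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * (200 + 300 * t) * t) + 0.01 * rng.standard_normal(fs)

win = np.hamming(256)                       # 32 ms at 8 kHz (assumed)
hop = 32                                    # 1/8 of the window length (assumed)
spec = stft(x, win, hop)

y_phase = overlap_add(phase_only(spec), win, hop)        # phase-only stimulus
y_mag = overlap_add(magnitude_only(spec, rng), win, hop) # magnitude-only stimulus
```

Changing `win` and `hop` in this sketch is one way to explore how the analysis window and frame shift affect the two kinds of stimuli; the specific parameter settings studied in the paper are not reproduced here.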
Pages: 153-170
Page count: 18
Related papers
46 in total
[1]   UNIFIED APPROACH TO SHORT-TIME FOURIER-ANALYSIS AND SYNTHESIS [J].
ALLEN, JB ;
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1977, 65 (11) :1558-1564
[2]   SHORT-TERM SPECTRAL ANALYSIS, SYNTHESIS, AND MODIFICATION BY DISCRETE FOURIER-TRANSFORM [J].
ALLEN, JB .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1977, 25 (03) :235-238
[3]  
Alsteris, 2003, P EUR 2003, P2117
[4]  
Alsteris LD, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P573
[5]  
COX RC, 1980, P IEEE INT CONT ACOU, P150
[6]   WEIGHTED OVERLAP-ADD METHOD OF SHORT-TIME FOURIER ANALYSIS-SYNTHESIS [J].
CROCHIERE, RE .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (01) :99-102
[7]   EFFECTS OF ADDITIVE NOISE ON SIGNAL RECONSTRUCTION FROM FOURIER-TRANSFORM PHASE [J].
ESPY, CY ;
LIM, JS .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1983, 31 (04) :894-898
[8]   PHASE VOCODER [J].
FLANAGAN, JL ;
GOLDEN, RM .
BELL SYSTEM TECHNICAL JOURNAL, 1966, 45 (09) :1493-+
[9]   AUDITORY SPECTRAL FILTERING AND MONAURAL PHASE PERCEPTION [J].
GOLDSTEIN, JL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1967, 41 (02) :458-+
[10]   SIGNAL ESTIMATION FROM MODIFIED SHORT-TIME FOURIER-TRANSFORM [J].
GRIFFIN, DW ;
LIM, JS .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (02) :236-243