Algorithms for computing the time-corrected instantaneous frequency (reassigned) spectrogram, with applications

被引:141
作者
Fulop, SA [1 ]
Fitz, K
机构
[1] Calif State Univ Fresno, Dept Linguist, Fresno, CA 93740 USA
[2] Washington State Univ, Sch Elect Engn & Comp Sci, Pullman, WA 99164 USA
关键词
D O I
10.1121/1.2133000
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A modification of the spectrogram (log magnitude of the short-time Fourier transform) to more accurately show the instantaneous frequencies of signal components was first proposed in 1976 [Kodera et al., Phys. Earth Planet. Inter. 12, 142-150 (1976)], and has been considered or reinvented a few times since but never widely adopted. This paper presents a unified theoretical picture of this time-frequency analysis method, the time-corrected instantaneous frequency spectrogram, together with detailed implementable algorithms comparing three published techniques for its computation. The new representation is evaluated against the conventional spectrogram for its superior ability to track signal components. The lack of a uniform framework for either mathematics or implementation details which has characterized the disparate literature on the schemes has been remedied here. Fruitful application of the method is shown in the realms of speech phonation analysis, whale song pitch tracking, and additive sound modeling. (c) 2006 Acoustical Society of America.
引用
收藏
页码:360 / 371
页数:12
相关论文
共 27 条
[1]  
[Anonymous], P IEEE C AC SPEECH S
[2]  
[Anonymous], 2003, APPL TIME FREQUENCY
[3]   IMPROVING THE READABILITY OF TIME-FREQUENCY AND TIME-SCALE REPRESENTATIONS BY THE REASSIGNMENT METHOD [J].
AUGER, F ;
FLANDRIN, P .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1995, 43 (05) :1068-1089
[4]   Notes on the theory of modulation [J].
Carson, JR .
PROCEEDINGS OF THE INSTITUTE OF RADIO ENGINEERS, 1922, 10 (01) :57-64
[6]  
Fitz K, 2002, J AUDIO ENG SOC, V50, P879
[7]  
FRIEDMAN DH, 1985, P INT C AC SPEECH SI, P1121
[8]   Yeyi clicks: Acoustic description and analysis [J].
Fulop, SA ;
Ladefoged, P ;
Liu, F ;
Vossen, R .
PHONETICA, 2003, 60 (04) :231-260
[9]  
Gabor D., 1946, J I ELEC ENGRS PART, V93, P429, DOI [10.1049/JI-3-2.1946.0074, DOI 10.1049/JI-3-2.1946.0074, 10.1049/ji-3-2.1946.0074]
[10]   Instantaneous frequency decomposition: An application to spectrally sparse sounds with fast frequency modulations [J].
Gardner, TJ ;
Magnasco, MO .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 117 (05) :2896-2903