CALCULATION OF A CONSTANT-Q SPECTRAL TRANSFORM

被引:531
作者
BROWN, JC [1 ]
机构
[1] WELLESLEY COLL,DEPT PHYS,WELLESLEY,MA 02181
关键词
D O I
10.1121/1.400476
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The frequencies that have been chosen to make up the scale of Western music are geometrically spaced. Thus the discrete Fourier transform (DFT), although extremely efficient in the fast Fourier transform implementation, yields components which do not map efficiently to musical frequencies. This is because the frequency components calculated with the DFT are separated by a constant frequency difference and with a constant resolution. A calculation similar to a discrete Fourier transform but with a constant ratio of center frequency to resolution has been made; this is a constant Q transform and is equivalent to a 1/24-oct filter bank. Thus there are two frequency components for each musical note so that two adjacent notes in the musical scale played simultaneously can be resolved anywhere in the musical frequency range. This transform against log (frequency) to obtain a constant pattern in the frequency domain for sounds with harmonic frequency components has been plotted. This is compared to the conventional DFT that yields a constant spacing between frequency components. In addition to advantages for resolution, representation with a constant pattern has the advantage that note identification ("note identification" rather than the term "pitch tracking," which is widely used in the signal processing community, is being used since the editor has correctly pointed out that "pitch" should be reserved for a perceptual contest), instrument recognition, and signal separation can be done elegantly by a straightforward pattern recognition algorithm.
引用
收藏
页码:425 / 434
页数:10
相关论文
共 20 条
[1]  
[Anonymous], 1965, FOURIER TRANSFORM IT
[2]   UNEQUAL BANDWIDTH SPECTRAL ANALYSIS USING DIGITAL FREQUENCY WARPING [J].
BRACCINI, C ;
OPPENHEIM, AV .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1974, AS22 (04) :236-244
[3]   MELLIN TRANSFORMS AND CONSTANT-Q SPECTRAL ANALYSIS [J].
GAMBARDELLA, G .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 66 (03) :913-915
[4]   CONTRIBUTION TO THEORY OF SHORT-TIME SPECTRAL ANALYSIS WITH NONUNIFORM BANDWIDTH FILTERS [J].
GAMBARDELLA, G .
IEEE TRANSACTIONS ON CIRCUIT THEORY, 1971, CT18 (04) :455-+
[5]   EVIDENCE FOR A GENERAL TEMPLATE IN CENTRAL OPTIMAL PROCESSING FOR PITCH OF COMPLEX TONES [J].
GERSON, A ;
GOLDSTEIN, JL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 (02) :498-510
[6]  
Harris F. J., 1976, Computers & Electrical Engineering, V3, P171, DOI 10.1016/0045-7906(76)90022-7
[7]  
HARRIS FJ, 1978, P IEEE, V66, P51, DOI 10.1109/PROC.1978.10837
[8]   POWER SPECTRA OBTAINED FROM EXPONENTIALLY INCREASING SPACINGS OF SAMPLING POSITIONS AND FREQUENCIES [J].
HELMS, HD .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (01) :63-71
[9]   FAST FOURIER-TRANSFORM - INTRODUCTION WITH SOME MINICOMPUTER EXPERIMENTS [J].
HIGGINS, RJ .
AMERICAN JOURNAL OF PHYSICS, 1976, 44 (08) :766-773
[10]  
KASHIMA KL, 1985, STANM28 STANF DEP MU