A Framework for Invertible, Real-Time Constant-Q Transforms

被引:68
作者
Holighaus, Nicki [1 ]
Doerfler, Monika [2 ]
Angelo Velasco, Gino [3 ]
Grill, Thomas [4 ]
机构
[1] Austrian Acad Sci, Acoust Res Inst, A-1040 Vienna, Austria
[2] Univ Vienna, Fac Math, Numer Harmon Anal Grp, A-1090 Vienna, Austria
[3] Univ Philippines, Coll Sci, Inst Math, Quezon City 1101, Philippines
[4] Austrian Res Inst Artificial Intelligence, A-1010 Vienna, Austria
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013年 / 21卷 / 04期
基金
奥地利科学基金会;
关键词
Audio signals; constant-Q; Gabor frames; real-time; time-frequency dictionary; RECONSTRUCTION;
D O I
10.1109/TASL.2012.2234114
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Audio signal processing frequently requires time-frequency representations and in many applications, a non-linear spacing of frequency bands is preferable. This paper introduces a framework for efficient implementation of invertible signal transforms allowing for non-uniform frequency resolution. Non-uniformity in frequency is realized by applying nonstationary Gabor frames with adaptivity in the frequency domain. The realization of a perfectly invertible constant-Q transform is described in detail. To achieve real-time processing, independent of signal length, slice-wise processing of the full input signal is proposed and referred to as sliCQ transform. By applying frame theory and FFT-based processing, the presented approach overcomes computational inefficiency and lack of invertibility of classical constant-Q transform implementations. Numerical simulations evaluate the efficiency of the proposed algorithm and the method's applicability is illustrated by experiments on real-life audio signals.
引用
收藏
页码:775 / 785
页数:11
相关论文
共 22 条
[1]  
[Anonymous], P 9 SOUND MUS COMP C
[2]  
[Anonymous], THESIS U MEDITERRANE
[3]   Theory, implementation and applications of nonstationary Gabor frames [J].
Balazs, P. ;
Doerfler, M. ;
Jaillet, F. ;
Holighaus, N. ;
Velasco, G. .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2011, 236 (06) :1481-1496
[4]   Frequency-Domain Design of Overcomplete Rational-Dilation Wavelet Transforms [J].
Bayram, Ilker ;
Selesnick, Ivan W. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (08) :2957-2972
[5]   CALCULATION OF A CONSTANT-Q SPECTRAL TRANSFORM [J].
BROWN, JC .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 89 (01) :425-434
[6]   AN EFFICIENT ALGORITHM FOR THE CALCULATION OF A CONSTANT-Q TRANSFORM [J].
BROWN, JC ;
PUCKETTE, MS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1992, 92 (05) :2698-2701
[7]  
Cranitch M., 2006, P AUD ENG SOC CONV, V120
[8]   PAINLESS NONORTHOGONAL EXPANSIONS [J].
DAUBECHIES, I ;
GROSSMANN, A ;
MEYER, Y .
JOURNAL OF MATHEMATICAL PHYSICS, 1986, 27 (05) :1271-1283
[9]   THE PHASE VOCODER - A TUTORIAL [J].
DOLSON, M .
COMPUTER MUSIC JOURNAL, 1986, 10 (04) :14-27
[10]   A CLASS OF NONHARMONIC FOURIER SERIES [J].
DUFFIN, RJ ;
SCHAEFFER, AC .
TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1952, 72 (MAR) :341-366