IMPLEMENTATION OF EFFICIENT FFT ALGORITHMS ON FUSED MULTIPLY ADD ARCHITECTURES

被引:22
作者
LINZER, EN
FEIG, E
机构
[1] IBM Thomas J. Watson Research Laboratory, Yorktown Heights
关键词
D O I
10.1109/TSP.1993.193130
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The decimation-in-time radix-2, radix-4, split-radix, and radix-8 algorithms, presented in a paper by Linzer and Feig [51, are described in detail. These algorithms compute discrete Fourier transforms (DFT's) on input sequences with lengths that are powers of 2 with fewer multiply-adds than traditional Cooley-Tukey algorithms. The descriptions given provide the needed details to implement these algorithms efficiently in a computer program that could compute DFT's on a length 2m sequence for general m. We describe and give timing results for a radix-4 version that we have implemented on the RS/6000 workstation. The timing results show that a substantial saving in execution time is obtained when the new radix-4 FFT is used instead of a standard Cooley-Tukey radix-4 FFT. Finally, we present a set of experiments that suggest that numerical behavior of the new algorithms is slightly better than the numerical behavior of Cooley-Tukey FFT's.
引用
收藏
页码:93 / 107
页数:15
相关论文
共 11 条
[1]  
BLAHUT RE, 1986, FAST ALGORITHMS DIGI
[2]   AN ALGORITHM FOR MACHINE CALCULATION OF COMPLEX FOURIER SERIES [J].
COOLEY, JW ;
TUKEY, JW .
MATHEMATICS OF COMPUTATION, 1965, 19 (90) :297-&
[3]   IMPLEMENTATION OF SPLIT-RADIX FFT ALGORITHMS FOR COMPLEX, REAL, AND REAL-SYMMETRICAL DATA [J].
DUHAMEL, P .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (02) :285-295
[4]  
LI Z, 1986, P IEEE INT C AC SPEE, P289
[5]  
LINZER E, 1991, RC17414 IBM RES TECH
[6]  
LINZER E, IN PRESS MATH COMPUT, V60
[7]  
LU C, 1991, P ICASSP, P2185
[8]  
MECHENTEL A, 1990, COMMUNICATION
[9]  
MEYER R, 1990, P ICASSP, P15003
[10]  
MONTOYE RK, 1990, IBM J RES DEV, V34, P71