Reduced computational redundancy implementation of DSP algorithms using computation sharing vector scaling

被引:13
作者
Muhammad, K [1 ]
Roy, K
机构
[1] Texas Instruments Inc, Dallas, TX 75243 USA
[2] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
adaptive computing; arithmetic; computing; high performance; low-power design; number representation; performance tradeoffs; system level;
D O I
10.1109/TVLSI.2002.1043332
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a general approach which specifically targets reduction of redundant computation in common digital-signal processing (DSP) tasks such as filtering and matrix multiplication. We show that such tasks can be expressed as multiplication of vectors by scalars and this allows fast multiplication by sharing computation. Vector scaling operation is decomposed to find the most effective precomputations which yield a fast multiplier implementation. Two decomposition approaches are presented, one based on a greedy decomposition and the other based on fixed-size lookup and this leads to two multiplier architectures for vector-scalar products. Analog simulation of an example multiplier shows a speed advantage by a factor of about 1.85 over a conventional carry save array multiplier. Further simulations using 0.18 mu technology show up to 20% speed advantage over Booth encoded Wallace tree multipliers.
引用
收藏
页码:292 / 300
页数:9
相关论文
共 11 条
[1]  
GALLAGHER W, 1994, 28 AS C SIGN SYST CO, V1, P545
[2]   Efficient semisystolic architectures for finite-field arithmetic [J].
Jain, SK ;
Song, LL ;
Parhi, KK .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 1998, 6 (01) :101-113
[3]   AN ARCHITECTURE FOR HIGH-PERFORMANCE SMALL-AREA MULTIPLIERS FOR USE IN DIGITAL FILTERING APPLICATIONS [J].
KWENTUS, AY ;
HUNG, HT ;
WILLSON, AN .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1994, 29 (02) :117-121
[4]  
MARTEL C, 1995, PROCEEDINGS OF THE 12TH SYMPOSIUM ON COMPUTER ARITHMETIC, P42, DOI 10.1109/ARITH.1995.465378
[5]   A SYSTEMATIC METHODOLOGY FOR THE DESIGN OF HIGH-PERFORMANCE RECURSIVE DIGITAL-FILTERS [J].
MCQUILLAN, SE ;
MCCANNY, JV .
IEEE TRANSACTIONS ON COMPUTERS, 1995, 44 (08) :971-982
[6]   Speed, power, area, and latency tradeoffs in adaptive FIR filtering for PRML read channels [J].
Muhammad, K ;
Staszewski, RB ;
Balsara, PT .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2001, 9 (01) :42-51
[7]  
Muhammad K., 1999, Proceedings 1999 IEEE International Conference on Computer Design: VLSI in Computers and Processors (Cat. No.99CB37040), P230, DOI 10.1109/ICCD.1999.808430
[8]  
Srinivas H. R., 1992, Journal of VLSI Signal Processing, V4, P177, DOI 10.1007/BF00925121
[9]   A 550-MSamplels 8-tap FIR digital filter for magnetic recording read channels [J].
Staszewski, RB ;
Muhammad, K ;
Balsara, P .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2000, 35 (08) :1205-1210
[10]  
[No title captured]