Parallel merged multiplier-accumulator coprocessor optimized for digital filters

被引:2
作者
Parandeh-Afshar, H.
Fakhraie, S. M. [1 ]
Fatemi, O. [1 ]
机构
[1] Univ Tehran, Nanoelect Ctr Excellence, Dept Elect & Comp Engn, Tehran, Iran
关键词
Multiply-accumulate; DSP; SIMD; RISC processor; Coprocessor; Audio filters;
D O I
10.1016/j.compeleceng.2008.04.005
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In an attempt to improve the speed of VLSI signal processing systems, a new architecture for a high-speed multiply-accumulate (MAC) unit optimized for digital filters is proposed. This unit is designed as a coprocessor for the LEON2 RISC processor [LEON2 Processor; 2005 [Online]. <http://www.gaisler.com/products/leon2/leon.html>]. In this work, four parallel MAC units with two dual-port coefficient register-files, a three-port general register-file and a control unit are included in the coprocessing block. With the existence of four parallel units, several SIMD format instructions have been added to LEON2 instruction set. Each MAC unit has two 16-bit inputs, 32-bit output register and a programmable round-saturate block. The MAC unit uses a new architecture which embeds the accumulate module within the partial products summation tree of the multiplier with minimum overhead. A central control unit controls inputs of the four MACs and loading of the output registers. Our experimental results demonstrate a high performance in implementation of digital filters at elevated speeds of up to 33 millions of input samples per second in a 018 mu M technology. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:864 / 873
页数:10
相关论文
共 17 条
  • [1] ACKLAND B, 2000, IEEE J SOLID STATE C, V35
  • [2] [Anonymous], 1997, SIGNALS SYSTEMS
  • [3] A SIGNED BINARY MULTIPLICATION TECHNIQUE
    BOOTH, AD
    [J]. QUARTERLY JOURNAL OF MECHANICS AND APPLIED MATHEMATICS, 1951, 4 (02) : 236 - 240
  • [4] Architecture and implementation of a vector/SIMD multiply-accumulate unit
    Danysh, A
    Tan, D
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2005, 54 (03) : 284 - 293
  • [5] ELQUIBALY F, 2000, IEEE T CIRC SYST, V47
  • [6] FAYED AA, 2002, IEEE T VLSI, V3
  • [7] HINRICHS W, 2000, IEEE J SOLID STATE C, V35
  • [8] Multimedia processor architecture
    Ikedo, T
    Martens, WL
    [J]. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS, 1998, : 316 - 325
  • [9] Kang HJ, 2005, P SOC PHOTO-OPT INS, P448, DOI 10.1117/12.586535
  • [10] Lee R. B., 1997, P IEEE WORKSH SIGN P, P9