Streaming Reduction Circuit

被引:3
作者
Gerards, Marco [1 ]
Kuper, Jan [1 ]
Kokkeler, Andre [1 ]
Molenkamp, Bert [1 ]
机构
[1] Univ Twente, Dept EEMCS, NL-7500 AE Enschede, Netherlands
来源
PROCEEDINGS OF THE 2009 12TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN, ARCHITECTURES, METHODS AND TOOLS | 2009年
关键词
D O I
10.1109/DSD.2009.141
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Reduction circuits are used to reduce rows of floating point values to single values. Binary floating point operators often have deep pipelines, which may cause hazards when many consecutive rows have to be reduced. We present an algorithm by which any number of consecutive rows of arbitrary lengths can be reduced by a pipelined commutative and associative binary operator in an efficient manner. The algorithm is simple to implement, has a low latency, produces results in-order, and requires only small buffers. Besides, it uses only a single pipeline for the involved operation. The complexity of the algorithm depends on the depth of the pipeline, not on the length of the input rows. In this paper we discuss an implementation of this algorithm and we prove its correctness.
引用
收藏
页码:287 / 292
页数:6
相关论文
共 7 条
[1]  
Bodnar MR, 2006, ANN IEEE SYM FIELD P, P303
[2]   On the efficiency of reductions in μ-SIMD media extensions [J].
Corbal, J ;
Espasa, R ;
Valero, M .
2001 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2001, :83-94
[3]  
HE C, 2006, GROUP ALIGNMENT BASE, P136
[4]   PARALLEL SOLUTION OF LINEAR-SYSTEMS WITH STRIPED SPARSE MATRICES [J].
MELHEM, R .
PARALLEL COMPUTING, 1988, 6 (02) :165-184
[5]   An FPGA-based application-specific processor for efficient reduction of multiple variable-length floating-point data sets [J].
Morris, Gerald R. ;
Prasanna, Viktor K. ;
Anderson, Richard D. .
IEEE 17TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, PROCEEDINGS, 2006, :323-+
[6]  
Weber M, 2001, ARBITERS DESIGN IDEA
[7]   High-performance reduction circuits using deeply pipelined operators on FPGAs [J].
Zhuo, Ling ;
Morris, Gerald R. ;
Prasanna, Viktor K. .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2007, 18 (10) :1377-1392