A new approach to constructing optimal parallel prefix circuits with small depth

被引:16
作者
Lin, YC
Hsiao, JW
机构
[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Sect 4, Taipei 10672, Taiwan
[2] Natl Taiwan Univ, Dept Elect Engn, Sect 4, Taipei 10672, Taiwan
关键词
combinational circuits; depth; depth-size optimal; fan-out; parallel algorithms; prefix computation; size optimal;
D O I
10.1016/j.jpdc.2003.09.004
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Parallel prefix circuits are parallel algorithms performing the prefix operation for the combinational circuit model. The size of a prefix circuit is the number of operation nodes in the circuit, and the depth is the maximum level of operation nodes. A circuit with n inputs is depth-size optimal if its depth plus size equals 2n - 2. Smaller depth implies faster computation, while smaller size implies less power consumption, smaller VLSI area, and less cost. A circuit should have a small fan-out and small depth for it to be of practical use. In this paper, we present a new approach to easing the design of parallel prefix circuits, and construct a depth-size optimal parallel prefix circuit, named WE4, with fan-out 4. In many cases of n, WE4 has the smallest depth among all known depth-size optimal prefix circuits with bounded fan-out. (C) 2003 Elsevier Inc. All rights reserved.
引用
收藏
页码:97 / 107
页数:11
相关论文
共 33 条
[11]  
HSIAO JW, 2001, THESIS NATL TAIWAN U
[12]   THE POWER OF PARALLEL PREFIX [J].
KRUSKAL, CP ;
RUDOLPH, L ;
SNIR, M .
IEEE TRANSACTIONS ON COMPUTERS, 1985, 34 (10) :965-968
[13]   PARALLEL PREFIX COMPUTATION [J].
LADNER, RE ;
FISCHER, MJ .
JOURNAL OF THE ACM, 1980, 27 (04) :831-838
[14]  
Lakshmivarahan S., 1987, Proceedings of the 1987 International Conference on Parallel Processing, P58
[15]  
LAKSHMIVARAHAN S, 1994, PARALLEL COMPUTING U
[16]  
Leighton F.T., 1992, Introduction to Parallel Algorithms and Architecture: Arrays. Trees. Hypercubes
[17]   Scalable hardware-algorithms for binary prefix sums [J].
Lin, R ;
Nakano, K ;
Olariu, S ;
Pinotti, MC ;
Schwing, JL ;
Zomaya, AY .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2000, 11 (08) :838-850
[18]  
Lin YC, 2003, J INF SCI ENG, V19, P75
[19]   Efficient parallel prefix algorithms on multiport message-passing systems [J].
Lin, YC ;
Yeh, CS .
INFORMATION PROCESSING LETTERS, 1999, 71 (02) :91-95
[20]   Constructing H4, a fast depth-size optimal parallel prefix circuit [J].
Lin, YC ;
Hsu, YH ;
Liu, CK .
JOURNAL OF SUPERCOMPUTING, 2003, 24 (03) :279-304