Design and architectural exploration of expression-grained reconfigurable Arrays

被引:15
作者
Ansaloni, Giovanni [1 ]
Bonzini, Paolo [1 ]
Pozzi, Laura [1 ]
机构
[1] Univ Lugano, Fac Informat, CH-6900 Lugano, Switzerland
来源
2008 SYMPOSIUM ON APPLICATION SPECIFIC PROCESSORS | 2008年
关键词
D O I
10.1109/SASP.2008.4570782
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Reconfigurable Arrays combine the benefit of spatial execution, typical of hardware solutions, with that of programmability, present in microprocessors. When mapping software applications (or parts of them) onto hardware, however, FPGAs often provide more flexibility than is needed, and do not implement coarser-level operations efficiently. Therefore, Coarse Grained Reconfigurable Arrays (CGRAs) have been proposed to this aim. While most CGRA designs feature an array cell of the order of an ALIJ, this paper proposes a new kind of coarse grained array, called EGRA (Expression-Grained Reconfigurable Array), featuring a cell composed of a cluster of ALUs with flexible interconnect. The EGRA attempts to further close the performance gap between reconfigurable and hardwired logic by implementing an arithmetic/logic expression per cell, rather than a single operation. A mapping methodology is proposed that can retargetably compile to a family of EGRAs, therefore enabling architectural exploration of the granularity of the proposed cell. Performance results on a number of embedded applications show that EGRAs can be used as a reconfigurable fabric for customizable processors, outperforming more traditional CGRA designs.
引用
收藏
页码:26 / 33
页数:8
相关论文
共 20 条
[1]  
[Anonymous], 2005, P 2005 INT C COMPILE
[2]  
Atasu K, 2003, DES AUT CON, P256
[3]  
Bouwens F, 2007, LECT NOTES COMPUT SC, V4419, P1
[4]   Retiming DAG's [J].
Calland, PY ;
Mignotte, A ;
Peyran, O ;
Robert, Y ;
Vivien, F .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1998, 17 (12) :1319-1325
[5]  
Clark N, 2004, INT SYMP MICROARCH, P30
[6]   FLOWMAP - AN OPTIMAL TECHNOLOGY MAPPING ALGORITHM FOR DELAY OPTIMIZATION IN LOOKUP-TABLE BASED FPGA DESIGNS [J].
CONG, J ;
DING, YH .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1994, 13 (01) :1-12
[7]  
FISHER C, 2001, P 11 INT C FIELD PRO, P17
[8]   A high-performance data path for synthesizing DSP kernels [J].
Galanis, Michalis D. ;
Theodoridis, George ;
Tragoudas, Spyros ;
Goutis, Costas E. .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2006, 25 (06) :1154-1163
[9]  
Goldstein SC, 1999, CONF PROC INT SYMP C, P28, DOI [10.1145/307338.300982, 10.1109/ISCA.1999.765937]
[10]   MiBench: A free, commercially representative embedded benchmark suite [J].
Guthaus, MR ;
Ringenberg, JS ;
Ernst, D ;
Austin, TM ;
Mudge, T ;
Brown, RB .
WWC-4: IEEE INTERNATIONAL WORKSHOP ON WORKLOAD CHARACTERIZATION, 2001, :3-14