A coarse-grained reconfigurable computing architecture with loop self-pipelining

被引:0
作者
Yong Dou
GuiMing Wu
JinHui Xu
XingMing Zhou
机构
[1] National University of Defense Technology,National Laboratory for Parallel & Distributed Processing
来源
Science in China Series F: Information Sciences | 2009年 / 52卷
关键词
reconfigurable computing; loop pipelining; data driven; register promotion;
D O I
暂无
中图分类号
学科分类号
摘要
Reconfigurable computing tries to achieve the balance between high efficiency of custom computing and flexibility of general-purpose computing. This paper presents the implementation techniques in LEAP, a coarse-grained reconfigurable array, and proposes a speculative execution mechanism for dynamic loop scheduling with the goal of one iteration per cycle and implementation techniques to support decoupling synchronization between the token generator and the collector. This paper also introduces the techniques of exploiting both data dependences of intra- and inter-iteration, with the help of two instructions for special data reuses in the loop-carried dependences. The experimental results show that the number of memory accesses reaches on average 3% of an RISC processor simulator with no memory optimization. In a practical image matching application, LEAP architecture achieves about 34 times of speedup in execution cycles, compared with general-purpose processors.
引用
收藏
页码:575 / 587
页数:12
相关论文
共 37 条
[21]   Coarse-grained reconfigurable architectures for radio baseband processing: A survey [J].
Hassan, Zohaib ;
Ometov, Aleksandr ;
Lohan, Elena Simona ;
Nurmi, Jari .
JOURNAL OF SYSTEMS ARCHITECTURE, 2024, 154
[22]   Template-based Compilation for Coarse-grained Reconfigurable Processor [J].
Yin, Shouyi ;
Yin, Chongyong ;
Liu, Leibo ;
Wei, Shaojun .
2009 IEEE 8TH INTERNATIONAL CONFERENCE ON ASIC, VOLS 1 AND 2, PROCEEDINGS, 2009, :1302-1305
[23]   Design and Implementation of a Coarse-grained Dynamically Reconfigurable Multimedia Accelerator [J].
Nguyen, Hung K. ;
Tran, Xuan-Tu .
ACM TRANSACTIONS ON PARALLEL COMPUTING, 2022, 9 (03)
[24]   BilRC: An Execution Triggered Coarse Grained Reconfigurable Architecture [J].
Atak, Oguzhan ;
Atalar, Abdullah .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2013, 21 (07) :1285-1298
[25]   A Template-based Framework for Exploring Coarse-Grained Reconfigurable Architectures [J].
Podobas, Artur ;
Sano, Kentaro ;
Matsuoka, Satoshi .
2020 IEEE 31ST INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2020), 2020, :1-8
[26]   Instructionless General Purpose Coarse-Grained Reconfigurable Processor Performance in Encryption [J].
Mundza, Zbigniew ;
Kielbik, Rafal .
PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS - MIXDES 2017, 2017, :564-568
[27]   Prevention of Hot Spot Development on Coarse-Grained Dynamically Reconfigurable Architectures [J].
Eisenhardt, Sven ;
Schweizer, Thomas ;
Bernauer, Andreas ;
Kuhn, Tommy ;
Rosenstiel, Wolfgang .
2009 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS, 2009, :12-17
[28]   Hierarchical representation of on-chip context to reduce reconfiguration time and implementation area for coarse-grained reconfigurable architecture [J].
YanSheng Wang ;
LeiBo Liu ;
ShouYi Yin ;
Min Zhu ;
Peng Cao ;
Jun Yang ;
ShaoJun Wei .
Science China Information Sciences, 2013, 56 :1-20
[29]   Hierarchical representation of on-chip context to reduce reconfiguration time and implementation area for coarse-grained reconfigurable architecture [J].
Wang YanSheng ;
Liu LeiBo ;
Yin ShouYi ;
Zhu Min ;
Cao Peng ;
Yang Jun ;
Wei ShaoJun .
SCIENCE CHINA-INFORMATION SCIENCES, 2013, 56 (11) :1-20
[30]   Hierarchical representation of on-chip context to reduce reconfiguration time and implementation area for coarse-grained reconfigurable architecture [J].
WANG YanSheng ;
LIU LeiBo ;
YIN ShouYi ;
ZHU Min ;
CAO Peng ;
YANG Jun ;
WEI ShaoJun .
ScienceChina(InformationSciences), 2013, 56 (11) :279-298