A coarse-grained reconfigurable computing architecture with loop self-pipelining

Cited by: 0
Authors
Yong Dou
GuiMing Wu
JinHui Xu
XingMing Zhou
Affiliation
[1] National University of Defense Technology, National Laboratory for Parallel & Distributed Processing
Source
Science in China Series F: Information Sciences | 2009 / Vol. 52
Keywords
reconfigurable computing; loop pipelining; data driven; register promotion
DOI
Not available
Abstract
Reconfigurable computing seeks a balance between the high efficiency of custom computing and the flexibility of general-purpose computing. This paper presents the implementation techniques used in LEAP, a coarse-grained reconfigurable array. It proposes a speculative execution mechanism for dynamic loop scheduling, with the goal of one iteration per cycle, together with implementation techniques that support decoupled synchronization between the token generator and the collector. The paper also introduces techniques for exploiting both intra-iteration and inter-iteration data dependences, with the help of two instructions for special data reuse in loop-carried dependences. Experimental results show that the number of memory accesses is, on average, 3% of that of a RISC processor simulator with no memory optimization. In a practical image matching application, the LEAP architecture achieves a speedup of about 34 times in execution cycles compared with general-purpose processors.
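The keyword "register promotion" and the abstract's mention of data reuse across loop-carried dependences can be illustrated with a small, generic sketch. It is not LEAP's mechanism or its two special instructions, which this record does not describe. The `CountingMemory` class, the three-point stencil, and both function names are hypothetical, chosen only to show how keeping reused values in scalars cuts the number of loads.

```python
class CountingMemory:
    """Hypothetical memory model: wraps a list and counts every load."""

    def __init__(self, data):
        self._data = list(data)
        self.loads = 0

    def load(self, i):
        self.loads += 1
        return self._data[i]

    def __len__(self):
        return len(self._data)


def stencil_naive(mem):
    # Every iteration reloads all three operands from memory.
    out = []
    for i in range(1, len(mem) - 1):
        out.append(mem.load(i - 1) + mem.load(i) + mem.load(i + 1))
    return out


def stencil_promoted(mem):
    # Values needed again by the next iteration stay in scalars
    # (stand-ins for registers), so each iteration issues one new load.
    out = []
    if len(mem) < 3:
        return out
    left, mid = mem.load(0), mem.load(1)
    for i in range(1, len(mem) - 1):
        right = mem.load(i + 1)
        out.append(left + mid + right)
        left, mid = mid, right  # carry the reused values forward
    return out
```

For an input of ten elements, the naive loop issues three loads in each of its eight iterations, while the promoted loop issues two initial loads plus one per iteration. Both produce the same result.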
Pages: 575-587
Number of pages: 12