A coarse-grained reconfigurable computing architecture with loop self-pipelining

Cited by: 0
Authors
Yong Dou
GuiMing Wu
JinHui Xu
XingMing Zhou
Affiliations
[1] National University of Defense Technology, National Laboratory for Parallel & Distributed Processing
Source
Science in China Series F: Information Sciences | 2009 / Vol. 52
Keywords
reconfigurable computing; loop pipelining; data driven; register promotion
DOI
Not available
Abstract
Reconfigurable computing seeks a balance between the high efficiency of custom computing and the flexibility of general-purpose computing. This paper presents the implementation techniques used in LEAP, a coarse-grained reconfigurable array. It proposes a speculative execution mechanism for dynamic loop scheduling that targets one iteration per cycle, together with implementation techniques that support decoupled synchronization between the token generator and the collector. The paper also introduces techniques for exploiting both intra-iteration and inter-iteration data dependences, with the help of two instructions for special data reuse in loop-carried dependences. Experimental results show that the number of memory accesses is, on average, 3% of that of a RISC processor simulator with no memory optimization. In a practical image matching application, the LEAP architecture achieves a speedup of about 34 times in execution cycles compared with general-purpose processors.
Pages: 575-587
Page count: 12