Conflict-Free Loop Mapping for Coarse-Grained Reconfigurable Architecture with Multi-Bank Memory

被引:14
作者
Yin, Shouyi [1 ]
Yao, Xianqing [1 ]
Lu, Tianyi [1 ]
Liu, Dajiang [1 ]
Gu, Jiangyuan [1 ]
Liu, Leibo [1 ]
Wei, Shaojun [1 ]
机构
[1] Tsinghua Univ, Inst Microelect, Beijing 100084, Peoples R China
关键词
Multi-bank; modulo scheduling; data placement; access pattern;
D O I
10.1109/TPDS.2017.2682241
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Coarse-grained reconfigurable architecture (CGRA) is a promising architecture with high performance, high power-efficiency and attraction of flexibility. The computation-intensive parts of an application (e.g., loops) are often mapped on CGRA for acceleration. Due to the high parallel data access demands, the architecture with multi-bank memory is proposed to improve parallelism. For CGRA with multi-bank memory, a joint solution, which simultaneously considers the memory partitioning and modulo scheduling, is proposed to achieve a valid mapping with better performance. In this solution, the modulo scheduling and operator scheduling are used to achieve a valid loop mapping and a valid data placement without any memory access conflicts. By avoiding the pipelining stalls caused by conflicts, the performance of loop mapping is greatly improved. The experimental results on benchmarks of the Livermore, Polybench and Mediabench show that our approach can improve the performance of loops on CGRA to 1.89 x, 1.49 x and 1.37x compared with REGIMap, HTDM and REGIMap with memory partitioning, at cost of an acceptable increase in compilation time.
引用
收藏
页码:2471 / 2485
页数:15
相关论文
empty
未找到相关数据