Improving Performance of Nested Loops on Reconfigurable Array Processors

被引:23
|
作者
Kim, Yongjoo [2 ]
Lee, Jongeun [1 ]
Mai, Toan X. [1 ]
Paek, Yunheung [2 ]
机构
[1] UNIST, Sch ECE, Energy Efficient Comp Lab, Ulsan 689798, South Korea
[2] Seoul Natl Univ, Sch Elect Engn & Comp Sci, Software Optimizat & Restruct Grp, Seoul 151744, South Korea
基金
新加坡国家研究基金会;
关键词
Design; Algorithms; Performance; Coarse-grained reconfigurable architecture; compilation; nested loop; software pipelining; PARALLEL;
D O I
10.1145/2086696.2086711
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Pipelining algorithms are typically concerned with improving only the steady-state performance, or the kernel time. The pipeline setup time happens only once and therefore can be negligible compared to the kernel time. However, for Coarse-Grained Reconfigurable Architectures (CGRAs) used as a coprocessor to a main processor, pipeline setup can take much longer due to the communication delay between the two processors, and can become significant if it is repeated in an outer loop of a loop nest. In this paper we evaluate the overhead of such non-kernel execution times when mapping nested loops for CGRAs, and propose a novel architecture-compiler cooperative scheme to reduce the overhead, while also minimizing the number of extra configurations required. Our experimental results using loops from multimedia and scientific domains demonstrate that our proposed techniques can greatly increase the performance of nested loops by up to 2.87 times compared to the conventional approach of accelerating only the innermost loops. Moreover, the mappings generated by our techniques require only a modest number of configurations that can fit in recent reconfigurable architectures.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Reconfigurable nested photonic topological loops
    李小雪
    芮光浩
    崔一平
    顾兵
    Chinese Optics Letters, 2022, 20 (06) : 23 - 28
  • [2] Reconfigurable nested photonic topological loops
    Li, Xiaoxue
    Rui, Guanghao
    Cui, Yiping
    Gu, Bing
    CHINESE OPTICS LETTERS, 2022, 20 (06)
  • [3] Phase locked loops for array processors
    Leonov, GA
    Seledzhi, SM
    1ST IEEE INTERNATIONAL CONFERENCE ON CIRCUITS AND SYSTEMS FOR COMMNICATIONS, PROCEEDINGS, 2002, : 281 - 282
  • [4] A dynamically reconfigurable interconnect for array processors
    John, LK
    John, E
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 1998, 6 (01) : 150 - 157
  • [5] A DESIGN METHOD FOR ONLINE RECONFIGURABLE ARRAY PROCESSORS
    FRANZEN, J
    JOURNAL OF VLSI SIGNAL PROCESSING, 1993, 5 (01): : 21 - 35
  • [6] Effective nested Kalman fusion for improving microelectromechanical system array performance
    Liang, Siyuan
    Huang, Yijie
    Zhu, Weilong
    Zhao, Feng
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2020, 31 (11)
  • [7] Design method for on-line reconfigurable array processors
    Franzen, Jens
    Journal of VLSI Signal Processing Systems for Signal, Image, and Video Technology, 1993, 5 (01): : 21 - 35
  • [8] Timing analysis of an optically differential reconfigurable gate array for dynamically reconfigurable processors
    Watanabe, M
    Kobayashi, F
    ERSA '04: THE 2004 INTERNATIONAL CONFERENCE ON ENGINEERING OF RECONFIGURABLE SYSTEMS AND ALGORITHMS, 2004, : 311 - 311
  • [9] ARRAY PROCESSORS BOOST PERFORMANCE
    PERSUN, T
    ELECTRONIC PRODUCTS MAGAZINE, 1984, 26 (12): : 69 - 77
  • [10] Improving nested loops' ILP on a parallel ASIC design
    Light, R
    Maxfield, W
    Reed, B
    Passos, N
    Sha, EHM
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2000, : 105 - 110