Symbolic Loop Parallelization for Balancing I/O and Memory Accesses on Processor Arrays

被引:0
|
作者
Tanase, Alexandru [1 ]
Witterauf, Michael [1 ]
Teich, Juergen [1 ]
Hannig, Frank [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg FAU, Dept Comp Sci, Hardware Software Co Design, Nurnberg, Germany
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Loop parallelization techniques for massively parallel processor arrays using one-level tiling are often either I/O- or memory-bounded, exceeding the target architecture's capabilities. Furthermore, if the number of available processing elements is only known at runtime-as in adaptive systems-static approaches fail. To solve these problems, we present a hybrid compile/runtime technique to symbolically parallelize loop nests with uniform dependences on multiple levels. At compile time, two novel transformations are performed: (a) symbolic hierarchical tiling followed by (b) symbolic multi-level scheduling. By tuning the size of the tiles on multiple levels, a trade-off between the necessary I/O-bandwidth and memory is possible, which facilitates obeying resource constraints. The resulting schedules are symbolic with respect to the number of tiles; thus, the number of processing elements to map onto does not need to be known at compile time. At runtime, when the number is known, a simple prolog chooses a feasible schedule with respect to I/O and memory constraints that is latency-optimal for the chosen tile size. In this way, our approach dynamically chooses latency-optimal and feasible schedules while avoiding expensive re-compilations.
引用
收藏
页码:188 / 197
页数:10
相关论文
共 15 条
  • [1] Symbolic Parallelization of Loop Programs for Massively Parallel Processor Arrays
    Teich, Juergen
    Tanase, Alexandru
    Hannig, Frank
    PROCEEDINGS OF THE 2013 IEEE 24TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 13), 2013, : 1 - 9
  • [2] Symbolic Loop Compilation for Tightly Coupled Processor Arrays
    Witterauf, Michael
    Walter, Dominik
    Hannig, Frank
    Teich, Juergen
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2021, 20 (05)
  • [3] Symbolic Mapping of Loop Programs onto Processor Arrays
    Teich, Juergen
    Tanase, Alexandru
    Hannig, Frank
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 77 (1-2): : 31 - 59
  • [4] Symbolic Mapping of Loop Programs onto Processor Arrays
    Jürgen Teich
    Alexandru Tanase
    Frank Hannig
    Journal of Signal Processing Systems, 2014, 77 : 31 - 59
  • [5] Symbolic Multi-Level Loop Mapping of Loop Programs for Massively Parallel Processor Arrays
    Tanase, Alexandru
    Witterauf, Michael
    Teich, Juergen
    Hannig, Frank
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2018, 17 (02)
  • [6] Optimization of communication cost within processor arrays caused by I/O
    Siegel, Sebastian
    Merker, Renate
    PROCEEDINGS OF THE 18TH IASTED INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING AND SYSTEMS, 2006, : 680 - +
  • [7] Real-time Scheduling of I/O Transfers for Massively Parallel Processor Arrays
    Walter, Dominik
    Witterauf, Michael
    Teich, Juergen
    2020 18TH ACM-IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR SYSTEM DESIGN (MEMOCODE), 2020, : 104 - 114
  • [8] LION Real-Time I/O Transfer Control for Massively Parallel Processor Arrays
    Walter, Dominik
    Teich, Juergen
    2021 19TH ACM-IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR SYSTEM DESIGN (MEMOCODE), 2022, : 32 - 43
  • [9] FAST SIGNAL PROCESSOR COMES RICH WITH MEMORY, I/O LINES ON CMOS CHIP
    RAMACHANDRAN, G
    JUJII, S
    ELECTRONIC DESIGN, 1984, 32 (10) : 227 - &
  • [10] Trace-Based Analysis and Optimization for the Semtex CFD Application - Hidden Remote Memory Accesses and I/O Performance
    Mickler, Holger
    Knuepfer, Andreas
    Kluge, Michael
    Mueller, Matthias S.
    Nagel, Wolfgang E.
    EURO-PAR 2008 WORKSHOPS - PARALLEL PROCESSING, 2009, 5415 : 295 - 304