Symbolic Loop Parallelization for Balancing I/O and Memory Accesses on Processor Arrays

被引：0

作者：

Tanase, Alexandru ^{[1
]}

Witterauf, Michael ^{[1
]}

Teich, Juergen ^{[1
]}

Hannig, Frank ^{[1
]}

机构：

[1] Friedrich Alexander Univ Erlangen Nurnberg FAU, Dept Comp Sci, Hardware Software Co Design, Nurnberg, Germany

来源：

2015 ACM/IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR CODESIGN (MEMOCODE) | 2015年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Loop parallelization techniques for massively parallel processor arrays using one-level tiling are often either I/O- or memory-bounded, exceeding the target architecture's capabilities. Furthermore, if the number of available processing elements is only known at runtime-as in adaptive systems-static approaches fail. To solve these problems, we present a hybrid compile/runtime technique to symbolically parallelize loop nests with uniform dependences on multiple levels. At compile time, two novel transformations are performed: (a) symbolic hierarchical tiling followed by (b) symbolic multi-level scheduling. By tuning the size of the tiles on multiple levels, a trade-off between the necessary I/O-bandwidth and memory is possible, which facilitates obeying resource constraints. The resulting schedules are symbolic with respect to the number of tiles; thus, the number of processing elements to map onto does not need to be known at compile time. At runtime, when the number is known, a simple prolog chooses a feasible schedule with respect to I/O and memory constraints that is latency-optimal for the chosen tile size. In this way, our approach dynamically chooses latency-optimal and feasible schedules while avoiding expensive re-compilations.

引用

页码：188 / 197

页数：10

共 15 条

[1] Symbolic Parallelization of Loop Programs for Massively Parallel Processor Arrays
Teich, Juergen
Tanase, Alexandru
Hannig, Frank
PROCEEDINGS OF THE 2013 IEEE 24TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 13), 2013, : 1 - 9
[2] Symbolic Loop Compilation for Tightly Coupled Processor Arrays
Witterauf, Michael
Walter, Dominik
Hannig, Frank
Teich, Juergen
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2021, 20 (05)
[3] Symbolic Mapping of Loop Programs onto Processor Arrays
Teich, Juergen
Tanase, Alexandru
Hannig, Frank
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 77 (1-2): : 31 - 59
[4] Symbolic Mapping of Loop Programs onto Processor Arrays
Jürgen Teich
Alexandru Tanase
Frank Hannig
Journal of Signal Processing Systems, 2014, 77 : 31 - 59
[5] Symbolic Multi-Level Loop Mapping of Loop Programs for Massively Parallel Processor Arrays
Tanase, Alexandru
Witterauf, Michael
Teich, Juergen
Hannig, Frank
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2018, 17 (02)
[6] Optimization of communication cost within processor arrays caused by I/O
Siegel, Sebastian
Merker, Renate
PROCEEDINGS OF THE 18TH IASTED INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING AND SYSTEMS, 2006, : 680 - +
[7] Real-time Scheduling of I/O Transfers for Massively Parallel Processor Arrays
Walter, Dominik
Witterauf, Michael
Teich, Juergen
2020 18TH ACM-IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR SYSTEM DESIGN (MEMOCODE), 2020, : 104 - 114
[8] LION Real-Time I/O Transfer Control for Massively Parallel Processor Arrays
Walter, Dominik
Teich, Juergen
2021 19TH ACM-IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR SYSTEM DESIGN (MEMOCODE), 2022, : 32 - 43
[9] FAST SIGNAL PROCESSOR COMES RICH WITH MEMORY, I/O LINES ON CMOS CHIP
RAMACHANDRAN, G
JUJII, S
ELECTRONIC DESIGN, 1984, 32 (10) : 227 - &
[10] Trace-Based Analysis and Optimization for the Semtex CFD Application - Hidden Remote Memory Accesses and I/O Performance
Mickler, Holger
Knuepfer, Andreas
Kluge, Michael
Mueller, Matthias S.
Nagel, Wolfgang E.
EURO-PAR 2008 WORKSHOPS - PARALLEL PROCESSING, 2009, 5415 : 295 - 304

← 1 2 →