Insight into tiles generated by means of a correction technique

被引:7
作者
Bielecki, Wlodzimierz [1 ]
Skotnicki, Piotr [1 ]
机构
[1] West Pomeranian Univ Technol, Fac Comp Sci, Ul Zolnierska 49, PL-71210 Szczecin, Poland
关键词
Optimizing compilers; Tiling; Transitive closure; Dependence graph; Code locality; 68N20; 68N15; 68M20; 68R01; 57M15; 90C10;
D O I
10.1007/s11227-018-2678-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Well-known techniques for tiled code generation are based on the polyhedral model and affine transformations. An alternative approach to generation of tiled code is to correct original rectangular tiles defined for a loop nest by means of the transitive closure of a dependence graph instead of deriving and applying affine transformations. In this paper, we present results of an analysis of basic features of tiles generated due to correction of original rectangular tiles. We introduce procedures which allow us to recognize such features as target tile type (fixed, varied, parametric), dimensionality, size (the number of statement instances within a tile), and loop nest tileability (the percentage of statement instances that can be tiled with rectangular tiles). We consider differences between those features of tiles generated by means of affine transformations and transitive closure. We also discuss results of experiments with PolyBench benchmarks and show how differences in tiles generated with the examined approach and affine transformations affect serial tiled code performance.
引用
收藏
页码:2665 / 2690
页数:26
相关论文
共 33 条
[1]  
[Anonymous], 2013, GPR, DOI [DOI 10.1145/2400682.2400713, DOI 10.19476/J.YSXB.1004.0609.2013.09.014]
[2]  
[Anonymous], POLYHEDRAL BENCHMARK
[3]   Coarse-grained loop parallelization: Iteration Space Slicing vs affine transformations [J].
Beletska, Anna ;
Bielecki, Wlodzimierz ;
Cohen, Albert ;
Palkowski, Marek ;
Siedlecki, Krzysztof .
PARALLEL COMPUTING, 2011, 37 (08) :479-497
[4]  
Bielecki Wlodzimierz, 2013, Combinatorial Optimization and Applications. 7th International Conference, COCOA 2013. Proceedings: LNCS 8287, P129, DOI 10.1007/978-3-319-03780-6_12
[5]   Generation of parallel synchronization-free tiled code [J].
Bielecki, Wlodzimierz ;
Palkowski, Marek ;
Skotnicki, Piotr .
COMPUTING, 2018, 100 (03) :277-302
[6]   TILING ARBITRARILY NESTED LOOPS BY MEANS OF THE TRANSITIVE CLOSURE OF DEPENDENCE GRAPHS [J].
Bielecki, Wlodzimierz ;
Palkowski, Marek .
INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2016, 26 (04) :919-939
[7]   Free scheduling for statement instances of parameterized arbitrarily nested affine loops [J].
Bielecki, Wlodzimierz ;
Palkowski, Marek ;
Klimek, Tomasz .
PARALLEL COMPUTING, 2012, 38 (09) :518-532
[8]  
Bielecki W, 2010, LECT NOTES COMPUT SC, V6508, P104, DOI 10.1007/978-3-642-17458-2_10
[9]   A practical automatic polyhedral parallelizer and locality optimizer [J].
Bondhugula, Uday ;
Hartono, Albert ;
Ramanujam, J. ;
Sadayappan, P. .
ACM SIGPLAN NOTICES, 2008, 43 (06) :101-113
[10]  
Bondhugula U, 2008, LECT NOTES COMPUT SC, V4959, P132