Revisiting split tiling for stencil computations in polyhedral compilation

Authors
Yingying Li
Huihui Sun
Jianmin Pang
Affiliation
[1] State Key Laboratory of Mathematical Engineering and Advanced Computing,
Source
The Journal of Supercomputing | 2022 / Vol. 78
Keywords
Stencils; Loop tiling; Split tiling; Polyhedral compilation; Parallelism; Locality
Abstract
Complex tile shapes maximize parallelism and locality of stencil computations by enabling tile-wise concurrent start, i.e., all tiles along a particular tiling direction of the iteration space can be started concurrently. We study split tiling, a tiling technique that exploits tile-wise concurrent start at the expense of additional synchronizations, in the context of polyhedral compilation. Derived from classical parallelogram tiling, our approach first splits a parallelogram tile into multiple phases that can be executed simultaneously with those of the neighboring tiles. The technique then minimizes the number of synchronizations by merging boundary phases of consecutive tiles along the time-tiled direction. We implement our approach on top of a well-defined polyhedral representation, generating code for both CPUs and GPUs. The experimental results on a 16-core Intel Xeon Silver show that our approach can achieve an average improvement of 2
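
To make the idea of phases with tile-wise concurrent start concrete, the following is a minimal sketch and not the authors' polyhedral implementation. It assumes a 1D three-point Jacobi stencil with fixed boundary values, a grid size that is a multiple of the tile width, and a tile width of at least twice the time-tile height. It uses the textbook split-tiling decomposition into upright trapezoids (phase 1) and inverted triangles (phase 2) rather than the parallelogram-derived phases described in the abstract, and it does not attempt the merging of boundary phases across consecutive time tiles. All names (`jacobi_step`, `naive`, `split_tiled`, `width`, `height`) are hypothetical.

```python
def jacobi_step(prev, x):
    """Three-point Jacobi update for interior point x (illustrative stencil)."""
    return (prev[x - 1] + prev[x] + prev[x + 1]) / 3.0


def naive(u0, steps):
    """Reference: one full sweep per time step, boundary values held fixed."""
    cur = list(u0)
    for _ in range(steps):
        nxt = list(cur)
        for x in range(1, len(cur) - 1):
            nxt[x] = jacobi_step(cur, x)
        cur = nxt
    return cur


def split_tiled(u0, steps, width, height):
    """Split tiling sketch: two phases per time tile, one barrier between them."""
    n = len(u0)
    # Assumptions of this sketch, not requirements stated in the paper.
    assert n % width == 0 and width >= 2 * height and steps % height == 0
    cur = list(u0)
    for _ in range(steps // height):
        # grid[t] is time level t inside the current time tile.
        # Columns 0 and n-1 are never written, so boundaries stay fixed.
        grid = [list(cur) for _ in range(height + 1)]

        # Phase 1: upright trapezoids, one per spatial tile. Each shrinks by
        # one point per side per time step, so tiles are independent.
        # Tiles are visited in reverse to show that their order is irrelevant.
        for lo in reversed(range(0, n, width)):
            for t in range(1, height + 1):
                for x in range(max(lo + t, 1), min(lo + width - t, n - 1)):
                    grid[t][x] = jacobi_step(grid[t - 1], x)

        # (A parallel version would place a synchronization here.)

        # Phase 2: inverted triangles centered on tile edges fill the gaps.
        # They read phase-1 results but are independent of one another.
        for c in reversed(range(0, n + 1, width)):
            for t in range(1, height + 1):
                for x in range(max(c - t, 1), min(c + t, n - 1)):
                    grid[t][x] = jacobi_step(grid[t - 1], x)

        cur = grid[height]
    return cur


u0 = [float((7 * i) % 11) for i in range(32)]
assert split_tiled(u0, steps=8, width=8, height=4) == naive(u0, steps=8)
```

The sketch keeps every time level of a tile in memory for clarity. Within each phase the outer loop over tiles carries no dependence, which is what allows all tiles to start concurrently. The price is the extra synchronization between the two phases.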
Pages: 440-470
Page count: 30