Revisiting split tiling for stencil computations in polyhedral compilation

被引:0
|
作者
Yingying Li
Huihui Sun
Jianmin Pang
机构
[1] State Key Laboratory of Mathematical Engineering and Advanced Computing,
来源
The Journal of Supercomputing | 2022年 / 78卷
关键词
Stencils; Loop tiling; Split tiling; Polyhedral compilation; Parallelism; Locality;
D O I
暂无
中图分类号
学科分类号
摘要
Complex tile shapes maximize parallelism and locality of stencil computations by enabling tile-wise concurrent start, i.e., all tiles along a particular tiling direction of the iteration space can be started concurrently. We study split tiling—a tiling technique exploiting tile-wise concurrent start at the expense of additional synchronizations, in the context of polyhedral compilation. Derived from classical parallelogram tiling, our approach first splits a parallelogram tile into multiple phases that can be executed simultaneously with those of the neighboring tiles. The technique then minimizes the amount of synchronizations by merging boundary phases of consecutive tiles along the time-tiled direction. We implement our approach on top of a well-defined polyhedral representation, generating code for both CPUs and GPUs. The experimental results on a 16-core Intel Xeon Silver show that our approach can achieve an average improvement of 2
引用
收藏
页码:440 / 470
页数:30
相关论文
共 50 条
  • [31] Modeling Stencil Computations on Modern HPC Architectures
    de la Cruz, Raul
    Araya-Polo, Mauricio
    HIGH PERFORMANCE COMPUTING SYSTEMS: PERFORMANCE MODELING, BENCHMARKING, AND SIMULATION, 2015, 8966 : 149 - 171
  • [32] Autotuning Stencil-Based Computations on GPUs
    Mametjanov, Azamat
    Lowell, Daniel
    Ma, Ching-Chen
    Norris, Boyana
    2012 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2012, : 266 - 274
  • [33] Automatic Performance Tuning of Stencil Computations on GPUs
    Garvey, Joseph D.
    Abdelrahman, Tarek S.
    2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 300 - 309
  • [34] Autotuning divide-and-conquer stencil computations
    Natarajan, Ekanathan Palamadai
    Dehnavi, Maryam Mehri
    Leiserson, Charles
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (17):
  • [35] The memory behavior of cache oblivious stencil computations
    Frigo, Matteo
    Strumpen, Volker
    JOURNAL OF SUPERCOMPUTING, 2007, 39 (02): : 93 - 112
  • [36] Automatic Partitioning of Stencil Computations on Heterogeneous Systems
    Pereira, Alyson D.
    Rocha, Rodrigo C. O.
    Ramos, Luiz
    Castro, Marcio
    Goes, Luis F. W.
    2017 INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOPS (SBAC-PADW), 2017, : 43 - 48
  • [37] Speeding Up Stencil Computations with Kernel Convolution
    Januario, Guilherme C.
    Rosenburg, Bryan S.
    Park, Yoonho
    Perrone, Michael
    Moreira, Jose
    Carvalho, Tereza C. M. B.
    PROCEEDINGS OF 28TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, (SBAC-PAD 2016), 2016, : 76 - 83
  • [38] DEPTH COMPUTATIONS FROM POLYHEDRAL IMAGES
    SPARR, G
    IMAGE AND VISION COMPUTING, 1992, 10 (10) : 683 - 688
  • [39] DEPTH COMPUTATIONS FROM POLYHEDRAL IMAGES
    SPARR, G
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 588 : 378 - 386
  • [40] Low Precision Processing for High Order Stencil Computations
    Singh, Gagandeep
    Diamantopoulos, Dionysios
    Stuijk, Sander
    Hagleitner, Christoph
    Corporaal, Henk
    EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2019, 2019, 11733 : 403 - 415