Revisiting split tiling for stencil computations in polyhedral compilation

被引:0
|
作者
Yingying Li
Huihui Sun
Jianmin Pang
机构
[1] State Key Laboratory of Mathematical Engineering and Advanced Computing,
来源
The Journal of Supercomputing | 2022年 / 78卷
关键词
Stencils; Loop tiling; Split tiling; Polyhedral compilation; Parallelism; Locality;
D O I
暂无
中图分类号
学科分类号
摘要
Complex tile shapes maximize parallelism and locality of stencil computations by enabling tile-wise concurrent start, i.e., all tiles along a particular tiling direction of the iteration space can be started concurrently. We study split tiling—a tiling technique exploiting tile-wise concurrent start at the expense of additional synchronizations, in the context of polyhedral compilation. Derived from classical parallelogram tiling, our approach first splits a parallelogram tile into multiple phases that can be executed simultaneously with those of the neighboring tiles. The technique then minimizes the amount of synchronizations by merging boundary phases of consecutive tiles along the time-tiled direction. We implement our approach on top of a well-defined polyhedral representation, generating code for both CPUs and GPUs. The experimental results on a 16-core Intel Xeon Silver show that our approach can achieve an average improvement of 2
引用
收藏
页码:440 / 470
页数:30
相关论文
共 50 条
  • [41] Scalable Distributed High-Order Stencil Computations
    Jacquelin, Mathias
    Araya-Polo, Mauricio
    Meng, Jie
    SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2022,
  • [42] Autotuning Stencil Computations with Structural Ordinal Regression Learning
    Cosenza, Biagio
    Durillo, Juan J.
    Ermon, Stefano
    Juurlink, Ben
    2017 31ST IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2017, : 287 - 296
  • [43] Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors
    Datta, Kaushik
    Kamil, Shoaib
    Williams, Samuel
    Oliker, Leonid
    Shalf, John
    Yelick, Katherine
    SIAM REVIEW, 2009, 51 (01) : 129 - 159
  • [44] Data Partitioning Strategies for Stencil Computations on NUMA Systems
    Feinbube, Frank
    Plauth, Max
    Knaust, Marius
    Polze, Andreas
    EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 597 - 609
  • [45] Automatic Mapping and Optimization to Kokkos with Polyhedral Compilation
    Baskaran, Muthu
    Jin, Charles
    Meister, Benoit
    Springer, Jonathan
    2020 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2020,
  • [46] Verified Validation for Affine Scheduling in Polyhedral Compilation
    Li, Xuyang
    Liang, Hongjin
    Feng, Xinyu
    THEORETICAL ASPECTS OF SOFTWARE ENGINEERING, TASE 2024, 2024, 14777 : 287 - 305
  • [47] An Abstraction for Distributed Stencil Computations Using Charm plus
    Bhosale, Aditya
    Fink, Zane
    Kale, Laxmikant
    ASYNCHRONOUS MANY-TASK SYSTEMS AND APPLICATIONS, WAMTA 2024, 2024, 14626 : 123 - 134
  • [48] Optimization and Performance Modeling of Stencil Computations on ARM Architectures
    Zhang, Kaifang
    Su, Huayou
    Zhang, Peng
    Dou, Yong
    Proceedings - 2020 IEEE 22nd International Conference on High Performance Computing and Communications, IEEE 18th International Conference on Smart City and IEEE 6th International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020, 2020, : 113 - 121
  • [49] TILING 3-DIMENSIONAL SPACE WITH POLYHEDRAL TILES
    SHEPHARD, GC
    ZEITSCHRIFT FUR KRISTALLOGRAPHIE, 1983, 162 (1-4): : 206 - 207
  • [50] A Strategy for Automatic Performance Tuning of Stencil Computations on GPUs
    Garvey, Joseph D.
    Abdelrahman, Tarek S.
    SCIENTIFIC PROGRAMMING, 2018, 2018