Efficient Stencil Computation with Temporal Blocking by Halide DSL

被引:0
|
作者
Aikawa, Hiroki [1 ,2 ]
Endo, Toshio [1 ]
Yuki, Tomoya [1 ]
Hirofuchi, Takahiro [2 ]
Ikegami, Tsutomu [2 ]
机构
[1] Tokyo Inst Technol, Tokyo, Japan
[2] AIST, Tokyo, Japan
来源
2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM | 2022年
关键词
PARALLELISM;
D O I
10.1109/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The stencil computation is a memory centric computation, where memory access becomes a bottleneck. Therefore locality improvement method like temporal blocking is essential to improve the speed. Introducing temporal blocking, however, introduces heavy programming costs. This paper proposes a methodology to implement efficient stencil software with temporal blocking easier using the domainspecific language Halide. Generally, boundary conditions, such as Neumann and Dirichlet boundary conditions, are needed in stencil computations. Also user-defined boundary computations may be needed in general fluid dynamics applications. We indicate a pitfall in combining temporal blocking and boundary conditions on Halide and show a solution to improve performance. We also apply our methodology to the II otspot3 benchmark from Rodinia benchmark suite and demonstrate 3.5 to 5.0 times performance improvement over the original OpenNIP implementation.
引用
收藏
页码:870 / 877
页数:8
相关论文
共 4 条
  • [1] Revisiting Temporal Blocking Stencil Optimizations
    Zhang, Lingqi
    Wahib, Mohamed
    Chen, Peng
    Meng, Jintao
    Wang, Xiao
    Endo, Toshio
    Matsuoka, Satoshi
    PROCEEDINGS OF THE 37TH INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ACM ICS 2023, 2023, : 251 - 263
  • [2] Automatic Generation of Efficient Codes from Mathematical Descriptions of Stencil Computation
    Muranushi, Takayuki
    Nishizawa, Seiya
    Tomita, Hirofumi
    Nitadori, Keigo
    Iwasawa, Masaki
    Maruyama, Yutaka
    Yashiro, Hisashi
    Nakamura, Yoshifumi
    Hotta, Hideyuki
    Makino, Junichiro
    Hosono, Natsuki
    Inoue, Hikaru
    FHPC'16: PROCEEDINGS OF THE 5TH INTERNATIONAL WORKSHOP ON FUNCTIONAL HIGH-PERFORMANCE COMPUTING, 2016, : 17 - 22
  • [3] ChronoGraph: Enabling Temporal Graph Traversals for Efficient Information Diffusion Analysis over Time
    Byun, Jaewook
    Woo, Sungpil
    Kim, Daeyoung
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (03) : 424 - 437
  • [4] ChronoGraph: Enabling temporal graph traversals for efficient information diffusion analysis over time
    Byun, Jaewook
    Woo, Sungpil
    Kim, Daeyoung
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 2026 - 2027