Automating GPU Scalability for Complex Scientific Models: Phonon Boltzmann Transport Equation

被引:0
|
作者
Heisler, Eric [1 ]
Saurav, Siddharth [2 ]
Deshmukh, Aadesh [1 ]
Mazumder, Sandip [2 ]
Sundar, Hari [1 ]
机构
[1] Univ Utah, Kahlert Sch Comp, Salt Lake City, UT 84112 USA
[2] Ohio State Univ, Mech & Aerosp Engn, Columbus, OH USA
来源
PROCEEDINGS 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS 2024 | 2024年
基金
美国国家科学基金会;
关键词
Domain-specific language; GPU; code generation; physics; differential equations; PARALLEL COMPUTATION;
D O I
10.1109/IPDPS57955.2024.00045
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Heterogeneous computing environments combining CPU and GPU resources provide a great boost to large-scale scientific computing applications. Code generation utilities that partition the work into CPU and GPU tasks while considering data movement costs allow researchers to develop high-performance solutions more quickly and easily, and make these resources accessible to a larger user base. We present developments for a domain-specific language (DSL) and code generation framework for solving partial differential equations (PDEs). These enhancements facilitate GPU-accelerated solution of the Boltzmann transport equation (BTE) for phonons, which is the governing equation for simulating thermal transport in semiconductor materials at sub-micron scales. The solution of the BTE involves thousands of coupled PDEs as well as complicated boundary conditions and solving a nonlinear equation that couples all of the degrees of freedom at each time step. These developments enable the DSL to generate configurable hybrid GPU/CPU code that couples accelerated kernels with user-defined code. We observed performance improvements of around 18X compared to a CPU-only version produced by this same DSL with minimal additional programming effort.
引用
收藏
页码:430 / 439
页数:10
相关论文
共 10 条