Calculation of Distributed-Order Fractional Derivative on Tensor Cores-Enabled GPU

Cited: 0
Authors
Bohaienko, Vsevolod [1 ]
Affiliations
[1] NAS Ukraine, VM Glushkov Inst Cybernet, Glushkov Ave 40, Kiev, Ukraine
Keywords
Distributed-order derivative; Parallel computation; GPU; Tensor cores; Diffusion; DIFFERENTIAL-EQUATIONS; ALGORITHM; SCHEME;
DOI
10.1007/s10766-023-00754-9
Chinese Library Classification
TP301 [Theory and Methods];
Discipline Code
081202;
Abstract
Due to the increased computational complexity of calculating the values of the distributed-order Caputo fractional derivative compared to the classical Caputo derivative, there is a need for new techniques that accelerate this computation. In this paper we propose, for this purpose, to use the fast matrix "multiply and accumulate" operation available in GPUs that contain so-called tensor cores. We present and experimentally analyze the properties of GPU algorithms that are based on the L1 finite-difference approximation of the derivative, and incorporate them into the Crank-Nicolson scheme for the distributed-order time-fractional diffusion equation. The computation of the derivative's values on the GPU was faster than a multi-threaded CPU implementation only for a large number of time steps, with the performance gain growing as the number of time steps increases. Using the single-precision data type increased the error by up to 2.7% compared with the double-precision data type; half-precision computations in tensor cores increased the error by up to 29.5%. When solving a time-fractional diffusion equation, the GPU algorithms using single precision were at least three times faster than the CPU implementation for more than 1280 time steps. Data-type precision had only a slight influence on the solution error, while execution time increased significantly when the double-precision data type was used for data storage and processing.
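To illustrate why this maps well onto tensor cores: under the L1 scheme, the derivative values at all time steps can be obtained from a single lower-triangular weight matrix applied to the function's history, and a distributed-order derivative is a quadrature-weighted sum of such matrices, so the whole computation collapses to one matrix "multiply and accumulate". The Python sketch below demonstrates this structure; the function names, the quadrature nodes `alphas`, and the weights are illustrative assumptions, not the paper's actual implementation, which performs the product in half precision on tensor cores.

```python
import numpy as np
from math import gamma

def l1_weight_matrix(n_steps, alpha, dt):
    """Lower-triangular matrix W such that (W @ f) approximates the
    Caputo derivative of order alpha at t_1..t_n via the L1 scheme."""
    # L1 weights: b_k = (k+1)^(1-alpha) - k^(1-alpha)
    k = np.arange(n_steps)
    b = (k + 1.0) ** (1.0 - alpha) - k ** (1.0 - alpha)
    c = dt ** (-alpha) / gamma(2.0 - alpha)
    W = np.zeros((n_steps, n_steps + 1))
    for n in range(1, n_steps + 1):      # row n-1 gives the derivative at t_n
        for j in range(n):               # j-th backward difference f_{n-j} - f_{n-j-1}
            W[n - 1, n - j] += c * b[j]
            W[n - 1, n - j - 1] -= c * b[j]
    return W

def distributed_order_derivative(f, dt, alphas, weights):
    """Distributed-order Caputo derivative approximated by a quadrature
    over the order: sum_j w_j * D^{alpha_j} f. The summed matrix W is
    precomputed once, so each evaluation is one matrix-vector
    multiply-accumulate (the operation tensor cores accelerate,
    there in half precision)."""
    n_steps = len(f) - 1
    W = sum(w * l1_weight_matrix(n_steps, a, dt)
            for a, w in zip(alphas, weights))
    return W @ np.asarray(f, dtype=float)
```

For f(t) = t the L1 scheme is exact, since its backward differences are constant: with a single order alpha the result equals t^(1-alpha)/Gamma(2-alpha), which gives a quick correctness check.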
Pages: 256-270 (15 pages)