Multithreaded implicitly dealiased convolutions

被引：5

作者：

Roberts, Malcolm ^{[1
]}

Bowman, John C. ^{[2
]}

机构：

[1] Comp Modelling Grp Ltd, 3710 33 St NW, Calgary, AB T2L 2M1, Canada

[2] Univ Alberta, Dept Math & Stat Sci, Edmonton, AB T6G 2G1, Canada

来源：

JOURNAL OF COMPUTATIONAL PHYSICS | 2018年 / 356卷

基金：

加拿大自然科学与工程研究理事会;

关键词：

Convolution; Implicit dealiasing; Fast Fourier transform; Multithreading; Parallelization; Pseudospectral method;

D O I：

10.1016/j.jcp.2017.11.026

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Implicit dealiasing is a method for computing in-place linear convolutions via fast Fourier transforms that decouples work memory from input data. It offers easier memory management and, for long one-dimensional input sequences, greater efficiency than conventional zero-padding. Furthermore, for convolutions of multidimensional data, the segregation of data and work buffers can be exploited to reduce memory usage and execution time significantly. This is accomplished by processing and discarding data as it is generated, allowing work memory to be reused, for greater data locality and performance. A multithreaded implementation of implicit dealiasing that accepts an arbitrary number of input and output vectors and a general multiplication operator is presented, along with an improved one-dimensional Hermitian convolution that avoids the loop dependency inherent in previous work. An alternate data format that can accommodate a Nyquist mode and enhance cache efficiency is also proposed. (c) 2017 Elsevier Inc. All rights reserved.

引用

页码：98 / 114

页数：17

共 50 条

[1] EFFICIENT DEALIASED CONVOLUTIONS WITHOUT PADDING
Bowman, John C.
Roberts, Malcolm
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2011, 33 (01) : 386 - 406
[2] HYBRID DEALIASING OF COMPLEX CONVOLUTIONS
Murasko, Noel
Bowman, John C.
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2024, 46 (03) : B159 - B178
[3] Multithreaded Processors
Arun, Venkat
RESONANCE-JOURNAL OF SCIENCE EDUCATION, 2015, 20 (09): : 844 - 855
[4] Multithreaded processors
Arun V.
Resonance, 2015, 20 (9) : 844 - 855
[5] On convolutions and neutrix convolutions involving the incomplete Gamma function
Fisher, B
Jolevska-Tuneska, B
Takaci, A
INTEGRAL TRANSFORMS AND SPECIAL FUNCTIONS, 2004, 15 (05) : 405 - 414
[6] MULTITHREADED PARALLELISM WITH OPENMP
Rufai, Raimi
Bozyigit, Muslim
Alghamdi, Jaralla
Ahmed, Moataz
PARALLEL PROCESSING LETTERS, 2005, 15 (04) : 367 - 378
[7] Multithreaded decoupled architecture
Dorojevets, MN
Oklobdzija, VG
INTERNATIONAL JOURNAL OF HIGH SPEED COMPUTING, 1995, 7 (03): : 465 - 480
[8] On fractional convolutions and distributions
Jain, Pankaj
Jain, Sandhya
Kumar, Rajender
INTEGRAL TRANSFORMS AND SPECIAL FUNCTIONS, 2015, 26 (11) : 885 - 899
[9] Approximations by convolutions and antiderivatives
A. M. Sedletskii
Mathematical Notes, 2006, 79 : 697 - 706
[10] Nonhomogeneous Rankin convolutions
Vinogradov A.I.
Journal of Mathematical Sciences, 1998, 89 (1) : 933 - 944

← 1 2 3 4 5 →