Multithreaded implicitly dealiased convolutions

被引:5
作者
Roberts, Malcolm [1 ]
Bowman, John C. [2 ]
机构
[1] Comp Modelling Grp Ltd, 3710 33 St NW, Calgary, AB T2L 2M1, Canada
[2] Univ Alberta, Dept Math & Stat Sci, Edmonton, AB T6G 2G1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Convolution; Implicit dealiasing; Fast Fourier transform; Multithreading; Parallelization; Pseudospectral method;
D O I
10.1016/j.jcp.2017.11.026
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Implicit dealiasing is a method for computing in-place linear convolutions via fast Fourier transforms that decouples work memory from input data. It offers easier memory management and, for long one-dimensional input sequences, greater efficiency than conventional zero-padding. Furthermore, for convolutions of multidimensional data, the segregation of data and work buffers can be exploited to reduce memory usage and execution time significantly. This is accomplished by processing and discarding data as it is generated, allowing work memory to be reused, for greater data locality and performance. A multithreaded implementation of implicit dealiasing that accepts an arbitrary number of input and output vectors and a general multiplication operator is presented, along with an improved one-dimensional Hermitian convolution that avoids the loop dependency inherent in previous work. An alternate data format that can accommodate a Nyquist mode and enhance cache efficiency is also proposed. (c) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:98 / 114
页数:17
相关论文
共 50 条
  • [1] EFFICIENT DEALIASED CONVOLUTIONS WITHOUT PADDING
    Bowman, John C.
    Roberts, Malcolm
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2011, 33 (01) : 386 - 406
  • [2] HYBRID DEALIASING OF COMPLEX CONVOLUTIONS
    Murasko, Noel
    Bowman, John C.
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2024, 46 (03) : B159 - B178
  • [3] Multithreaded Processors
    Arun, Venkat
    RESONANCE-JOURNAL OF SCIENCE EDUCATION, 2015, 20 (09): : 844 - 855
  • [4] Multithreaded processors
    Arun V.
    Resonance, 2015, 20 (9) : 844 - 855
  • [5] On convolutions and neutrix convolutions involving the incomplete Gamma function
    Fisher, B
    Jolevska-Tuneska, B
    Takaci, A
    INTEGRAL TRANSFORMS AND SPECIAL FUNCTIONS, 2004, 15 (05) : 405 - 414
  • [6] MULTITHREADED PARALLELISM WITH OPENMP
    Rufai, Raimi
    Bozyigit, Muslim
    Alghamdi, Jaralla
    Ahmed, Moataz
    PARALLEL PROCESSING LETTERS, 2005, 15 (04) : 367 - 378
  • [7] Multithreaded decoupled architecture
    Dorojevets, MN
    Oklobdzija, VG
    INTERNATIONAL JOURNAL OF HIGH SPEED COMPUTING, 1995, 7 (03): : 465 - 480
  • [8] On fractional convolutions and distributions
    Jain, Pankaj
    Jain, Sandhya
    Kumar, Rajender
    INTEGRAL TRANSFORMS AND SPECIAL FUNCTIONS, 2015, 26 (11) : 885 - 899
  • [9] Approximations by convolutions and antiderivatives
    A. M. Sedletskii
    Mathematical Notes, 2006, 79 : 697 - 706
  • [10] Nonhomogeneous Rankin convolutions
    Vinogradov A.I.
    Journal of Mathematical Sciences, 1998, 89 (1) : 933 - 944