Large-Scale Algorithm Design for Parallel FFT-based Simulations on GPUs

被引:0
|
作者
Kulkarni, Anuva [1 ]
Franchetti, Franz [1 ]
Kovacevic, Jelena [1 ]
机构
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
Irregular domain decomposition; algorithm design; GPU; lossy compression;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We describe and analyze a co-design of algorithm and software for high-performance simulation of a partial differential equation (PDE) numerical solver for large-scale datasets. Large-scale scientific simulations involving parallel Fast Fourier Transforms (FFTs) have extreme memory requirements and high communication cost. This hampers high resolution analysis with fine grids. Moreover, it is difficult to accelerate legacy Fortran scientific codes with modern hardware such as GPUs because of memory constraints of GPUs. Our proposed solution uses signal processing techniques such as lossy compression and domain-local FFTs to lower iteration cost without adversely impacting accuracy of the result. In this work, we discuss proof-of-concept results for various aspects of algorithm development.
引用
收藏
页码:301 / 305
页数:5
相关论文
共 50 条
  • [1] Design of large-scale parallel simulations
    Knepley, MG
    Sameh, AH
    Sarin, V
    PARALLEL COMPUTATIONAL FLUID DYNAMICS: TOWARDS TERAFLOPS, OPTIMIZATION, AND NOVEL FORMULATIONS, 2000, : 273 - 279
  • [2] Application of FFT-based Algorithms for Large-Scale Universal Kriging Problems
    J. Fritz
    I. Neuweiler
    W. Nowak
    Mathematical Geosciences, 2009, 41 : 509 - 533
  • [3] Application of FFT-based Algorithms for Large-Scale Universal Kriging Problems
    Fritz, J.
    Neuweiler, I.
    Nowak, W.
    MATHEMATICAL GEOSCIENCES, 2009, 41 (05) : 509 - 533
  • [4] Validation of FFT-based algorithms for large-scale modeling of wave propagation in tissue
    Mould, JC
    Wojcik, GL
    Carcione, LM
    Tabei, M
    Mast, TD
    Waag, RC
    1999 IEEE ULTRASONICS SYMPOSIUM PROCEEDINGS, VOLS 1 AND 2, 1999, : 1551 - 1556
  • [5] MLFMA-FFT Parallel Algorithm for the Solution of Large-Scale Problems in Electromagnetics
    Taboada, J. M.
    Araujo, M. G.
    Bertolo, J. M.
    Landesa, L.
    Obelleiro, F.
    Rodriguez, J. L.
    PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2010, 105 : 15 - 30
  • [6] HI-FFT: Heterogeneous Parallel In-Place Algorithm for Large-Scale 2D-FFT
    Kang, Homin
    Lee, Jaehong
    Kim, Duksu
    IEEE ACCESS, 2021, 9 : 120261 - 120273
  • [7] An Optimized FFT-Based Direct Poisson Solver on CUDA GPUs
    Wu, Jing
    JaJa, Joseph
    Balaras, Elias
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (03) : 550 - 559
  • [8] Parallel Algorithm of IDCT with GPUs and CUDA for Large-scale Video Quality of 3G
    Chen, Qingkui
    Wang, Haifeng
    Zhuang, Songlin
    Liu, Bocheng
    JOURNAL OF COMPUTERS, 2012, 7 (08) : 1880 - 1886
  • [9] Parallel distributed FFT-based solvers for 3-D Poisson problems in Meso-scale atmospheric simulations
    Giraud, L
    Guivarch, R
    Stein, J
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2001, 15 (01): : 36 - 46
  • [10] Large-scale parallel simulations in computational electromagnetics
    Sankar, V
    Kabakian, A
    Rowell, C
    Sahely, T
    COMPUTATIONAL FLUID DYNAMICS 2000, 2001, : 411 - 416