Large-Scale Algorithm Design for Parallel FFT-based Simulations on GPUs

Cited by: 0
Authors
Kulkarni, Anuva [1]
Franchetti, Franz [1]
Kovacevic, Jelena [1]
Affiliations
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
Funding
National Science Foundation (USA)
Keywords
Irregular domain decomposition; algorithm design; GPU; lossy compression;
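DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract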
We describe and analyze a co-design of algorithm and software for high-performance simulation with a numerical partial differential equation (PDE) solver on large-scale datasets. Large-scale scientific simulations involving parallel Fast Fourier Transforms (FFTs) have extreme memory requirements and high communication cost, which hampers high-resolution analysis on fine grids. Moreover, legacy Fortran scientific codes are difficult to accelerate on modern hardware such as GPUs because GPU memory is limited. Our proposed solution uses signal processing techniques such as lossy compression and domain-local FFTs to lower iteration cost without adversely impacting the accuracy of the result. In this work, we discuss proof-of-concept results for various aspects of algorithm development.
Pages: 301-305
Page count: 5