Large-Scale Algorithm Design for Parallel FFT-based Simulations on GPUs

被引:0
|
作者
Kulkarni, Anuva [1 ]
Franchetti, Franz [1 ]
Kovacevic, Jelena [1 ]
机构
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
来源
2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018) | 2018年
基金
美国国家科学基金会;
关键词
Irregular domain decomposition; algorithm design; GPU; lossy compression;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We describe and analyze a co-design of algorithm and software for high-performance simulation of a partial differential equation (PDE) numerical solver for large-scale datasets. Large-scale scientific simulations involving parallel Fast Fourier Transforms (FFTs) have extreme memory requirements and high communication cost. This hampers high resolution analysis with fine grids. Moreover, it is difficult to accelerate legacy Fortran scientific codes with modern hardware such as GPUs because of memory constraints of GPUs. Our proposed solution uses signal processing techniques such as lossy compression and domain-local FFTs to lower iteration cost without adversely impacting accuracy of the result. In this work, we discuss proof-of-concept results for various aspects of algorithm development.
引用
收藏
页码:301 / 305
页数:5
相关论文
共 50 条
  • [1] HI-FFT: Heterogeneous Parallel In-Place Algorithm for Large-Scale 2D-FFT
    Kang, Homin
    Lee, Jaehong
    Kim, Duksu
    IEEE ACCESS, 2021, 9 : 120261 - 120273
  • [2] Parallel Algorithm of IDCT with GPUs and CUDA for Large-scale Video Quality of 3G
    Chen, Qingkui
    Wang, Haifeng
    Zhuang, Songlin
    Liu, Bocheng
    JOURNAL OF COMPUTERS, 2012, 7 (08) : 1880 - 1886
  • [3] Large-Scale direct numerical simulations of turbulence using GPUs and modern Fortran
    Karp, Martin
    Massaro, Daniele
    Jansson, Niclas
    Hart, Alistair
    Wahlgren, Jacob
    Schlatter, Philipp
    Markidis, Stefano
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2023, 37 (05) : 487 - 502
  • [4] CuLDA: Solving Large-scale LDA Problems on GPUs
    Xie, Xiaolong
    Liang, Yun
    Li, Xiuhong
    Tan, Wei
    HPDC'19: PROCEEDINGS OF THE 28TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, 2019, : 195 - 205
  • [5] Large-scale powder mixer simulations using massively parallel GPU architectures
    Radeke, Charles A.
    Glasser, Benjamin J.
    Khinast, Johannes G.
    CHEMICAL ENGINEERING SCIENCE, 2010, 65 (24) : 6435 - 6442
  • [6] Efficient Parallel UPGMA algorithm Based on Multiple GPUs
    Hung, Che-Lun
    Wu, Fu-Che
    Lin, Chun-Yuan
    Chan, Yu-Wei
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 870 - 873
  • [7] An Efficient Parallel ISODATA Algorithm Based on Kepler GPUs
    Yang, Shiquan
    Dong, Jianqiang
    Yuan, Bo
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2444 - 2449
  • [8] Large-scale flow simulations using lattice Boltzmann method with AMR following free-surface on multiple GPUs
    Watanabe, Seiya
    Aoki, Takayuki
    COMPUTER PHYSICS COMMUNICATIONS, 2021, 264 (264)
  • [9] ngAP: Non-blocking Large-scale Automata Processing on GPUs
    Ge, Tianao
    Zhang, Tong
    Liu, Hongyuan
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, ASPLOS 2024, VOL 1, 2024, : 268 - 285
  • [10] GStream: A Graph Streaming Processing Method for Large-Scale Graphs on GPUs
    Seo, Hyunseok
    Kim, Jinwook
    Kim, Min-Soo
    ACM SIGPLAN NOTICES, 2015, 50 (08) : 253 - 254