Large-Scale Algorithm Design for Parallel FFT-based Simulations on GPUs

被引:0
|
作者
Kulkarni, Anuva [1 ]
Franchetti, Franz [1 ]
Kovacevic, Jelena [1 ]
机构
[1] Carnegie Mellon Univ, Dept Elect & Comp Engn, Pittsburgh, PA 15213 USA
来源
2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018) | 2018年
基金
美国国家科学基金会;
关键词
Irregular domain decomposition; algorithm design; GPU; lossy compression;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We describe and analyze a co-design of algorithm and software for high-performance simulation of a partial differential equation (PDE) numerical solver for large-scale datasets. Large-scale scientific simulations involving parallel Fast Fourier Transforms (FFTs) have extreme memory requirements and high communication cost. This hampers high resolution analysis with fine grids. Moreover, it is difficult to accelerate legacy Fortran scientific codes with modern hardware such as GPUs because of memory constraints of GPUs. Our proposed solution uses signal processing techniques such as lossy compression and domain-local FFTs to lower iteration cost without adversely impacting accuracy of the result. In this work, we discuss proof-of-concept results for various aspects of algorithm development.
引用
收藏
页码:301 / 305
页数:5
相关论文
共 50 条
  • [21] Hierarchical Parallel Algorithm for Modularity-Based Community Detection Using GPUs
    Cheong, Chun Yew
    Huynh, Huynh Phung
    Lo, David
    Goh, Rick Siow Mong
    EURO-PAR 2013 PARALLEL PROCESSING, 2013, 8097 : 775 - 787
  • [22] Efficient and Large-Scale Dissipative Particle Dynamics Simulations on GPU
    Yang, Keda
    Bai, Zhiqiang
    Su, Jiaye
    Guo, Hongxia
    SOFT MATERIALS, 2014, 12 (02) : 185 - 196
  • [23] Towards Large-Scale Molecular Dynamics Simulations on Graphics Processors
    Davis, Joseph E.
    Ozsoy, Adnan
    Patel, Sandeep
    Taufer, Michela
    BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2009, 5462 : 176 - 186
  • [24] Extension of Parallel Primitives and Their Applications to Large-Scale Data Processing
    Nakano, Masashi
    Chang, Qiong
    Miyazaki, Jun
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, DEXA 2024, 2024, 14911 : 248 - 253
  • [25] PARALLEL SIMULATION OF LARGE-SCALE ARTIFICIAL SOCIETY WITH GPU AS COPROCESSOR
    Guo, Gang
    Chen, Bin
    Qiu, Xiaogang
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2013, 4 (02)
  • [26] TC-Stream: Large-Scale Graph Triangle Counting on a Single Machine Using GPUs
    Huang, Jianqiang
    Wang, Haojie
    Fei, Xiang
    Wang, Xiaoying
    Chen, Wenguang
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (11) : 3067 - 3078
  • [27] Streaming parallel GPU acceleration of large-scale filter-based spiking neural networks
    Slazynski, Leszek
    Bohte, Sander
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2012, 23 (04) : 183 - 211
  • [28] A PARALLEL DOMAIN DECOMPOSITION ALGORITHM FOR LARGE SCALE IMAGE DENOISING
    Chen, Rongliang
    Huang, Jizu
    Cai, Xiao-Chuan
    INVERSE PROBLEMS AND IMAGING, 2019, 13 (06) : 1259 - 1282
  • [29] A fast calculation method for large-scale shell structure based on multigird method and GPU parallel computing
    State Key Laboratory of Advanced Design and Manufacturing for Vehicle Body, Hunan University, Changsha 410082, China
    Gongcheng Lixue, 5 (20-26): : 20 - 26
  • [30] GPU Based Parallel Matrix Exponential Algorithm for Large Scale Power System Electromagnetic Transient Simulation
    Zhao, Jinli
    Liu, Juntao
    Li, Peng
    Fu, Xiaopeng
    Song, Guanyu
    Wang, Chengshan
    2016 IEEE INNOVATIVE SMART GRID TECHNOLOGIES - ASIA (ISGT-ASIA), 2016, : 110 - 114