A Memory Efficient Parallel All-Pairs Computation Framework: Computation - Communication Overlap

被引:1
|
作者
Yeleswarapu, Venkata Kasi Viswanath [1 ]
Somani, Arun K. [1 ]
机构
[1] Iowa State Univ, Dept Elect & Comp Engn, Ames, IA 50010 USA
来源
PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2017), PT I | 2018年 / 10777卷
基金
美国国家科学基金会;
关键词
Communication - computation overlap; High performance computing; All-Pairs problems; Parallel computing; MPI;
D O I
10.1007/978-3-319-78024-5_39
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
All-Pairs problems require each data element in a set of N data elements to be paired with every other data element for specific computation using the two data elements. Our framework aims to address recurring problems of scalability, distributing equal work load to all nodes and by reducing memory footprint. We reduce memory footprint of All-Pairs problems, by reducing memory requirement from N/root P to 3N/P. A bio-informatics application is implemented to demonstrate the scalability ranging up to 512 cores for the data set we experimented, redundancy management, and speed up performance of the framework.
引用
收藏
页码:443 / 458
页数:16
相关论文
共 42 条
  • [31] A Parallel Computing Method for the Computation of the Moore-Penrose Generalized Inverse for Shared-Memory Architectures
    Gelvez-Almeida, Elkin
    Barrientos, Ricardo J.
    Vilches-Ponce, Karina
    Mora, Marco
    IEEE ACCESS, 2023, 11 : 134834 - 134845
  • [32] Maximizing Communication–Computation Overlap Through Automatic Parallelization and Run-time Tuning of Non-blocking Collective Operations
    Youcef Barigou
    Edgar Gabriel
    International Journal of Parallel Programming, 2017, 45 : 1390 - 1416
  • [33] Maximizing Communication-Computation Overlap Through Automatic Parallelization and Run-time Tuning of Non-blocking Collective Operations
    Barigou, Youcef
    Gabriel, Edgar
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (06) : 1390 - 1416
  • [34] Adaptive Barrier Algorithm in MPI Based on Analytical Evaluations for Communication Time in the LogP Model of Parallel Computation
    Zharikov, Vitalii V.
    Paznikov, Alexey A.
    Pavsky, Kirill V.
    Pavsky, Valery A.
    2018 INTERNATIONAL SCIENTIFIC MULTI-CONFERENCE ON INDUSTRIAL ENGINEERING AND MODERN TECHNOLOGIES (FAREASTCON), 2018,
  • [35] Enabling Efficient Hybrid Systolic Computation in Shared-L1-Memory Manycore Clusters
    Mazzola, Sergio
    Riedel, Samuel
    Benini, Luca
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2024, 32 (09) : 1602 - 1615
  • [36] Communication-Computation Overlapping with Dynamic Loop Scheduling for Preconditioned Parallel Iterative Solvers on Multicore and Manycore Clusters
    Nakajima, Kengo
    Hanawa, Toshihiro
    2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPPW), 2017, : 210 - 219
  • [37] Shared-Memory Parallel Vector Implementation of the Immersed Boundary Method for the Computation of Blood Flow in the Beating Mammalian Heart
    David McQueen
    Charles Peskin
    The Journal of Supercomputing, 1997, 11 : 213 - 236
  • [38] Shared-memory parallel vector implementation of the immersed boundary method for the computation of blood flow in the beating mammalian heart
    McQueen, DM
    Peskin, CS
    JOURNAL OF SUPERCOMPUTING, 1997, 11 (03) : 213 - 236
  • [39] Efficient Distributed Mapping-Based Computation for Convolutional Neural Networks in Multi-Core Embedded Parallel Environment
    Jia, Long
    Li, Gang
    Lu, Meili
    Wei, Xile
    Yi, Guosheng
    ELECTRONICS, 2023, 12 (18)
  • [40] Investigation of parallel computation in 3D magnetic field analysis on distributed memory type of multiprocessors containing 296 PUs
    Takahashi, N
    Nakano, T
    Fujiwara, K
    Muramatsu, K
    COMPEL-THE INTERNATIONAL JOURNAL FOR COMPUTATION AND MATHEMATICS IN ELECTRICAL AND ELECTRONIC ENGINEERING, 1998, 17 (5-6) : 726 - +