A Memory Efficient Parallel All-Pairs Computation Framework: Computation - Communication Overlap

被引：1

作者：

Yeleswarapu, Venkata Kasi Viswanath ^{[1
]}

Somani, Arun K. ^{[1
]}

机构：

[1] Iowa State Univ, Dept Elect & Comp Engn, Ames, IA 50010 USA

来源：

PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2017), PT I | 2018年 / 10777卷

基金：

美国国家科学基金会;

关键词：

Communication - computation overlap; High performance computing; All-Pairs problems; Parallel computing; MPI;

D O I：

10.1007/978-3-319-78024-5_39

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

All-Pairs problems require each data element in a set of N data elements to be paired with every other data element for specific computation using the two data elements. Our framework aims to address recurring problems of scalability, distributing equal work load to all nodes and by reducing memory footprint. We reduce memory footprint of All-Pairs problems, by reducing memory requirement from N/root P to 3N/P. A bio-informatics application is implemented to demonstrate the scalability ranging up to 512 cores for the data set we experimented, redundancy management, and speed up performance of the framework.

引用

页码：443 / 458

页数：16

共 42 条

[31] A Parallel Computing Method for the Computation of the Moore-Penrose Generalized Inverse for Shared-Memory Architectures
Gelvez-Almeida, Elkin
Barrientos, Ricardo J.
Vilches-Ponce, Karina
Mora, Marco
IEEE ACCESS, 2023, 11 : 134834 - 134845
[32] Maximizing Communication–Computation Overlap Through Automatic Parallelization and Run-time Tuning of Non-blocking Collective Operations
Youcef Barigou
Edgar Gabriel
International Journal of Parallel Programming, 2017, 45 : 1390 - 1416
[33] Maximizing Communication-Computation Overlap Through Automatic Parallelization and Run-time Tuning of Non-blocking Collective Operations
Barigou, Youcef
Gabriel, Edgar
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (06) : 1390 - 1416
[34] Adaptive Barrier Algorithm in MPI Based on Analytical Evaluations for Communication Time in the LogP Model of Parallel Computation
Zharikov, Vitalii V.
Paznikov, Alexey A.
Pavsky, Kirill V.
Pavsky, Valery A.
2018 INTERNATIONAL SCIENTIFIC MULTI-CONFERENCE ON INDUSTRIAL ENGINEERING AND MODERN TECHNOLOGIES (FAREASTCON), 2018,
[35] Enabling Efficient Hybrid Systolic Computation in Shared-L1-Memory Manycore Clusters
Mazzola, Sergio
Riedel, Samuel
Benini, Luca
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2024, 32 (09) : 1602 - 1615
[36] Communication-Computation Overlapping with Dynamic Loop Scheduling for Preconditioned Parallel Iterative Solvers on Multicore and Manycore Clusters
Nakajima, Kengo
Hanawa, Toshihiro
2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPPW), 2017, : 210 - 219
[37] Shared-Memory Parallel Vector Implementation of the Immersed Boundary Method for the Computation of Blood Flow in the Beating Mammalian Heart
David McQueen
Charles Peskin
The Journal of Supercomputing, 1997, 11 : 213 - 236
[38] Shared-memory parallel vector implementation of the immersed boundary method for the computation of blood flow in the beating mammalian heart
McQueen, DM
Peskin, CS
JOURNAL OF SUPERCOMPUTING, 1997, 11 (03) : 213 - 236
[39] Efficient Distributed Mapping-Based Computation for Convolutional Neural Networks in Multi-Core Embedded Parallel Environment
Jia, Long
Li, Gang
Lu, Meili
Wei, Xile
Yi, Guosheng
ELECTRONICS, 2023, 12 (18)
[40] Investigation of parallel computation in 3D magnetic field analysis on distributed memory type of multiprocessors containing 296 PUs
Takahashi, N
Nakano, T
Fujiwara, K
Muramatsu, K
COMPEL-THE INTERNATIONAL JOURNAL FOR COMPUTATION AND MATHEMATICS IN ELECTRICAL AND ELECTRONIC ENGINEERING, 1998, 17 (5-6) : 726 - +

← 1 2 3 4 5 →