COMMUNICATION OPTIMIZATIONS FOR IRREGULAR SCIENTIFIC COMPUTATIONS ON DISTRIBUTED-MEMORY ARCHITECTURES

被引：75

作者：

DAS, R

UYSAL, M

SALTZ, J

HWANG, YS

机构：

[1] Department of Computer Science, University of Maryland, College Park

[2] UMIACS, University of Maryland, College Park

来源：

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING | 1994年 / 22卷 / 03期

关键词：

D O I：

10.1006/jpdc.1994.1104

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

This paper describes a number of optimizations that can be used to support the efficient execution of irregular problems on distributed memory parallel machines. These primitives (1) coordinate interprocessor data movement, (2) manage the storage of, and access to, copies of off-processor data, (3) minimize interprocessor communication requirements, and (4) support a shared name space. We present a detailed performance and scalability analysis of the communication primitives. This performance and scalability analysis is carried out using a workload generator, kernels from real applications, and a large unstructured adaptive application (the molecular dynamics code CHARMM). (C) 1994 Academic Press, Inc.

引用

页码：462 / 478

页数：17

共 50 条

[21] An algorithmic framework for parallelizing vision computations on distributed-memory machines
Chung, Y
1997 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1997, : 160 - 165
[22] Optimizing I/O for irregular applications on distributed-memory machines
Carretero, J
No, J
Choudhary, A
PARALLEL COMPUTATION, 1999, 1557 : 470 - 479
[23] PARALLEL RENDERING OF VOLUMETRIC DATA SET ON DISTRIBUTED-MEMORY ARCHITECTURES
MONTANI, C
PEREGO, R
SCOPIGNO, R
CONCURRENCY-PRACTICE AND EXPERIENCE, 1993, 5 (02): : 153 - 167
[24] ADAPTING FORTRAN 90 ARRAY PROGRAMS FOR DISTRIBUTED-MEMORY ARCHITECTURES
MERLIN, JH
LECTURE NOTES IN COMPUTER SCIENCE, 1992, 591 : 184 - 200
[25] Parallelizing RRT on Large-Scale Distributed-Memory Architectures
Devaurs, Didier
Simeon, Thierry
Cortes, Juan
IEEE TRANSACTIONS ON ROBOTICS, 2013, 29 (02) : 571 - 579
[26] Compiling Affine Loop Nests for Distributed-Memory Parallel Architectures
Bondhugula, Uday
2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2013,
[27] Efficient Lagrangian particle tracking algorithms for distributed-memory architectures
Baldan, Giacomo
Bellosta, Tommaso
Guardone, Alberto
COMPUTERS & FLUIDS, 2023, 256
[28] MAPPING UNIFORM LOOP NESTS ONTO DISTRIBUTED-MEMORY ARCHITECTURES
DARTE, A
ROBERT, Y
PARALLEL COMPUTING, 1994, 20 (05) : 679 - 710
[29] Evolution-Based Scheduling of Computations and Communications on Distributed-Memory Multicomputers
Al-Mouhamed, Mayez
Computer Journal, 42 (05): : 373 - 389
[30] Evolution-based scheduling of computations and communications on distributed-memory multicomputers
Al-Mouhamed, M
COMPUTER JOURNAL, 1999, 42 (05): : 373 - 390

← 1 2 3 4 5 →