Integrating fine-grained message passing in cache coherent shared memory multiprocessors

Cited by: 2

Authors:

Poulsen, DK [1]

Yew, PC [1]

Affiliations:

[1] UNIV MINNESOTA,DEPT COMP SCI,MINNEAPOLIS,MN 55455

Source:

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING | 1996 / Vol. 33 / Issue 02

Funding:

U.S. National Science Foundation

Keywords:

DOI:

10.1006/jpdc.1996.0036

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

This paper considers the use of data prefetching and an alternative mechanism, data forwarding, for reducing memory latency caused by interprocessor communication in cache coherent, shared memory multiprocessors. Data prefetching is accomplished by using a multiprocessor software pipelined algorithm. Data forwarding is used to target interprocessor data communication, rather than synchronization, and is applied to communication-related accesses between successive parallel loops. Prefetching and forwarding are each shown to be more effective for certain types of architectural and application characteristics. Given this result, a new hybrid prefetching and forwarding approach is proposed and evaluated that allows the relative amounts of prefetching and forwarding used to be adapted to these characteristics. When compared to prefetching or forwarding alone, the new hybrid scheme is shown to increase performance stability over varying application characteristics, to reduce processor instruction overheads, cache miss ratios, and memory system bandwidth requirements, and to reduce performance sensitivity to architectural parameters such as cache size. Algorithms for data prefetching, data forwarding, and hybrid prefetching and forwarding are described. These algorithms are applied by using a parallelizing compiler and are evaluated via execution-driven simulations of large, optimized, numerical application codes with loop-level and vector parallelism. (C) 1996 Academic Press, Inc.


Pages: 172-188

Page count: 17

50 items in total

[1] Support for fine-grained synchronization in shared-memory multiprocessors
Vlassov, Vladimir
Merino, Oscar Sierra
Moritz, Csaba Andras
Popov, Konstantin
PARALLEL COMPUTING TECHNOLOGIES, PROCEEDINGS, 2007, 4671 : 453 - 467
[2] Efficient message passing on shared memory multiprocessors
Lect Notes Comput Sci, (221):
[3] Efficient message passing on UNIX shared memory multiprocessors
Bernaschi, M
FUTURE GENERATION COMPUTER SYSTEMS, 1998, 13 (06) : 443 - 449
[4] A Fine-Grained Message Passing MOEA/D
Derbel, Bilel
Liefooghe, Arnaud
Marquet, Gauvain
Talbi, El-Ghazali
2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 1837 - 1844
[5] Termination detection for fine-grained message-passing architectures
Naylor, Matthew
Moore, Simon W.
Mokhov, Andrey
Thomas, David
Beaumont, Jonathan R.
Fleming, Shane
Markettos, A. Theodore
Bytheway, Thomas
Brown, Andrew
2020 IEEE 31ST INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2020), 2020, : 17 - 24
[6] Fine-grained task reweighting on multiprocessors
Block, A
Anderson, JH
Bishop, G
11TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2005, : 429 - 435
[7] Integrating Software Distributed Shared Memory and Message Passing Programming
Wong, H. J.
Rendell, A. P.
2009 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING AND WORKSHOPS, 2009, : 197 - +
[8] General hardware multicasting for fine-grained message-passing architectures
Naylor, Matthew
Moore, Simon W.
Thomas, David
Beaumont, Jonathan R.
Fleming, Shane
Vousden, Mark
Markettos, A. Theodore
Bytheway, Thomas
Brown, Andrew
2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021), 2021, : 126 - 133
[9] INTEGRATING MESSAGE-PASSING AND SHARED-MEMORY - EARLY EXPERIENCE
KRANZ, D
LIM, BH
JOHNSON, K
KUBIATOWICZ, J
AGARWAL, A
SIGPLAN NOTICES, 1993, 28 (01) : 84 - 84
[10] INTEGRATING MESSAGE-PASSING AND SHARED-MEMORY - EARLY EXPERIENCE
KRANZ, D
JOHNSON, K
AGARWAL, A
KUBIATOWICZ, J
LIM, BH
SIGPLAN NOTICES, 1993, 28 (07) : 54 - 63
