Integrating fine-grained message passing in cache coherent shared memory multiprocessors

Cited by: 2
Authors
Poulsen, DK [1]
Yew, PC [1]
Affiliations
[1] UNIV MINNESOTA, DEPT COMP SCI, MINNEAPOLIS, MN 55455
Funding
U.S. National Science Foundation
DOI
10.1006/jpdc.1996.0036
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
This paper considers the use of data prefetching and an alternative mechanism, data forwarding, for reducing memory latency caused by interprocessor communication in cache coherent, shared memory multiprocessors. Data prefetching is accomplished by using a multiprocessor software pipelined algorithm. Data forwarding is used to target interprocessor data communication, rather than synchronization, and is applied to communication-related accesses between successive parallel loops. Prefetching and forwarding are each shown to be more effective for certain types of architectural and application characteristics. Given this result, a new hybrid prefetching and forwarding approach is proposed and evaluated that allows the relative amounts of prefetching and forwarding used to be adapted to these characteristics. When compared to prefetching or forwarding alone, the new hybrid scheme is shown to increase performance stability over varying application characteristics, to reduce processor instruction overheads, cache miss ratios, and memory system bandwidth requirements, and to reduce performance sensitivity to architectural parameters such as cache size. Algorithms for data prefetching, data forwarding, and hybrid prefetching and forwarding are described. These algorithms are applied by using a parallelizing compiler and are evaluated via execution-driven simulations of large, optimized, numerical application codes with loop-level and vector parallelism. © 1996 Academic Press, Inc.
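As a rough illustration of the software-pipelined prefetching idea mentioned in the abstract, the sketch below issues a prefetch a fixed number of iterations ahead of the element being consumed, with a prologue and an epilogue around the steady-state loop. It is not the paper's compiler algorithm: it is single-threaded, the function name `sum_with_prefetch` and the constant `PREFETCH_DISTANCE` are hypothetical, and it assumes a GCC- or Clang-compatible compiler that provides `__builtin_prefetch`. Data forwarding is not shown, since it depends on hardware support with no portable equivalent.

```c
#include <stddef.h>

/* Hypothetical prefetch distance, in loop iterations (an assumption;
 * a real value would be tuned to memory latency and loop body cost). */
#define PREFETCH_DISTANCE 8

/* Sum an array while prefetching elements PREFETCH_DISTANCE iterations
 * ahead of the one currently being read. */
double sum_with_prefetch(const double *a, size_t n)
{
    double sum = 0.0;
    size_t i;

    /* Prologue: request the first PREFETCH_DISTANCE elements (or fewer). */
    for (i = 0; i < n && i < PREFETCH_DISTANCE; i++)
        __builtin_prefetch(&a[i], 0, 3);   /* 0 = read, 3 = keep in cache */

    /* Steady state: prefetch a[i + distance] while consuming a[i]. */
    for (i = 0; i + PREFETCH_DISTANCE < n; i++) {
        __builtin_prefetch(&a[i + PREFETCH_DISTANCE], 0, 3);
        sum += a[i];
    }

    /* Epilogue: consume the remaining elements; nothing left to prefetch. */
    for (; i < n; i++)
        sum += a[i];

    return sum;
}
```

The prefetch is only a hint, so the function returns the same result as a plain summation loop; any benefit would show up in timing, not in output.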
Pages: 172-188
Number of pages: 17
Related papers
50 in total
  • [1] Support for fine-grained synchronization in shared-memory multiprocessors
    Vlassov, Vladimir
    Merino, Oscar Sierra
    Moritz, Csaba Andras
    Popov, Konstantin
    PARALLEL COMPUTING TECHNOLOGIES, PROCEEDINGS, 2007, 4671 : 453 - 467
  • [3] Efficient message passing on UNIX shared memory multiprocessors
    Bernaschi, M
    FUTURE GENERATION COMPUTER SYSTEMS, 1998, 13 (06) : 443 - 449
  • [4] A Fine-Grained Message Passing MOEA/D
    Derbel, Bilel
    Liefooghe, Arnaud
    Marquet, Gauvain
    Talbi, El-Ghazali
    2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 1837 - 1844
  • [5] Termination detection for fine-grained message-passing architectures
    Naylor, Matthew
    Moore, Simon W.
    Mokhov, Andrey
    Thomas, David
    Beaumont, Jonathan R.
    Fleming, Shane
    Markettos, A. Theodore
    Bytheway, Thomas
    Brown, Andrew
    2020 IEEE 31ST INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2020), 2020, : 17 - 24
  • [6] Fine-grained task reweighting on multiprocessors
    Block, A
    Anderson, JH
    Bishop, G
    11TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2005, : 429 - 435
  • [7] Integrating Software Distributed Shared Memory and Message Passing Programming
    Wong, H. J.
    Rendell, A. P.
    2009 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING AND WORKSHOPS, 2009, : 197 - +
  • [8] General hardware multicasting for fine-grained message-passing architectures
    Naylor, Matthew
    Moore, Simon W.
    Thomas, David
    Beaumont, Jonathan R.
    Fleming, Shane
    Vousden, Mark
    Markettos, A. Theodore
    Bytheway, Thomas
    Brown, Andrew
    2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021), 2021, : 126 - 133
  • [9] INTEGRATING MESSAGE-PASSING AND SHARED-MEMORY - EARLY EXPERIENCE
    KRANZ, D
    LIM, BH
    JOHNSON, K
    KUBIATOWICZ, J
    AGARWAL, A
    SIGPLAN NOTICES, 1993, 28 (01) : 84 - 84
  • [10] INTEGRATING MESSAGE-PASSING AND SHARED-MEMORY - EARLY EXPERIENCE
    KRANZ, D
    JOHNSON, K
    AGARWAL, A
    KUBIATOWICZ, J
    LIM, BH
    SIGPLAN NOTICES, 1993, 28 (07) : 54 - 63