Compiler support for array distribution on NUMA shared memory multiprocessors

被引:2
|
作者
Abdelrahman, TS [1 ]
Wong, TN [1 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 1A4, Canada
来源
JOURNAL OF SUPERCOMPUTING | 1998年 / 12卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
data distribution; locality management; cache management; parallelizing compilers; NUMA multiprocessors;
D O I
10.1023/A:1008035807599
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Management of program data to improve data locality and reduce false sharing is critical for scaling performance on NUMA shared memory multiprocessors. We use HPF-like data decomposition directives to partition and place arrays in data-parallel applications on Hector, a shared-memory NUMA multiprocessor. We describe a compiler system for automating the partitioning and placement of arrays. The compiler exploits Hector's shared memory architecture to efficiently implement distributed arrays. Experimental results from a prototype implementation demonstrate the effectiveness of these techniques. They also demonstrate the magnitude of the performance improvement attainable when our compiler-based data management schemes are used instead of operating system data management policies; performance improves by up to a factor of 5.
引用
收藏
页码:349 / 371
页数:23
相关论文
共 50 条
  • [11] Parallel Data Distribution Management on Shared-memory Multiprocessors
    Marzolla, Moreno
    D'angelo, Gabriele
    ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2020, 30 (01):
  • [12] EXPERIMENTAL COMPARISON OF MEMORY MANAGEMENT POLICIES FOR NUMA MULTIPROCESSORS
    LAROWE, RP
    ELLIS, CS
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1991, 9 (04): : 319 - 363
  • [13] Applications for shared memory multiprocessors
    Stenstrom, P
    Dahlgren, F
    COMPUTER, 1996, 29 (12) : 29 - 31
  • [14] Impact of memory contention on dynamic scheduling on NUMA multiprocessors
    Durand, D
    Montaut, T
    Kervella, L
    Jalby, W
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1996, 7 (11) : 1201 - 1214
  • [15] Architectural support for parallel reductions in scalable shared-memory multiprocessors
    Garzarán, MJ
    Prvulovic, M
    Zhang, Y
    Jula, A
    Yu, H
    Rauchwerger, L
    Torrellas, J
    2001 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2001, : 243 - 254
  • [16] Architectural support for scalable speculative parallelization in shared-memory multiprocessors
    Cintra, M
    Martínez, JF
    Torrellas, J
    PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2000, : 13 - 24
  • [17] Support for fine-grained synchronization in shared-memory multiprocessors
    Vlassov, Vladimir
    Merino, Oscar Sierra
    Moritz, Csaba Andras
    Popov, Konstantin
    PARALLEL COMPUTING TECHNOLOGIES, PROCEEDINGS, 2007, 4671 : 453 - 467
  • [18] Compiler Support for Selective Page Migration in NUMA Architectures
    Piccoli, Guilherme
    Santos, Henrique N.
    Rodrigues, Raphael E.
    Pousa, Christiane
    Borin, Edson
    Magno, Fernando
    Pereira, Quintao
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14), 2014, : 369 - 380
  • [19] Memory Affinity for Hierarchical Shared Memory Multiprocessors
    Ribeiro, Christiane Pousa
    Mehaut, Jean-Francois
    Carissimi, Alexandre
    Castro, Marcio
    Fernandes, Luiz Gustavo
    PROCEEDINGS OF THE 21ST INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, 2009, : 59 - +
  • [20] Compiler and runtime support for shared memory parallelization of data mining algorithms
    Li, XG
    Jin, RM
    Agrawal, G
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, 2005, 2481 : 265 - 279