Compiler support for array distribution on NUMA shared memory multiprocessors

被引:2
|
作者
Abdelrahman, TS [1 ]
Wong, TN [1 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 1A4, Canada
来源
JOURNAL OF SUPERCOMPUTING | 1998年 / 12卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
data distribution; locality management; cache management; parallelizing compilers; NUMA multiprocessors;
D O I
10.1023/A:1008035807599
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Management of program data to improve data locality and reduce false sharing is critical for scaling performance on NUMA shared memory multiprocessors. We use HPF-like data decomposition directives to partition and place arrays in data-parallel applications on Hector, a shared-memory NUMA multiprocessor. We describe a compiler system for automating the partitioning and placement of arrays. The compiler exploits Hector's shared memory architecture to efficiently implement distributed arrays. Experimental results from a prototype implementation demonstrate the effectiveness of these techniques. They also demonstrate the magnitude of the performance improvement attainable when our compiler-based data management schemes are used instead of operating system data management policies; performance improves by up to a factor of 5.
引用
收藏
页码:349 / 371
页数:23
相关论文
共 50 条