PHLAME: Hierarchical Locality Exploitation using the PGAS Model

被引:2
作者
Anbar, Ahmad [1 ]
Serres, Olivier [1 ]
Kayraklioglu, Engin [1 ]
Badawy, Abdel-Hameed A. [1 ,2 ]
El-Ghazawi, Tarek [1 ]
机构
[1] George Washington Univ, Elect & Comp Engn Dept, Washington, DC 20052 USA
[2] Arkansas Tech Univ, Dept Elect Engn, Russellville, AR USA
来源
2015 9TH INTERNATIONAL CONFERENCE ON PARTITIONED GLOBAL ADDRESS SPACE PROGRAMMING MODELS (PGAS) | 2015年
关键词
D O I
10.1109/PGAS.2015.16
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Parallel computers are becoming deeply hierarchical. Locality-aware programming models allow programmers to control locality at one level through establishing affinity between data and executing activities. This, however, does not enable locality exploitation at other levels. Therefore, we must conceive an efficient abstraction of hierarchical locality and develop techniques to exploit it. Techniques applied directly by programmers, beyond the first level, burden the programmer and hinder productivity. In this work, we propose the Parallel Hierarchical Locality Abstraction Model for Execution (PHLAME). PHLAME is an execution model to abstract and exploit machine hierarchical properties through locality-aware programming and a runtime system that takes into account machine characteristics, data sharing and communication profile of the underlying application. This paper presents and experiments with concepts and techniques that can drive such runtime system in support of PHLAME. Our experiments show that our techniques scale to 1024 cores and achieve performance gains of up to 88%.
引用
收藏
页码:82 / 89
页数:8
相关论文
共 19 条
[1]   Leveraging Hierarchical Data Locality in Parallel Programming Models [J].
Anbar, Ahmad ;
Kayraklioglu, Engin ;
Serres, Olivier ;
El-Ghazawi, Tarek .
2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, :363-366
[2]  
Anbar Ahmad, 2014, P 20 IEEE INT C PAR
[3]  
Baily D., 1994, RNR94007 NAS
[4]  
Bonachea D., 2002, TECHNICAL REPORT
[5]   hwloc: a Generic Framework for Managing Hardware Affinities in HPC Applications [J].
Broquedis, Francois ;
Clet-Ortega, Jerome ;
Moreaud, Stephanie ;
Furmento, Nathalie ;
Goglin, Brice ;
Mercier, Guillaume ;
Thibault, Samuel ;
Namyst, Raymond .
PROCEEDINGS OF THE 18TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2010, :180-186
[6]   Parallel programmability and the Chapel language [J].
Chamberlain, B. L. ;
Callahan, D. ;
Zima, H. P. .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2007, 21 (03) :291-312
[7]   Using the Translation Lookaside Buffer to Map Threads in Parallel Applications Based on Shared Memory [J].
Cruz, Eduardo H. M. ;
Diener, Matthias ;
Navaux, Philippe O. A. .
2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, :532-543
[8]   OpenMP: An industry standard API for shared-memory programming [J].
Dagum, L ;
Menon, R .
IEEE COMPUTATIONAL SCIENCE & ENGINEERING, 1998, 5 (01) :46-55
[9]  
Devine K, 2002, COMPUT SCI ENG, V4, P90, DOI 10.1109/5992.988653
[10]  
El-Ghazawi T., 2002, Proceedings of the 2002 ACM/IEEE conference on Supercomputing, Supercomputing '02, P1