Leveraging Hierarchical Data Locality in Parallel Programming Models

被引:1
作者
Anbar, Ahmad [1 ]
Kayraklioglu, Engin [1 ]
Serres, Olivier [1 ]
El-Ghazawi, Tarek [1 ]
机构
[1] George Washington Univ, Sch Engn & Appl Sci, Dept Elect & Comp Engn, Washington, DC 20052 USA
来源
2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS) | 2014年
关键词
Data locality; Many-cores; Hierarchical thread clustering;
D O I
10.1109/HPCC.2014.62
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We are proposing a novel framework that ameliorates locality-aware parallel programming models, by defining hierarchical data locality model extension. We also propose a hierarchical thread partitioning algorithm. This algorithm synthesizes hierarchical thread placement layouts that targets minimizing the program's overall communication costs. We demonstrated the effectiveness of our approach using NAS Parallel Benchmarks implemented in Unified Parallel C (UPC) language using a modified Berkeley UPC Compiler and runtime system. We demonstrated an up to 85% improvement in performance by applying the placement layout suggested by our algorithm.
引用
收藏
页码:363 / 366
页数:4
相关论文
共 9 条
[1]  
[Anonymous], 2014, BERKELEY UPC COMPILE
[2]  
[Anonymous], 1999, COMPLEXITY APPROXIMA, DOI DOI 10.1007/978-3-642-58412-1
[3]  
[Anonymous], 2011, P WORKSH EX PROGR CH
[4]  
[Anonymous], TECH REP
[5]  
Bailey D., 1995, TECH REP
[6]   Using the Translation Lookaside Buffer to Map Threads in Parallel Applications Based on Shared Memory [J].
Cruz, Eduardo H. M. ;
Diener, Matthias ;
Navaux, Philippe O. A. .
2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, :532-543
[7]  
El-Ghazawi Tarek., 2003, UPC DISTRIBUTED SHAR
[8]  
Jeannot E, 2010, LECT NOTES COMPUT SC, V6272, P199, DOI 10.1007/978-3-642-15291-7_20
[9]  
Su Hang., 2008, IEEE INT S WORLD WIR, P1