Optimizing Irregular Shared-Memory Applications for Clusters

被引:0
作者
Min, Seung-Jai [1 ]
Eigenmann, Rudolf [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
来源
ICS'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING | 2008年
关键词
Compiler Analysis; Runtime Techniques; OpenMP; MPI; Irregular Data Accesses; Performance;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Irregular applications pose challenges in optimizing communication, due to the difficulty of analyzing irregular data accesses accurately and efficiently. This challenge is especially big when translating irregular shared-memory applications to message-passing form for clusters. The lack of effective irregular data analysis in the translation system results in unnecessary or redundant communication, which limits application scalability. In this paper, we present a Lean Distributed Shared Memory (LDSM) system, which features a fast and accurate irregular data access (IDA) analysis. The analysis uses a region-based diff method and makes use of a runtime library that is optimized for irregular applications. We describe three optimizations that improve the LDSM system performance. A parallel array reduction transformation reduces overheads in the analysis. A packed communication optimization and a differential communication optimization effectively eliminate unnecessary and redundant messages. We evaluate the performance of the optimized LDSM system on a set of representative irregular benchmarks. The optimized LDSM executes irregular applications on average 45% faster than the hand-tuned MPI applications.
引用
收藏
页码:256 / 265
页数:10
相关论文
共 29 条
  • [1] Agrawal G, 1995, SUPERCOMP PROC, P1270
  • [2] TreadMarks: Shared memory computing on networks of workstations
    Amza, C
    Cox, AL
    Dwarkadas, S
    Keleher, P
    Lu, HH
    Rajamony, R
    Yu, WM
    Zwaenepoel, W
    [J]. COMPUTER, 1996, 29 (02) : 18 - &
  • [3] [Anonymous], 2005, P 19 ANN INT C SUPER
  • [4] BALART J, 2006, 19 INT WORKSH LANG C, P330
  • [5] Basumallik A., 2006, Proceedings of the 2006 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming PPoPP'06, P119, DOI 10.1145/1122971.1122990
  • [6] BRECHT T, 1999, ATEC 99, P7
  • [7] CHARMM - A PROGRAM FOR MACROMOLECULAR ENERGY, MINIMIZATION, AND DYNAMICS CALCULATIONS
    BROOKS, BR
    BRUCCOLERI, RE
    OLAFSON, BD
    STATES, DJ
    SWAMINATHAN, S
    KARPLUS, M
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 1983, 4 (02) : 187 - 217
  • [8] CHEN WY, 2007, ICS 07, P158
  • [9] Das R, 1995, SUPERCOMP PROC, P1918
  • [10] El-Ghazawi T. A., 2003, UPC LANGUAGE SPECIFI