Optimizing Irregular Shared-Memory Applications for Clusters

Citations: 0
Authors
Min, Seung-Jai [1]
Eigenmann, Rudolf [1]
Affiliations
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
Source
ICS'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING | 2008
Keywords
Compiler Analysis; Runtime Techniques; OpenMP; MPI; Irregular Data Accesses; Performance
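DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract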
Irregular applications pose challenges in optimizing communication, due to the difficulty of analyzing irregular data accesses accurately and efficiently. This challenge is especially big when translating irregular shared-memory applications to message-passing form for clusters. The lack of effective irregular data analysis in the translation system results in unnecessary or redundant communication, which limits application scalability. In this paper, we present a Lean Distributed Shared Memory (LDSM) system, which features a fast and accurate irregular data access (IDA) analysis. The analysis uses a region-based diff method and makes use of a runtime library that is optimized for irregular applications. We describe three optimizations that improve the LDSM system performance. A parallel array reduction transformation reduces overheads in the analysis. A packed communication optimization and a differential communication optimization effectively eliminate unnecessary and redundant messages. We evaluate the performance of the optimized LDSM system on a set of representative irregular benchmarks. The optimized LDSM executes irregular applications on average 45% faster than the hand-tuned MPI applications.
Pages: 256-265
Page count: 10