On the Effects of CPU Caches on MPI Point-to-Point Communications

Cited by: 2
Authors
Pellegrini, Simone [1]
Hoefler, Torsten [2]
Fahringer, Thomas [1]
Affiliations
[1] Univ Innsbruck, A-6020 Innsbruck, Austria
[2] Univ Illinois, Urbana, IL 61801 USA
Source
2012 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER) | 2012
Keywords
MPI; CPU Cache; Code Transformations;
DOI
10.1109/CLUSTER.2012.22
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Several researchers have investigated the placement of communication calls in message-passing parallel codes. The current rule of thumb is to maximize communication/computation overlap with early binding. In this work, we demonstrate that this is not the only design constraint because CPU caches can have a significant impact on communications. We conduct an empirical study of the interaction between CPU caching and communications for several different communication scenarios. We use the insight gained to formulate a set of intuitive rules for communication call placement and show how our rules can be applied to practical codes. Our optimized codes show an improvement of up to 40% for a simple stencil code. Our work is a first step towards communication optimizations by moving communication calls. We expect that future communication-aware compilers will use our insights as a standard technique to move communication calls in order to optimize performance.
Pages: 495-503
Number of pages: 9
Related papers
18 in total
  • [1] [Anonymous], 2005, LBNL59208
  • [2] Buntinas D, 2006, SIXTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, P521
  • [3] Cameron K. W., 2003, IPDPS 03
  • [4] Chaarawi M, 2008, LECT NOTES COMPUT SC, V5205, P210, DOI 10.1007/978-3-540-87475-1_30
  • [5] Chen WY, 2005, PACT 2005: 14TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, P267
  • [6] Danalis A., 2005, SC 05
  • [7] Buffer-safe communication optimization based on data flow analysis and performance prediction
    Fahringer, T
    Mehofer, E
    [J]. 1997 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 1997, : 189 - 200
  • [8] Gropp W., 1999, Recent Advances in Parallel Virtual Machine and Message Passing Interface. 6th European PVM/MPI User's Group Meeting. Proceedings (Lecture Notes in Computer Science Vol.1697), P11
  • [9] Grove D., 2001, P HPC AS
  • [10] Comparison of MPI Benchmark Programs on Shared Memory and Distributed Memory Machines (Point-to-Point Communication)
    Hamid, Nor Asilah Wati Abdul
    Coddington, Paul
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2010, 24 (04) : 469 - 483