Adaptive and Dynamic Design for MPI Tag Matching

Cited by: 33
Authors
Bayatpour, M. [1]
Subramoni, H. [1]
Chakraborty, S. [1]
Panda, D. K. [1]
Affiliation
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
Source
2016 IEEE International Conference on Cluster Computing (CLUSTER) | 2016
DOI
10.1109/CLUSTER.2016.69
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Subject classification code
0812;
Abstract
The Message Passing Interface (MPI) standard specifies the use of a (source, tag, communicator) tuple to identify whether an incoming message is the one the receiver process is expecting. The cost of this process, commonly known as "tag matching", is tightly coupled to the communication pattern of the application and the load it generates at each individual process. Although researchers have proposed several schemes to mitigate the cost of tag matching, all of them are static: they do not adapt to the communication load at individual processes, which can degrade tag matching times. Such static designs can also incur unnecessary memory overhead for queue management at processes whose communication loads do not justify enhanced tag matching schemes. In this paper, we take up this challenge and propose a tag matching design that dynamically adapts to the communication load at each individual process at runtime. Our experimental evaluation shows that the proposed adaptive and dynamic tag matching scheme delivers the best performance among multiple state-of-the-art tag matching schemes while limiting memory consumption to the minimum necessary to deliver the desired performance benefits. For instance, with HPCG, the adaptive scheme delivers 20%, 32%, and up to 2X improvements in tag matching performance over the default, bin-based, and rank-based schemes, respectively. To the best of our knowledge, this is the first tag matching design capable of dynamically adapting to the communication requirements of end applications.
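To make the matching step described in the abstract concrete, the following is a minimal sketch of tuple-based tag matching with two linear queues (posted receives and unexpected messages), a structure commonly used in MPI libraries. It is not the adaptive design proposed in the paper, and the abstract does not confirm that it corresponds to the "default" scheme used in the evaluation. The names `Envelope`, `LinearMatcher`, `post_recv`, and `arrive`, and the wildcard values `ANY_SOURCE` and `ANY_TAG`, are assumptions made for illustration only.

```python
from typing import List, NamedTuple, Optional

# Illustrative stand-ins for MPI_ANY_SOURCE / MPI_ANY_TAG (values are assumptions).
ANY_SOURCE = -1
ANY_TAG = -1


class Envelope(NamedTuple):
    """The (source, tag, communicator) tuple used for matching."""
    source: int
    tag: int
    comm: int


def matches(posted: Envelope, incoming: Envelope) -> bool:
    """A posted receive matches a message if the communicators are equal and
    the source and tag are equal or wildcarded on the receive side."""
    return (posted.comm == incoming.comm
            and posted.source in (ANY_SOURCE, incoming.source)
            and posted.tag in (ANY_TAG, incoming.tag))


class LinearMatcher:
    """Hypothetical single-list matcher: every lookup is a linear scan,
    so matching cost grows with queue length."""

    def __init__(self) -> None:
        self.posted: List[Envelope] = []      # receives waiting for a message
        self.unexpected: List[Envelope] = []  # messages waiting for a receive

    def post_recv(self, want: Envelope) -> Optional[Envelope]:
        """Post a receive; return the oldest matching unexpected message, if any."""
        for i, msg in enumerate(self.unexpected):
            if matches(want, msg):
                return self.unexpected.pop(i)
        self.posted.append(want)
        return None

    def arrive(self, msg: Envelope) -> Optional[Envelope]:
        """Handle an incoming message; return the oldest matching posted receive, if any."""
        for i, want in enumerate(self.posted):
            if matches(want, msg):
                return self.posted.pop(i)
        self.unexpected.append(msg)
        return None
```

Because each call scans a list from the front, a process with long queues pays more per match, which is the load-dependent cost the abstract refers to. Bin-based and rank-based schemes trade extra memory for shorter scans.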
Pages: 1-10
Page count: 10