A Survey of Communication Performance Models for High-Performance Computing

被引:23
作者
Rico-Gallego, Juan A. [1 ]
Diaz-Martin, Juan C. [1 ]
Manumachu, Ravi Reddy [2 ]
Lastovetsky, Alexey L. [2 ]
机构
[1] Univ Extremadura, Escuela Politecn, Avd Univ S-N, Caceres 10003, Spain
[2] Univ Coll Dublin, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
Communication performance models; analytic modeling; communication performance; high-performance computing; EFFICIENT COLLECTIVE COMMUNICATION; PARALLEL COMPUTATIONAL MODEL; HETEROGENEOUS NETWORKS; MATRIX MULTIPLICATION; ACCURATE; ALGORITHMS; LOGP; OPTIMIZATION; PREDICTION; PARAMETERS;
D O I
10.1145/3284358
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This survey aims to present the state of the art in analytic communication performance models, providing sufficiently detailed descriptions of particularly noteworthy efforts. Modeling the cost of communications in computer clusters is an important and challenging problem. It provides insights into the design of the communication pattern of parallel scientific applications and mathematical kernels and sets a clear ground for optimization of their deployment in the increasingly complex high-performance computing infrastructure. The survey provides background information on how different performance models represent the underlying platform and shows the evolution of these models over time from early clusters of single-core processors to present-day multi-core and heterogeneous platforms. Prospective directions for future research in the area of analytic communication performance modeling conclude the survey.
引用
收藏
页码:1 / 36
页数:36
相关论文
共 99 条
  • [91] Optimization of collective communication operations in MPICH
    Thakur, R
    Rabenseifner, R
    Gropp, W
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2005, 19 (01) : 49 - 66
  • [92] Top500, 2018, TOP500 LIST
  • [93] Träff JL, 2005, LECT NOTES COMPUT SC, V3666, P48
  • [94] Performance analysis and optimization of MPI collective operations on multi-core clusters
    Tu, Bibo
    Fan, Jianping
    Zhan, Jianfeng
    Zhao, Xiaofang
    [J]. JOURNAL OF SUPERCOMPUTING, 2012, 60 (01) : 141 - 162
  • [95] Protocol-dependent message-passing performance on Linux clusters
    Turner, D
    Chen, XH
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, PROCEEDINGS, 2002, : 187 - 194
  • [96] A BRIDGING MODEL FOR PARALLEL COMPUTATION
    VALIANT, LG
    [J]. COMMUNICATIONS OF THE ACM, 1990, 33 (08) : 103 - 111
  • [97] VanDeGeijn RA, 1997, CONCURRENCY-PRACT EX, V9, P255, DOI 10.1002/(SICI)1096-9128(199704)9:4<255::AID-CPE250>3.0.CO
  • [98] 2-2
  • [99] Zhu J, 2014, LECT NOTES COMPUT SC, V8374, P259, DOI 10.1007/978-3-642-54420-0_26