Process Placement in Multicore Clusters: Algorithmic Issues and Practical Techniques

被引:79
作者
Jeannot, Emmanuel [1 ]
Mercier, Guillaume [1 ]
Tessier, Francois [1 ]
机构
[1] INRIA Bordeaux Sud Ouest, F-33405 Talence, France
关键词
Parallel programming; high performance computing; multicore processing; MPI;
D O I
10.1109/TPDS.2013.104
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Current generations of NUMA node clusters feature multicore or manycore processors. Programming such architectures efficiently is a challenge because numerous hardware characteristics have to be taken into account, especially the memory hierarchy. One appealing idea to improve the performance of parallel applications is to decrease their communication costs by matching the communication pattern to the underlying hardware architecture. In this paper, we detail the algorithm and techniques proposed to achieve such a result: first, we gather both the communication pattern information and the hardware details. Then we compute a relevant reordering of the various process ranks of the application. Finally, those new ranks are used to reduce the communication costs of the application.
引用
收藏
页码:993 / 1002
页数:10
相关论文
共 42 条
  • [1] [Anonymous], 1995, Technical Report
  • [2] [Anonymous], 2011, ICS 11, DOI [10.1145/1995896.1995909, DOI 10.1145/1995896.1995909]
  • [3] [Anonymous], 1994, MPI MESS PASS INT ST
  • [4] [Anonymous], 2006, P 20 ANN INT C SUPER, DOI [10.1145/1183401.1183451, DOI 10.1145/1183401.1183451]
  • [5] [Anonymous], 2005, LBNL59208
  • [6] [Anonymous], SAND942692 SAND NAT
  • [7] Argonne National Laboratory, 2013, MPICH2
  • [8] BAILEY DH, 1994, 94006 RNR
  • [9] Mapping communication layouts to network hardware characteristics on massive-scale blue gene systems
    Balaji, Pavan
    Gupta, Rinku
    Vishnu, Abhinav
    Beckman, Pete
    [J]. COMPUTER SCIENCE-RESEARCH AND DEVELOPMENT, 2011, 26 (3-4): : 247 - 256
  • [10] Rank reordering for MPI communication optimization
    Brandfass, B.
    Alrutz, T.
    Gerhold, T.
    [J]. COMPUTERS & FLUIDS, 2013, 80 : 372 - 380