Improved schemes for mapping arbitrary algorithms onto processor meshes

被引:3
作者
Robic, B [1 ]
Vilfan, B [1 ]
机构
[1] UNIV LJUBLJANA,FAC COMP & INFORMAT SCI,LJUBLJANA,SLOVENIA
关键词
distributed memory message passing multiprocessor; regular topology; mapping; embedding; performance optimization; combinatorial optimization; heuristic algorithm;
D O I
10.1016/0167-8191(96)00019-1
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We address the problem of efficient schemes for mapping arbitrary parallel algorithms onto distributed memory message passing multiprocessors with mesh topologies. We analyze a particular mapping scheme, find the reasons for its low efficiency, and show that mapped algorithms tend to be both wider and higher than necessary. As a result, they generally execute too slow while at the same time occupying an excessive number of processors. Two approaches to the improvement of the scheme are presented, one direct, and the other indirect. In the direct approach, we describe four nontrivial improvements of the scheme but also prove their NP-completeness. In contrast, in the indirect approach the original scheme is followed by a refinement procedure that incrementally improves the mapped algorithms. We describe four different heuristic refinement procedures. Experimental results show that the indirect approach offers a 51% saving in processor resources and, at the same time, a 36% saving in execution time, on average.
引用
收藏
页码:701 / 724
页数:24
相关论文
共 15 条
[1]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[2]  
BOKHARI SH, 1981, IEEE T COMPUT, V30, P207, DOI 10.1109/TC.1981.1675756
[3]   EFFICIENT ALGORITHMS FOR MAPPING AND PARTITIONING A CLASS OF PARALLEL COMPUTATIONS [J].
CHOI, HA ;
NARAHARI, B .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1993, 19 (04) :349-363
[4]  
FUKUNAGA K, 1987, IEEE T COMPUT, V36, P888, DOI 10.1109/TC.1987.1676984
[5]  
KOREN I, 1988, IEEE COMPUT, V21, P30
[6]  
KOREN I, 1983, 1983 P INT C PAR PRO, P335
[7]  
KRAEMER O, 1989, PARALLEL COMPUT, V9, P213
[8]  
LEE C, 1989, P 3 ANN PAR P S MARC, P671
[9]  
PATNAIK LM, 1986, IEEE T COMPUT, V35, P229, DOI 10.1109/TC.1986.1676747
[10]  
ROBIC B, 1995, COMPUT ARTIF INTELL, V14, P339