Task clustering and scheduling for distributed memory parallel architectures

被引:109
作者
Palis, MA [1 ]
Liou, JC [1 ]
Wei, DSL [1 ]
机构
[1] UNIV AIZU,SCH ENGN & COMP SCI,AIZU WAKAMATSU,FUKUSHIMA 965,JAPAN
基金
美国国家科学基金会;
关键词
program task graph; task granularity; task scheduling; distributed memory architectures; approximation algorithms;
D O I
10.1109/71.481597
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper addresses the problem of scheduling parallel programs represented as directed acyclic task graphs for execution on distributed memory parallel architectures. Because of the high communication overhead in existing parallel machines, a crucial step in scheduling is task clustering, the process of coalescing fine grain tasks into single coarser ones so that the overall execution time is minimized. The task clustering problem is NP-hard, even when the number of processors is unbounded and task duplication is allowed. A simple greedy algorithm is presented for this problem which, for a task graph with arbitrary granularity, produces a schedule whose makespan is at most twice optimal. Indeed, the quality of the schedule improves as the granularity of the task graph becomes larger. For example, if the granularity is at least 1/2, the makespan of the schedule is at most 5/3 times optimal. For a task graph with n tasks and e inter-task communication constraints, the algorithm runs in O(n(n lg n + e)) time, which is n times faster than the currently best known algorithm for this problem. Similar algorithms are developed that produce: (1) optimal schedules for coarse grain graphs; (2) 2-optimal schedules for trees with no task duplication; and (3) optimal schedules for coarse grain trees with no task duplication.
引用
收藏
页码:46 / 55
页数:10
相关论文
共 19 条
[1]  
Aho A.V., 1974, The Design and Analysis of Computer Algorithms
[2]  
Cormen T. H., 1990, INTRO ALGORITHMS
[3]   ON THE GRANULARITY AND CLUSTERING OF DIRECTED ACYCLIC TASK GRAPHS [J].
GERASOULIS, A ;
YANG, T .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1993, 4 (06) :686-701
[4]   SCHEDULING PRECEDENCE GRAPHS IN SYSTEMS WITH INTERPROCESSOR COMMUNICATION TIMES [J].
HWANG, JJ ;
CHOW, YC ;
ANGER, FD ;
LEE, CY .
SIAM JOURNAL ON COMPUTING, 1989, 18 (02) :244-257
[5]  
Kim S., 1988, INT C PARALLEL PROCE, V3, P1
[6]   MULTIPROCESSOR SCHEDULING WITH INTERPROCESSOR COMMUNICATION DELAYS [J].
LEE, CY ;
HWANG, JJ ;
CHOW, YC ;
ANGER, FD .
OPERATIONS RESEARCH LETTERS, 1988, 7 (03) :141-147
[7]   TOWARDS AN ARCHITECTURE-INDEPENDENT ANALYSIS OF PARALLEL ALGORITHMS [J].
PAPADIMITRIOU, CH ;
YANNAKAKIS, M .
SIAM JOURNAL ON COMPUTING, 1990, 19 (02) :322-328
[8]  
Wu M.-Y., 1990, IEEE Transactions on Parallel and Distributed Systems, V1, P330, DOI 10.1109/71.80160
[9]  
[No title captured]
[10]  
[No title captured]