A DISTRIBUTED-TERMINATION EXPERIMENT ON A MESH-CONNECTED ARRAY OF PROCESSORS

被引:2
作者
SONG, JJ
机构
[1] Department of Electrical Engineering, National University of Singapore, Singapore
关键词
ASYNCHRONOUS ITERATION; DISTRIBUTED TERMINATION; FINITE ELEMENT COMPUTATION; HYPERCUBE COMPUTER; MESH;
D O I
10.1016/0167-8191(92)90044-8
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In totally asynchronous computation any number of processing elements (PEs) may start simultaneously. Termination detection for it is difficult because of asynchronousness and lack of single root (initiator) to accumulate termination information. This paper presents a symmetric, asynchronous, and distributed solution to the above problem for a mesh-connected array of PEs and its implementation and comparison on a parallel computer. Our algorithm produces less messages for termination purpose than the various counter token schemes. The worst time for it to finish is O(square-root n), where n is the number of PEs on a square mesh. An experiment on a 64-node NCUBE parallel computer showed that it was at least twice as fast as a centralized termination detection algorithm. The algorithm makes three assumptions: only nearest-neighbor communication exists; activation messages are always broadcast to all neighboring PEs to avoid deadlock; and any number of PEs can initiate the computation. It defines tokens and dependency arcs: tokens to collect termination states of regions of the mesh and dependency arcs to establish terminating precedence among connected PEs. They together reduce the number of messages for termination detection. Two sufficient conditions are shown to guarantee that any one of the PEs may detect termination. Although the algorithm was originated for use with totally asynchronous computations for the finite element analysis, it is applicable to any iterative computations on a mesh.
引用
收藏
页码:779 / 791
页数:13
相关论文
共 21 条