Brief Announcement: Robust Network Supercomputing Without Centralized Control

被引：0

作者：

Davtyan, Seda ^{[1
]}

Konwar, Kishori M.

Shvartsman, Alexander A. ^{[1
]}

机构：

[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA

来源：

PODC 11: PROCEEDINGS OF THE 2011 ACM SYMPOSIUM PRINCIPLES OF DISTRIBUTED COMPUTING | 2011年

关键词：

Distributed Algorithms; Fault-Tolerance; Randomized Algorithms; Internet Supercomputing;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Traditional approaches to network supercomputing employ a master process and a large number of potentially undependable worker processes that must perform a collection of tasks on behalf of the master. In such a centralized scheme, the master process is a performance bottleneck and a single point of failure. This work develops an original approach that eliminates the master and instead uses a decentralized algorithm, where each worker is able to determine locally that all tasks have been performed, and to collect locally the results of all tasks. The failure model assumes that the average probability of a worker returning a wrong result is inferior to 1/2. A randomized synchronous algorithm for a processes and n tasks is presented. The algorithm terminates in circle minus(log n) rounds, and it is proved that upon termination the workers know the results of all tasks with high probability, and that these results are correct with high probability. The message complexity of the algorithm is circle minus(n log n), and the bit complexity is O(n(2) log(3) n).

引用

页码：293 / 294

页数：2