GROUP-TO-GROUP COMMUNICATIONS FOR FAULT-TOLERANCE IN DISTRIBUTED SYSTEMS

被引:0
作者
HIGAKI, H
SONEOKA, T
机构
关键词
FAULT-TOLERANCE; DISTRIBUTED SYSTEMS; PROCESS GROUP; GROUP COMMUNICATIONS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a group-to-group communications algorithm that can extend the range of distributed systems where we can achieve active replication fault-tolerance to partner model distributed systems, in which all processes communicate with each other on an equal footing. Active replication approach, in which all replicated processes are active, can achieve fault-tolerance with low overhead because checkpoint setting and rollback are not required for recovery from process failure. This algorithm guarantees that each replicated process in a process group has the same execution history and that communications between process groups keeps consistency even in the presence of process failure and message loss. The number of control messages that must be transmitted between processes for a communication between process groups is only a linear order of the number of replicated processes in each process group. Furthermore, this algorithm reduces the overhead for reconfiguration of a process group by keeping process failure and recovery information local to each process group.
引用
收藏
页码:1348 / 1357
页数:10
相关论文
empty
未找到相关数据