Gossip-based fault-tolerant load balancing algorithm with low communication overhead

被引:6
作者
Chatterjee, Moumita [1 ]
Mitra, Anirban [2 ]
Setua, Sanjit Kumar [1 ]
Roy, Sudipta [3 ]
机构
[1] Calcutta Univ Technol Campus, Dept Comp Sci & Engn, JD 2,Sect 3, Kolkata 700098, India
[2] Acad Technol, Dept Comp Sci & Engn, Adisaptagram 712121, W Bengal, India
[3] Washington Univ, PRT2L, St Louis, MO 63110 USA
关键词
Load balancing; Fault-tolerance; Gossip protocol; Expander graph; Stability threshold; Network optimization;
D O I
10.1016/j.compeleceng.2019.106517
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In large-scale distributed-computing environments, numerous factors may affect the predictable performance of load balancing policies causing variable load distribution in the network and bottlenecks. This paper presents a two-level load balancing algorithm for solving the dynamic load balancing problem considering the loss of processors and connectivity of the network. The proposed method privileges local load balancing over global load balancing thus reduces the communication cost over the global network. To further minimize communication costs and to handle loss of connectivity, the algorithm proposes a new communication model for global communication among the processors. The performance, correctness, and scalability of the algorithm are analyzed which show that our algorithm requires O(log m) rounds and O(m log m) messages for m clusters and achieves minimum cluster and global load deviation. Simulation results also support our analytical studies. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:16
相关论文
共 14 条
[1]  
Chatterjee M, 2015, IEEE P 2015 3 INT C
[2]   Gossip based fault tolerant protocol in distributed transactional memory using quorum based replication system [J].
Chatterjee, Moumita ;
Mitra, Anirban ;
Roy, Sudipta ;
Setua, Sanjit Kumar .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (02) :1103-1124
[3]  
Dhakal S, 2006, IEEE P 20 INT PAR DI
[4]   Optimal dispersal of certificate chains [J].
Jung, Eunjin ;
Elmallah, Ehab S. ;
Gouda, Mohamed G. .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2007, 18 (04) :474-484
[5]  
Doerr B, 2009, LECT NOTES COMPUTER, V5555
[6]  
Ghanem J, 2005, IFAC P, V38, P124
[7]  
Han K., 2007, P IEEE PAC RIM INT S
[8]   Hydrodynamic load balancing [J].
Hui, CC ;
Chanson, ST .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1999, 10 (11) :1118-1137
[9]  
Kasprzyk R., 2012, J TELECOMMUN INF TEC
[10]  
Patel B, 2017, INT J GRID DISTRIB, V10, P11, DOI 10.14257/ijgdc.2017.10.8.02