Load-Optimal Local Fast Rerouting for Dense Networks

被引:8
作者
Borokhovich, Michael [1 ]
Pignolet, Yvonne-Anne [2 ]
Schmid, Stefan [3 ]
Tredan, Gilles [4 ]
机构
[1] AT&T Labs, Florham Pk, NJ 07932 USA
[2] ABB Corp Res, CH-8037 Baden, Switzerland
[3] Univ Vienna, Fac Comp Sci, A-1010 Vienna, Austria
[4] CNRS, Lab Architecture & Anal Syst, F-1000 Toulouse, France
关键词
Computer network reliability; distributed algorithms; FAILURES;
D O I
10.1109/TNET.2018.2871089
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Reliable and highly available computer networks must implement resilient fast rerouting mechanisms: upon a link or node failure, an alternative route is determined quickly, without involving the network control plane. Designing such fast failover mechanisms capable of dealing with multiple concurrent failures, however, is challenging, as failover rules need to be installed proactively, i.e., ahead of time, without knowledge of the actual failures happening at runtime. Indeed, only little is known today about the design of resilient routing algorithms. This paper introduces a general framework to reason about and design local failover algorithms that minimize the resulting load after failover on dense networks, beyond destination-based routing. We show that due to the inherent locality of the failover decisions at runtime, the problem is fundamentally related to the field of distributed algorithms without coordination. We derive an intriguing lower bound on the inherent network load overhead any local fast failover scheme that will introduce in the worst case, even though globally seen, much more balanced traffic allocations exist. We then present different randomized and deterministic failover algorithms and analyze their overhead load. In particular, we build upon the theory of combinatorial designs and develop a novel deterministic failover mechanism based on symmetric block design theory, which tolerates a maximal number of link failures while ensuring low loads.
引用
收藏
页码:2583 / 2597
页数:15
相关论文
共 37 条
[1]   A scalable, commodity data center network architecture [J].
Al-Fares, Mohammad ;
Loukissas, Alexander ;
Vahdat, Amin .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2008, 38 (04) :63-74
[2]  
Atlas, 2008, 5286 IETF RFC
[3]  
Borokhovich Michael, 2013, Principles of Distributed Systems. 17th International Conference, OPODIS 2013. Proceedings: LNCS 8304, P68, DOI 10.1007/978-3-319-03850-6_6
[4]  
Borokhovich Michael., 2014, P 3 WORKSHOP HOT TOP, P121
[5]  
Canini Marco, 2015, 2015 IEEE Conference on Computer Communications (INFOCOM). Proceedings, P190, DOI 10.1109/INFOCOM.2015.7218382
[6]  
Chiesa M., 2016, ICALP
[7]  
Chiesa M, 2016, IEEE INFOCOM SER
[8]  
Dolev S., 1999, SIROCCO 6. Proceedings of the 6th International Colloquium on Structural Information and Communication Complexity, P111
[9]  
Elhourani T, 2014, IEEE INFOCOM SER, P2148, DOI 10.1109/INFOCOM.2014.6848157
[10]  
Enyedi G., 2007, P M EUR NETW U CO IN