Reliability and Survivability Analysis of Data Center Network Topologies

被引:32
|
作者
Couto, Rodrigo de Souza [1 ,2 ]
Secci, Stefano [3 ]
Mitre Campista, Miguel Elias [1 ]
Maciel Kosmalski Costa, Luis Henrique [1 ]
机构
[1] Univ Fed Rio de Janeiro, POLI DEL, COPPE PEE GTA, POB 68504, BR-21941972 Rio De Janeiro, RJ, Brazil
[2] Univ Estado Rio de Janeiro, FEN DETEL PEL, BR-20550013 Rio De Janeiro, RJ, Brazil
[3] Univ Paris 06, Sorbonne Univ, UMR 7606, LIP6, F-75005 Paris, France
关键词
Data center networks; Cloud networks; Survivability; Reliability; Robustness; AVAILABILITY; FRAMEWORK; COST;
D O I
10.1007/s10922-015-9354-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The architecture of several data centers have been proposed as alternatives to the conventional three-layer one. Most of them employ commodity equipment for cost reduction. Thus, robustness to failures becomes even more important, because commodity equipment is more failure-prone. Each architecture has a different network topology design with a specific level of redundancy. In this work, we aim at analyzing the benefits of different data center topologies taking the reliability and survivability requirements into account. We consider the topologies of three alternative data center architecture: Fat-tree, BCube, and DCell. Also, we compare these topologies with a conventional three-layer data center topology. Our analysis is independent of specific equipment, traffic patterns, or network protocols, for the sake of generality. We derive closed-form formulas for the Mean Time To Failure of each topology. The results allow us to indicate the best topology for each failure scenario. In particular, we conclude that BCube is more robust to link failures than the other topologies, whereas DCell has the most robust topology when considering switch failures. Additionally, we show that all considered alternative topologies outperform a three-layer topology for both types of failures. We also determine to which extent the robustness of BCube and DCell is influenced by the number of network interfaces per server.
引用
收藏
页码:346 / 392
页数:47
相关论文
共 50 条
  • [1] Reliability and Survivability Analysis of Data Center Network Topologies
    Rodrigo de Souza Couto
    Stefano Secci
    Miguel Elias Mitre Campista
    Luís Henrique Maciel Kosmalski Costa
    Journal of Network and Systems Management, 2016, 24 : 346 - 392
  • [2] Subsystem Reliability Analysis of Data Center Network BCube
    Wang, Yihong
    Fan, Weibei
    Fan, Jianxi
    Zhou, Jingya
    Cheng, Baolei
    IEEE TRANSACTIONS ON RELIABILITY, 2024, 73 (04) : 1946 - 1957
  • [3] Reliability assessment and profit analysis of distributed data center network topology
    Sanusi A.
    Yusuf I.
    Life Cycle Reliability and Safety Engineering, 2022, 11 (1) : 75 - 86
  • [4] Various Network Topologies and an Analysis Comparative Between Fat-Tree and BCube for a Data Center Network: An Overview
    Cortes Castillo, Antonio
    2022 IEEE CLOUD SUMMIT, 2022, : 1 - 8
  • [5] Wireless Network Reliability Analysis for Arbitrary Network Topologies
    Basaran, Semiha Tedik
    Kurt, Gunes Karabulut
    Kschischang, Frank R.
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (03) : 2788 - 2797
  • [6] A Comparative Analysis of Network Dependability, Fault-tolerance, Reliability, Security, and Survivability
    Al-Kuwaiti, M.
    Kyriakopoulos, N.
    Hussein, S.
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2009, 11 (02): : 106 - 124
  • [7] Information Fusion Reliability Analysis for Component Survivability
    Blasch, Erik
    Bai, Li
    Chen, Genshe
    PROCEEDINGS OF THE 2012 IEEE NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON), 2012, : 212 - 219
  • [8] Reliability Evaluation of Data Center Network DCell
    Lv, Mengjie
    Zhou, Shuming
    Sun, Xueli
    Lian, Guanqin
    Liu, Jiafei
    PARALLEL PROCESSING LETTERS, 2018, 28 (04)
  • [9] Reliability-Sustainable Network Survivability Scheme Against Disaster Failures
    Bao, Ning-Hai
    Su, Guo-Qing
    Wu, Ya-Kun
    Kuang, Ming
    Luo, Da-Yong
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (IEEE CITS), 2017, : 334 - 337
  • [10] A Large Scale Study of Data Center Network Reliability
    Meza, Justin
    Xu, Tianyin
    Veeraraghavan, Kaushik
    Mutlu, Onur
    IMC'18: PROCEEDINGS OF THE INTERNET MEASUREMENT CONFERENCE, 2018, : 393 - 407