CASPR: Connectivity-Aware Scheduling for Partition Resilience

被引:0
|
作者
Qunaibi, Sara [1 ]
Udayashankar, Sreeharsha [1 ]
Al-Kiswany, Samer [1 ,2 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
[2] Acronis Res, Toronto, ON, Canada
关键词
cloud computing; computer networks; network partitions; fault tolerance;
D O I
10.1109/SRDS60354.2023.00017
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present a comprehensive empirical study of the impact partial network partitions have on cluster managers in data analysis frameworks. Our study shows that modern scheduling approaches are vulnerable to partial network partitions. Partial partitions can lead to a complete cluster pause or a significant loss of performance. To overcome the shortcomings of the state-of-the-art schedulers, we design CASPR, a connectivity-aware scheduler. CASPR incorporates the current network connectivity information when making scheduling decisions to allocate fully connected nodes for a given application. CASPR effectively hides partial partitions from applications. Our evaluation of a CASPR prototype shows that it can tolerate partial network partitions, as well as eliminate application halting or significant loss of performance.
引用
收藏
页码:70 / 81
页数:12
相关论文
共 50 条
  • [31] Online Connectivity-aware Dynamic Deployment for Heterogeneous Multi-Robot Systems
    Lin, Chendi
    Luo, Wenhao
    Sycara, Katia
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 8941 - 8947
  • [32] A Complete Set of Connectivity-aware Local Topology Manipulation Operations for Robot Swarms
    Soma, Karthik
    Khateri, Koresh
    Pourgholi, Mahdi
    Montazeri, Mohsen
    Sabattini, Lorenzo
    Beltrame, Giovanni
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 5522 - 5529
  • [33] Robust Connectivity-Aware Energy-Efficient Routing for Wireless Sensor Networks
    Pandana, Charles
    Liu, K. J. Ray
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2008, 7 (10) : 3904 - 3916
  • [34] Connectivity-Aware Minimum-Delay Geographic Routing with Vehicle Tracking in VANETs
    Shafiee, Kaveh
    Leung, Victor C. M.
    AD HOC NETWORKS, 2010, 28 : 256 - 267
  • [35] Enabling Heterogeneous mMTC by Energy-efficient and Connectivity-aware Clustering and Routing
    Li, Zhehan
    Chen, Jun
    Ni, Rui
    Chen, Si
    Li, Xu
    Zhao, Qiyang
    2017 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2017,
  • [36] Performance Analysis of Connectivity Probability and Connectivity-Aware MAC Protocol Design for Platoon-Based VANETs
    Shao, Caixing
    Leng, Supeng
    Zhang, Yan
    Vinel, Alexey
    Jonsson, Magnus
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2015, 64 (12) : 5596 - 5609
  • [37] Connectivity-Aware Fast Network Forming Aided By Digital Twin For Emergency Use
    Guo, Terry N.
    IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022,
  • [38] CONNECTIVITY-AWARE TOPOLOGY CONTROL WITH CYCLIC-LIKE STRUCTURES IN WIRELESS SENSOR NETWORKS
    Huang Zhiwei
    Zheng Zimu
    Li Zhicheng
    Peng Xinyi
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2014, 7 (04): : 1663 - 1682
  • [39] Coverage and connectivity-aware clustering within k hops in wireless sensor and actuator networks
    TUAN Chiu-Ching
    WU Yi-Chao
    Science China(Information Sciences), 2014, 57 (06) : 125 - 140
  • [40] Connectivity-aware Graph: A planar topology for 3D building surface reconstruction
    Yang, Shengming
    Cai, Guorong
    Du, Jing
    Chen, Ping
    Su, Jinhe
    Wu, Yundong
    Wang, Zongyue
    Li, Jonathan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 191 : 302 - 314