Asocial: Adaptive Task Re-Allocation in Distributed Computing Systems with Node Failures

被引:0
|
作者
Zheng, Weiwei [1 ]
Shen, Yanli [1 ]
Xiao, Taoshun [1 ]
机构
[1] China Acad Elect & Informat Technol, 11 Shuangyuan Rd,High Tech Pk, Beijing, Peoples R China
关键词
Task re-allocation mechanism; distributed computing systems; failure prediction;
D O I
10.23919/apnoms50412.2020.9236979
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Task allocation the problem of efficiently allocating a set of M tasks to a set of N nodes is a fundamental issue in distributed computing systems (DCSs). The problem is particularly challenging in the presence of fail-prone nodes. In this work, we design an adaptive task (re-)allocation mechanism, named Asocial. The Asocial allows us to optimize the utilization of system resources (including computing resources and network resources) while increasing service-level fault tolerance as well as failure resilience. In addition, we propose a cooperative game model based task (re-)allocation approach for cooperation among Physical Nodes (PNs). We consider both the performance states and reliability levels of candidate PNs when deploying tasks. Specifically, we exploit failure prediction techniques to evaluate PNs' reliability levels. As a result, we can utilize the resources efficiently and thus improve the service reliability (i.e., the probability of serving all the tasks before their delivery time). We show by means of numerical evaluations that the proposed Asocial can significantly improve service reliability, system availability as well as resource utilization. In particular, by using failure prediction result (F-measure is around 0.8), the application completion rate (ACR), the task completion rate (TCR) and the computing resource utilization (CRU) reach 85.20%, 85.67% and 78.34%, respectively. Compared with 67.30%, 70.07% and 65.79% of the initial allocation scheme, the performance achieves a significant improvement (26.60%, 22.26% and 19.08%, respectively).
引用
收藏
页码:179 / 184
页数:6
相关论文
共 50 条
  • [21] Data re-allocation enabled cache locking for embedded systems
    Xue, Chun
    Qiu, Keni
    Zhang, Weigong
    Wang, Jing
    Xu, Yuanchao
    Zhao, Mengying
    JOURNAL OF SYSTEMS ARCHITECTURE, 2017, 77 : 3 - 13
  • [22] A new fragment re-allocation strategy for NoSQL database systems
    Zhikun Chen
    Shuqiang Yang
    Shuang Tan
    Li He
    Hong Yin
    Ge Zhang
    Frontiers of Computer Science, 2015, 9 : 111 - 127
  • [23] A new fragment re-allocation strategy for NoSQL database systems
    Zhikun CHEN
    Shuqiang YANG
    Shuang TAN
    Li HE
    Hong YIN
    Ge ZHANG
    Frontiers of Computer Science, 2015, 9 (01) : 111 - 127
  • [24] (WIP) A Dynamic Channel Re-Allocation Scheme for TETRA Systems
    Lu Biao
    Li Hai
    Ning Xu
    Lin Dongyun
    2014 9TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA (CHINACOM), 2014, : 504 - 507
  • [25] Robust Sequential Resource Allocation in Heterogeneous Distributed Systems with Random Compute Node Failures
    Shestak, Vladimir
    Chong, Edwin K. P.
    Maciejewski, Anthony A.
    Siegel, Howard Jay
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 1437 - +
  • [26] Maximizing Service Reliability in Distributed Computing Systems with Random Node Failures: Theory and Implementation
    Pezoa, Jorge E.
    Dhakal, Sagar
    Hayat, Majeed M.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2010, 21 (10) : 1531 - 1544
  • [27] A Variation-Aware Approach for Task Allocation in Wireless Distributed Computing Systems
    Ma, Xiaofu
    Volos, Haris I.
    Zheng, Xiangwei
    Reed, Jeffrey H.
    Bose, Tamal
    2013 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2013, : 5006 - 5011
  • [28] Fragment Re-Allocation Strategy Based on Hypergraph for NoSQL Database Systems
    Chen, Zhikun
    Yang, Shuqiang
    Shang, Yunfei
    Liu, Yong
    Wang, Feng
    Wang, Lu
    Fu, Jingjing
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2016, 8 (03) : 1 - 23
  • [29] An Adaptive Mesh Strategy for High Compressible Flows Based on Nodal Re-Allocation
    Bono, Gustavo
    Awruch, Armando Miguel
    JOURNAL OF THE BRAZILIAN SOCIETY OF MECHANICAL SCIENCES AND ENGINEERING, 2008, 30 (03) : 189 - 196
  • [30] PAR Reduction in OFDM Systems Based on Clipped Power Re-allocation
    Xu, Wenchao
    Yu, Shuyang
    2009 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2009), 2009, : 1291 - 1294