Asocial: Adaptive Task Re-Allocation in Distributed Computing Systems with Node Failures

被引:0
|
作者
Zheng, Weiwei [1 ]
Shen, Yanli [1 ]
Xiao, Taoshun [1 ]
机构
[1] China Acad Elect & Informat Technol, 11 Shuangyuan Rd,High Tech Pk, Beijing, Peoples R China
关键词
Task re-allocation mechanism; distributed computing systems; failure prediction;
D O I
10.23919/apnoms50412.2020.9236979
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Task allocation the problem of efficiently allocating a set of M tasks to a set of N nodes is a fundamental issue in distributed computing systems (DCSs). The problem is particularly challenging in the presence of fail-prone nodes. In this work, we design an adaptive task (re-)allocation mechanism, named Asocial. The Asocial allows us to optimize the utilization of system resources (including computing resources and network resources) while increasing service-level fault tolerance as well as failure resilience. In addition, we propose a cooperative game model based task (re-)allocation approach for cooperation among Physical Nodes (PNs). We consider both the performance states and reliability levels of candidate PNs when deploying tasks. Specifically, we exploit failure prediction techniques to evaluate PNs' reliability levels. As a result, we can utilize the resources efficiently and thus improve the service reliability (i.e., the probability of serving all the tasks before their delivery time). We show by means of numerical evaluations that the proposed Asocial can significantly improve service reliability, system availability as well as resource utilization. In particular, by using failure prediction result (F-measure is around 0.8), the application completion rate (ACR), the task completion rate (TCR) and the computing resource utilization (CRU) reach 85.20%, 85.67% and 78.34%, respectively. Compared with 67.30%, 70.07% and 65.79% of the initial allocation scheme, the performance achieves a significant improvement (26.60%, 22.26% and 19.08%, respectively).
引用
收藏
页码:179 / 184
页数:6
相关论文
共 50 条
  • [1] Task allocation in distributed computing systems using adaptive particle swarm optimisation
    Subashini, G.
    Bhuvaneswari, M. C.
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2012, 44 (04) : 293 - 302
  • [2] On task allocation in heterogeneous distributed computing systems
    Ignatius, PP
    Murthy, CSR
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 1997, 12 (04): : 231 - 238
  • [3] A TASK ALLOCATION MODEL FOR DISTRIBUTED COMPUTING SYSTEMS
    MA, PYR
    LEE, EYS
    TSUCHIYA, M
    IEEE TRANSACTIONS ON COMPUTERS, 1982, 31 (01) : 41 - 47
  • [4] On task allocation in heterogeneous distributed computing systems
    Indian Inst of Technology, Madras, India
    Comput Syst Sci Eng, 4 (231-238):
  • [5] TASK ALLOCATION TECHNIQUES FOR DISTRIBUTED COMPUTING SYSTEMS - A REVIEW
    SAGAR, G
    SARJE, AK
    AHMED, KU
    JOURNAL OF MICROCOMPUTER APPLICATIONS, 1989, 12 (02): : 97 - 105
  • [6] Allocation and re-allocation of data in a Grid using an adaptive Genetic Algorithm
    Siefoddini, Hamed
    El-Fakih, Khaled
    Kawash, Jalal
    Mansour, Nashat
    2006 IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1-3, 2006, : 858 - +
  • [7] Precedence-constrained task allocation in distributed computing systems
    Vidyarthi, DP
    Tripathi, AK
    INTERNATIONAL JOURNAL OF HIGH SPEED COMPUTING, 1996, 8 (01): : 47 - 55
  • [8] Task allocation algorithms for maximizing reliability of distributed computing systems
    Kartik, S
    Murthy, CSR
    IEEE TRANSACTIONS ON COMPUTERS, 1997, 46 (06) : 719 - 724
  • [9] Reliability oriented task allocation in heterogeneous distributed computing systems
    Attiya, G
    Hamam, Y
    ISCC2004: NINTH INTERNATIONAL SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2004, : 68 - 73
  • [10] Dynamic task allocation models for large distributed computing systems
    Chang, HWD
    Oldham, WJB
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1995, 6 (12) : 1301 - 1315