Reliability of A Distributed Computing System Considering Task Abort

被引:0
作者
Peng, Rui [1 ]
Gao, Kaiye [2 ]
Wu, Di [3 ]
Zhai, Qingqing [4 ]
机构
[1] Beijing Univ Technol, Sch Econ & Management, Beijing, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Sch Econ & Management, Beijing, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Management, Xian, Peoples R China
[4] Shanghai Univ, Sch Management, Shanghai, Peoples R China
来源
2019 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-QINGDAO) | 2019年
关键词
distributed computing systems; abort policy; multi-valued decision diagram; reliability; PERFORMABILITY ANALYSIS; POLICY;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper studies the reliability of a distributed computing system where the task on a computer may be aborted if it is still not finished until a time threshold. The mission is regarded as successful if all these tasks are completed by at least one computer before a pre-specified time. An abort policy model is proposed and a multi-valued decision diagram approach is adopted to evaluate the system reliability for any given abort policy. A numerical example is provided to illustrate the applications.
引用
收藏
页数:5
相关论文
共 10 条
[1]   Self-adaptive and reconfigurable distributed computing systems [J].
Bagchi, Susmit .
APPLIED SOFT COMPUTING, 2012, 12 (09) :3023-3033
[2]   Optimal mission abort policy for partially repairable heterogeneous systems [J].
Cha, Ji Hwan ;
Finkelstein, Maxim ;
Levitin, Gregory .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 271 (03) :818-825
[3]   A uniform approach for programming distributed heterogeneous computing systems [J].
Grasso, Ivan ;
Pellegrini, Simone ;
Cosenza, Biagio ;
Fahringer, Thomas .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2014, 74 (12) :3228-3239
[4]   Influence of failure propagation on mission abort policy in heterogeneous warm standby systems [J].
Levitin, Gregory ;
Xing, Liudong ;
Luo, Liang .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2019, 183 :29-38
[5]   Mission abort policy balancing the uncompleted mission penalty and system loss risk [J].
Levitin, Gregory ;
Finkelstein, Maxim ;
Dai, Yuanshun .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2018, 176 :194-201
[6]   Performability analysis of multi-state series-parallel systems with heterogeneous components [J].
Mo, Yuchang ;
Liu, Yu ;
Cui, Lirong .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2018, 171 :48-56
[7]   MDD-based performability analysis of multi-state linear consecutive-k-out-of-n: F systems [J].
Mo, Yuchang ;
Xing, Liudong ;
Cui, Lirong ;
Si, Shubin .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2017, 166 :124-131
[8]   Joint routing and aborting optimization of cooperative unmanned aerial vehicles [J].
Peng, Rui .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2018, 177 :131-137
[9]   Reliability of demand-based phased-mission systems subject to fault level coverage [J].
Peng, Rui ;
Zhai, Qingqing ;
Xing, Liudong ;
Yang, Jun .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2014, 121 :18-25
[10]   Aggregated combinatorial reliability model for non-repairable parallel phased-mission systems [J].
Zhai Qingqing ;
Xing Liudong ;
Peng Rui ;
Yang Jun .
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2018, 176 :242-250