A Failure Handling Framework for Distributed Data Mining Services on the Grid

被引:0
作者
Cesario, Eugenio [1 ]
Talia, Domenico [2 ]
机构
[1] ICAR CNR, Arcavacata Di Rende, Italy
[2] Univ Calabria, ICAR CNR, Arcavacata Di Rende, Italy
来源
PROCEEDINGS OF THE 19TH INTERNATIONAL EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING | 2011年
关键词
Distributed Data Mining; Fault Tolerance; Grid computing; FAULT-TOLERANCE;
D O I
10.1109/PDP.2011.50
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Fault tolerance is an important issue in Grid computing, where many and heterogenous machines are used. In this paper we present a flexible failure handling framework which extends a service-oriented architecture for Distributed Data Mining previously proposed, addressing the requirements for fault tolerance in the Grid. The framework allows users to achieve failure recovery whenever a crash can occur on a Grid node involved in the computation. The implemented framework has been evaluated on a real Grid setting to assess its effectiveness and performance.
引用
收藏
页码:70 / 79
页数:10
相关论文
共 21 条
[21]  
Zhang X, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, P105