A fault-tolerance mechanism in grid

被引:1
|
作者
Jin, L [1 ]
Tong, WQ [1 ]
Tang, HQ [1 ]
Wang, B [1 ]
机构
[1] Shanghai Univ, Sch Engn & Comp Sci, Shanghai 200072, Peoples R China
关键词
D O I
10.1109/INDIN.2003.1300379
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Grid appears as an effective technology coupling geographically distributed resources for solving large-scale problems in the wide area network. Fault tolerance in Gird system is a significant and complex issue to secure a stable and reliable performance. Until now, various technique exist for detecting and correcting faults in distributed computing sytems. Unfortunately, few energy focus on fault-tolerance in Grid environment, especially with the emergence of OGSA. A new fault-tolerant mechanism is needed to detect and recover service faults and nodes crash. Based on our previous work on Java threads state capturing and existing Mobile Agent techniques, we put forward a fault-tolerant mechanism in this paper, providing effective fault-handling and recovering methods.
引用
收藏
页码:457 / 461
页数:5
相关论文
共 50 条
  • [1] A new fault-tolerance framework for grid computing
    Derbal, Youcef
    MULTIAGENT AND GRID SYSTEMS, 2006, 2 (02) : 115 - 133
  • [2] Supporting fault-tolerance in streaming grid applications
    Zhu, Qian
    Chen, Liang
    Agrawal, Gagan
    2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 1679 - 1690
  • [3] Supporting Fault-Tolerance in Streaming Grid Applications
    Zhu, Qian
    Chen, Liang
    Agrawal, Gagan
    PROCEEDINGS OF THE 2007 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING PPOPP'07, 2007, : 156 - 157
  • [4] Fault-Tolerance Mechanism of Computation Grid Service System Based on Mobile Agent
    Zhang, Zhirou
    Li, Ying
    2008 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL 1, PROCEEDINGS, 2008, : 161 - +
  • [5] A distributed fault-tolerance mechanism in UNIX
    Gantenbein, RE
    Yu, ZJ
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 1996, : 146 - 149
  • [6] A drug discovery grid environment with fault-tolerance support
    Wang, Yongjian
    Ren, Yinan
    Chen, Ting
    Huang, Yuanqiang
    Yu, Kunqian
    Luan, Zhongzhi
    Jiang, Hualiang
    Qian, Depei
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2009, 43 (12): : 21 - 25
  • [8] A fault-tolerance mechanism for mobile agent systems
    Leung, Kwai Ki
    Ng, Kam Wing
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 1, PROCEEDINGS, 2006, : 1006 - +
  • [9] FAULT-TOLERANCE
    GROSSPIETSCH, KE
    MICROPROCESSING AND MICROPROGRAMMING, 1993, 38 (1-5): : 783 - 783
  • [10] Designing masking fault-tolerance via nonmasking fault-tolerance
    Arora, A
    Kulkarni, SS
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1998, 24 (06) : 435 - 450