Mutable checkpoints: A new checkpointing approach for mobile computing systems

被引:68
作者
Cao, GH [1 ]
Singhal, M
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[2] Ohio State Univ, Dept Comp & Informat Sci, Columbus, OH 43210 USA
关键词
mobile computing; coordinated checkpointing; causal dependency; nonblocking;
D O I
10.1109/71.910871
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Mobile computing raises many new issues such as lack of stable storage, low bandwidth of wireless channel, high mobility, and limited battery life. These new issues make traditional checkpointing algorithms unsuitable. Coordinated checkpointing is an attractive approach for transparently adding fault tolerance to distributed applications since it avoids domino effects and minimizes the stable storage requirement. However, it suffers from high overhead associated with the checkpointing process in mobile computing systems. Two approaches have been used to reduce the overhead: First is to minimize the number of synchronization messages and the number of checkpoints; the other is to make the checkpointing process nonblocking. These two approaches were orthogonal previously until the Prakash-Singhal algorithm [28] combined them. However, we [8] found that this algorithm may result in an inconsistency in some situations and we proved that there does not exist a nonblocking algorithm which forces only a minimum number of processes to take their checkpoints. In this paper, we introduce the concept of "mutable checkpoint," which is neither a tentative checkpoint nor a permanent checkpoint, to design efficient checkpointing algorithms for mobile computing systems. Mutable checkpoints can be saved anywhere, e.g., the main memory or local disk of MHs. In this way, taking a mutable checkpoint avoids the overhead of transferring large amounts of data to the stable storage at MSSs over the wireless network. We present techniques to minimize the number of mutable checkpoints. Simulation results show that the overhead of taking mutable checkpoints is negligible. Based on mutable checkpoints, our nonblocking algorithm avoids the avalanche effect and forces only a minimum number of processes to take their checkpoints on the stable storage.
引用
收藏
页码:157 / 172
页数:16
相关论文
共 35 条
[1]  
ACHARYA A, 1994, P 3 INT C PAR DISTR
[2]   Mobility management in next-generation wireless systems [J].
Akyildiz, IF ;
McNair, J ;
Ho, JSM ;
Uzunalioglu, H ;
Wang, WY .
PROCEEDINGS OF THE IEEE, 1999, 87 (08) :1347-1384
[3]  
BARIGAZZI G, 1983, FAULT TOL COMP SYST, V13, P48
[4]  
BHARGAVA B, 1990, PROCEEDINGS : 6TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, P182
[5]  
Bhargava B., 1988, Proceedings. Seventh Symposium on Reliable Distributed Systems (IEEE Cat. No.88CH2612-0), P3, DOI 10.1109/RELDIS.1988.25775
[6]   Low-cost checkpointing with mutable checkpoints in mobile computing systems [J].
Cao, GH ;
Singhal, M .
18TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 1998, :464-471
[7]   On coordinated checkpointing in distributed systems [J].
Cao, GH ;
Singhal, M .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1998, 9 (12) :1213-1225
[8]   On the impossibility of min-process non-blocking checkpointing and an efficient checkpointing algorithm for mobile computing systems [J].
Cao, GH ;
Singhal, M .
1998 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - PROCEEDINGS, 1998, :37-44
[9]  
CHANDY KM, 1985, ACM T COMPUTER S FEB
[10]  
Cristian F., 1991, Proceedings. Tenth Symposium on Reliable Distributed Systems (Cat. No.91CH3021-3), P12, DOI 10.1109/RELDIS.1991.145399