A Low Overhead Logging Scheme for Fast Recovery in Distributed Shared Memory Systems

被引:0
作者
Taesoon Park
Heon Y. Yeom
机构
[1] Sejong University,Department of Computer Engineering
[2] Seoul National University,Department of Computer Science
来源
The Journal of Supercomputing | 2000年 / 15卷
关键词
checkpointing; distributed shared memory system; fault tolerant system; message logging; rollback-recovery;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents an efficient, writer-based logging scheme for recoverable distributed shared memory systems, in which logging of a data item is performed by its writer process, instead of every process that accesses the item logging it. Since the writer process maintains the log of data items, volatile storage can be used for logging. Only the readers' access information needs to be logged into the stable storage of the writer process to tolerate multiple failures. Moreover, to reduce the frequency of stable logging, only the data items accessed by multiple processes are logged with their access information when the items are invalidated, and also semantic-based optimization in logging is considered. Compared with the earlier schemes in which stable logging was performed whenever a new data item was accessed or written by a process, the size of the log and the logging frequency can be significantly reduced in the proposed scheme.
引用
收藏
页码:295 / 320
页数:25
相关论文
共 13 条
[1]  
Chandy M.(1985)Distributed snapshot: Determining global states of distributed systems ACM Trans. on Computer Systems 3 63-75
[2]  
Lamport L.(1979)How to make a multiprocessor computer that correctly executes multiprocess programs IEEE Trans. on Computers C-28 690-691
[3]  
Lamport L.(1978)Reliability issues in computing system design ACM Computing Surveys 10 123-165
[4]  
Randell B.(1991)The causal ordering abstraction and a simple way to implement it Information Processing Letters 39 343-350
[5]  
Lee P. A.(1983)Fail-stop processors: An approach to designing fault-tolerant computing systems ACM Trans. on Computer Systems 1 222-238
[6]  
Treleaven P. C.(1990)Recoverable distributed shared memory IEEE Trans. on Computers 39 460-469
[7]  
Raynal M.(undefined)undefined undefined undefined undefined-undefined
[8]  
Schiper A.(undefined)undefined undefined undefined undefined-undefined
[9]  
Toueg S.(undefined)undefined undefined undefined undefined-undefined
[10]  
Schlichting R. D.(undefined)undefined undefined undefined undefined-undefined