Dynamic data prefetching in home-based software DSMs

被引:0
作者
Weiwu Hu
Fuxin Zhang
Haiming Liu
机构
[1] The Chinese Academy of Sciences,Institute of Computing Technology
来源
Journal of Computer Science and Technology | 2001年 / 16卷
关键词
software DSM; remote access; prefetching; performance evaluation;
D O I
暂无
中图分类号
学科分类号
摘要
A major overhead in software DSM (Distributed Shared Memory) is the cost of remote memory accesses necessitated by the protocol as well as induced by false sharing. This paper introduces a dynamic prefetching method implemented in the JIAJIA software DSM to reduce system overhead caused by remote accesses. The prefetching method records the interleaving string of INV (invalidation) and GETP (getting a remote page) operations for each cached page and analyzes the periodicity of the string when a page is invalidated on a lock or barrier. A prefetching request is issued after the lock or barrier if the periodicity analysis indicates that GETP will be the next operation in the string. Multiple prefetching requests are merged into the same message if they are to the same host. Performance evaluation with eight well-accepted benchmarks in a cluster of sixteen Power PC workstations shows that the prefetching scheme can significantly reduce the page fault overhead and as a result achieves a performance increase of 15%–20% in three benchmarks and around 8%–10% in another three. The average extra traffic caused by useless prefetches is only 7%–13% in the evaluation.
引用
收藏
页码:231 / 241
页数:10
相关论文
共 23 条
[1]  
Weiwu Hu(1998)A lock-based cache coherence protocol for scope consistency Journal of Computer Science and Technology 13 97-109
[2]  
Weisong Shi(1997)Quantifying the performance differences between PVM and TreadMarks Journal of Parallel and Distributed Computing 43 65-78
[3]  
Zhimin Tang(1997)Effectiveness of dynamic prefetching in multiple-writer distributed virtual shared memory system Journal of Parallel and Distributed Computing 43 79-93
[4]  
Ming Li(1991)Tolerating latency through software-controlled prefetching in shared-memory multiprocessors Journal of Parallel and Distributed Computing 12 87-106
[5]  
Lu H(1994)Parallelization of general linkage analysis problems Human Heredity 44 127-141
[6]  
Dwarkadas S(1994)Strategies for multilocus analysis in humans PNAS 81 3443-3446
[7]  
Cox A(1994)Avoiding recompoudation in genetic linkage analysis Human Heredity 44 225-237
[8]  
Zwaenepoel W(undefined)undefined undefined undefined undefined-undefined
[9]  
Karlsson M(undefined)undefined undefined undefined undefined-undefined
[10]  
Stenstrom P(undefined)undefined undefined undefined undefined-undefined