Exploiting Replicated Cache Blocks to Reduce L2 Cache Leakage in CMPs

Cited by: 6
Authors
Kim, Hyunhee [1]
Ahn, Jung Ho [2]
Kim, Jihong [3]
Affiliations
[1] Samsung Elect Co Ltd, Div Mobile Commun, Suwon 443742, South Korea
[2] Seoul Natl Univ, Grad Sch Convergence Sci & Technol, Seoul 151742, South Korea
[3] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul 151742, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Cache coherence; chip multiprocessors (CMPs); leakage power management; private L2 caches; replication;
DOI
10.1109/TVLSI.2012.2220791
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Modern chip multiprocessors (CMPs) employ large L2 caches to reduce the performance gap between processors and off-chip memory. However, as the size of an L2 cache increases, its leakage power consumption also becomes a major contributor to the total power dissipation. Managing the leakage power of L2 caches, therefore, is an important issue in realizing low-power CMPs. In CMPs with private L2 caches, each processor makes a copy of the data in its local cache in order to access the data faster, which is called replication. In this paper, we propose a novel leakage management technique that dynamically turns off replications in private L2 caches for leakage power reduction by exploiting two key observations: 1) the cost of an extra cache miss due to the turned-off replication is small because the same cache block exists in another on-chip cache and 2) turning off the replication incurs no extra cache miss if it is invalidated by other processors in order to maintain cache coherence. Since blindly turning off the frequently accessed replications can degrade performance, the proposed technique dynamically controls the number of turned-off replications. The proposed technique can be implemented by slightly modifying the MESI protocol with a new turned-off shared (TOS) coherence state. The TOS state indicates that the corresponding block is shared by other caches but turned off. Experiments on a four-processor CMP with private L2 caches show that the proposed technique reduces the energy consumption of the L2 caches and the main memory by 19.4% on average, with less than 1% performance loss over the existing cache leakage management technique.
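The abstract states only that a turned-off shared (TOS) state is added to MESI, so the following is a minimal sketch of how such a state could sit in a per-block state machine, not the authors' protocol. The event names (`turn_off`, `local_read`, `local_write`, `remote_read`, `remote_write`), the transition table, and the helper `data_array_powered` are all hypothetical and chosen for illustration. The sketch tries to reflect the two observations in the abstract: a local access to a TOS block can be refilled from another on-chip cache, and a remote write invalidates a TOS block just as it would a shared one.

```python
from enum import Enum


class State(Enum):
    M = "modified"
    E = "exclusive"
    S = "shared"
    I = "invalid"
    TOS = "turned-off shared"  # state named in the abstract


# Hypothetical transition table; only a subset of MESI is modeled.
# Unlisted (state, event) pairs leave the state unchanged.
TRANSITIONS = {
    (State.S, "turn_off"): State.TOS,      # power down a replicated block
    (State.S, "local_write"): State.M,
    (State.S, "remote_write"): State.I,
    (State.TOS, "local_read"): State.S,    # assumed refill from another on-chip cache
    (State.TOS, "local_write"): State.M,   # assumed refill, then upgrade
    (State.TOS, "remote_read"): State.TOS,
    (State.TOS, "remote_write"): State.I,  # coherence invalidation, no data lost
    (State.E, "remote_read"): State.S,
    (State.M, "remote_read"): State.S,
}


def next_state(state: State, event: str) -> State:
    """Return the state after `event`, per the hypothetical table above."""
    return TRANSITIONS.get((state, event), state)


def data_array_powered(state: State) -> bool:
    """Assumption: only blocks holding valid local data keep their cells powered."""
    return state in {State.M, State.E, State.S}


# Usage: a shared block is turned off, then invalidated by a remote writer.
block = next_state(State.S, "turn_off")
after_remote_write = next_state(block, "remote_write")
```

In this sketch a block that is turned off and later invalidated by a remote write ends in the same state it would have reached without being turned off, which is the case the abstract describes as incurring no extra cache miss. How the technique limits the number of turned-off replications is not specified in the abstract and is not modeled here.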
Pages: 1863-1877
Page count: 15