Circuit and microarchitectural techniques for reducing cache leakage power

被引:98
作者
Kim, NS [1 ]
Flautner, K
Blaauw, D
Mudge, T
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[2] ARM Ltd, Adv Res, Cambridge CB1 9NJ, England
关键词
dynamic voltage scaling; L1; caches; low power; subthreshold leakage power;
D O I
10.1109/TVLSI.2003.821550
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
On-chip caches represent a sizable fraction of the total power consumption of microprocessors. As feature sizes shrink, the dominant component of this power consumption will be leakage. However, during a fixed period of time, the activity in a data cache is only centered on a small subset of the lines. This behavior can be exploited to cut the leakage power of large data caches by putting the cold cache lines into a state preserving, low-power drowsey mode. In this paper, we investigate policies and circuit techniques for implementing drowsy data caches. We show that with simple microarchitectural techniques, about 80%-90% of the data cache lines can be maintained in a drowsy state without affecting performance by more than 0.6%, even though moving lines into and out of a drowsy state incurs a slight performance loss. According to our projections, in a 70-nm complementary metal-oxide-semiconductor process, drowsy data caches will be able to reduce the total leakage energy consumed in the caches by 60%-75%. In addition, we extend the drowsy cache concept to reduce leakage power of instruction caches without significant impact on execution time. Our results show that data and instruction caches require different control strategies for efficient execution. In order to enable drowsy instruction caches, we propose a technique called cache subbank prediction, which is used to selectively wake up only the necessary parts of the instruction cache, while allowing most of the cache to stay in a low-leakage drowsy mode. This prediction technique reduces the negative performance impact by 78% compared with the no-prediction policy. Our technique works well even with small predictor sizes and enables a 75% reduction of leakage energy in a 32-kB instruction cache.
引用
收藏
页码:167 / 184
页数:18
相关论文
共 32 条
  • [1] A single-Vt low-leakage gated-Ground cache for deep submicron
    Agarwal, A
    Li, H
    Roy, K
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2003, 38 (02) : 319 - 328
  • [2] [Anonymous], 2001, AUTOMATIC PERFORMANC, DOI [10.1145/381677.381702, DOI 10.1145/381677.381702]
  • [3] SimpleScalar: An infrastructure for computer system modeling
    Austin, T
    Larson, E
    Ernst, D
    [J]. COMPUTER, 2002, 35 (02) : 59 - +
  • [4] Azizi N, 2002, ISLPED'02: PROCEEDINGS OF THE 2002 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, P48, DOI 10.1109/LPE.2002.1029518
  • [5] DOUSEKI T, 2000, P IEEE INT SOI C OCT, P24
  • [6] Doyle B., 2002, INTEL TECH J, V6, P42
  • [7] Drowsy caches: Simple techniques for reducing leakage power
    Flautner, K
    Kim, NS
    Martin, S
    Blaauw, D
    Mudge, T
    [J]. 29TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2002, : 148 - 157
  • [8] Ghose K., 1999, Proceedings. 1999 International Symposium on Low Power Electronics and Design (Cat. No.99TH8477), P70, DOI 10.1109/LPE.1999.799412
  • [9] Analysis of dual-VT SRAM cells with full-swing single-ended bit line sensing for on-chip cache
    Hamzaoglu, F
    Ye, YB
    Keshavarzi, A
    Zhang, K
    Narendra, S
    Borkar, S
    Stan, M
    De, V
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2002, 10 (02) : 91 - 95
  • [10] Dynamic fine-grain leakage reduction using leakage-biased bitlines
    Heo, S
    Barr, K
    Hampton, M
    Asanovic, K
    [J]. 29TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2002, : 137 - 147