Using dynamic cache management techniques to reduce energy in general purpose processors

被引:17
作者
Bellas, NE [1 ]
Hajj, IN
Polychronopoulos, CD
机构
[1] Motorola Inc, DigitalDNA Syst Architecture Lab, Schaumburg, IL 60195 USA
[2] Univ Illinois, Dept Elect & Comp Engn, Coordinated Sci Lab, Urbana, IL 61801 USA
关键词
low-power-design; memory; performance-tradeoffs; system-level;
D O I
10.1109/92.902264
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The memory hierarchy of high-performance and embedded processors has been shown to be one of the major energy consumers. For example, the Level-1 (L1) instruction cache (I-Cache) of the StrongARM processor accounts for 27% of the power dissipation of the whole chip [1], whereas the instruction fetch unit (IFU) and the I-Cache of Intel's Pentium Pro processor are the single most important power consuming modules with 14% of the total power dissipation [2]. Extrapolating current trends, this portion is likely to increase in the near future, since the devices devoted to the caches occupy an increasingly larger percentage of the total area of the chip, In this paper, we propose a technique that uses an additional mini cache, the L0-Cache, located between the I-Cache and the CPU core. This mechanism can provide the instruction stream to the data path and, when managed properly, it can effectively eliminate the need for high utilization of the more expensive I-Cache. We propose, implement, and evaluate five techniques for dynamic analysis of the program instruction access behavior, which is then used to proactively guide the access of the L0-Cache. The basic idea is that only the most frequently executed portions of the code should be stored in the LO-Cache since this is where the program spends most of its time. We present experimental results to evaluate the effectiveness of our scheme in terms of performance and energy dissipation for a series of SPEC95 benchmarks. We also discuss the performance and energy tradeoffs that are involved in these dynamic schemes. Results for these benchmarks indicate that more than 60% of the dissipated energy in the I-Cache subsystem can be saved.
引用
收藏
页码:693 / 708
页数:16
相关论文
共 28 条
[1]  
Bahar RI, 1998, 1998 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN - PROCEEDINGS, P64, DOI 10.1109/LPE.1998.708157
[2]   Instruction buffering to reduce power in processors for signal processing [J].
Bajwa, RS ;
Hiraki, M ;
Kojima, H ;
Gorny, DJ ;
Nitta, K ;
Shridhar, A ;
Seki, K ;
Sasaki, K .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 1997, 5 (04) :417-424
[3]  
Bellas N, 1999, ISCAS '99: PROCEEDINGS OF THE 1999 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 6, P198, DOI 10.1109/ISCAS.1999.780129
[4]  
Diguet JP, 1997, 1997 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, PROCEEDINGS, P30, DOI 10.1109/LPE.1997.621203
[5]  
DOBBERPUHL D, 1998, P INT S LOW POW EL D, P11
[6]  
Fromm R, 1997, ACM COMP AR, P327, DOI 10.1145/384286.264214
[7]   Energy dissipation in general purpose microprocessors [J].
Gonzalez, R ;
Horowitz, M .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1996, 31 (09) :1277-1284
[8]   Confidence estimation for speculation control [J].
Grunwald, D ;
Klauser, A ;
Manne, S ;
Pleszkun, A .
25TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 1998, :122-131
[9]  
HENNESY JL, 1996, COMPUTER ARCHITECTUR
[10]   Assigning confidence to conditional branch predictions [J].
Jacobsen, E ;
Rotenberg, E ;
Smith, JE .
PROCEEDINGS OF THE 29TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE - MICRO-29, 1996, :142-152