Dynamic resizing of superscalar datapath components for energy efficiency

被引:14
作者
Ponomarev, D [1 ]
Kucuk, G
Ghose, K
机构
[1] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
[2] Yeditepe Univ, Dept Comp Engn, TR-34755 Istanbul, Turkey
基金
美国国家科学基金会;
关键词
superscalar processor; energy-efficient datapath; power reduction; dynamic instruction scheduling;
D O I
10.1109/TC.2006.23
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The "one-size-fits-all" philosophy used for permanently allocating datapath resources in today's superscalar CPUs to maximize performance across a wide range of applications results in the overcommitment of resources in general. To reduce power dissipation in the datapath, the resource allocations can be dynamically adjusted based on the demands of applications. We propose a mechanism to dynamically, simultaneously, and independently adjust the sizes of the issue queue (IQ), the reorder buffer (ROB), and the load/store queue (LSQ) based on the periodic sampling of their occupancies to achieve significant power savings with minimal impact on performance. Resource upsizing is done more aggressively (compared to downsizing), using the relative rate of blocked dispatches to limit the performance penalty. Our results are validated by the execution of the SPEC 2000 benchmark suite on a substantially modified version of the Simplescalar simulator, where the IQ, the ROB, the LSQ, and the register files are implemented as separate structures, as is the case with most practical implementations. We also use actual VLSI layouts of the datapath components in a 0.18 micron process to accurately measure the energy dissipations for each type of access. For a 4-way superscalar CPU, an average power savings of about 42 percent within the IQ, 74 percent within the ROB (integrating the register file), and 41 percent within the LSQ can be achieved with an average performance penalty of about 5 percent.
引用
收藏
页码:199 / 213
页数:15
相关论文
共 27 条
  • [1] ALBONESI DH, 1999, P INT S MICR
  • [2] [Anonymous], P WORKSH COMPL EFF D
  • [3] Bahar RI, 2001, ACM COMP AR, P218, DOI 10.1109/ISCA.2001.937451
  • [4] BALASUBRAMONIAN R, 2000, P 34 INT S MICR
  • [5] BHANDARKAR DP, 1996, ALPHA IMPLEMENTATION
  • [6] BROOKS D, 2000, P 27 INT S COMP ARCH
  • [7] Brooks D., 2001, P 7 INT S HIGH PERF
  • [8] BURGER D., 1997, SIMPLESCALAR TOOL SE
  • [9] BUYUKTOSUNOGLU A, 2001, P GREAT LAKES S VLSI
  • [10] Buyuktosunoglu A., 2000, P WORKSH POW AW COMP