Best of both latency and throughput

被引:42
作者
Grochowski, E [1 ]
Ronen, R [1 ]
Shen, J [1 ]
Wang, H [1 ]
机构
[1] Intel Corp, Santa Clara, CA 95054 USA
来源
IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS & PROCESSORS, PROCEEDINGS | 2004年
关键词
D O I
10.1109/ICCD.2004.1347928
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes the tradeoff between latency performance and throughput performance in a power-constrained environment. We show that the key to achieving both excellent latency performance as well as excellent throughput performance is to dynamically vary the amount of energy expended to process instructions according to the amount of parallelism available in the software. We survey four techniques for achieving variable energy per instruction: voltage/frequency scaling, asymmetric cores, variable-size cores, and speculation control. We estimate the potential range of energies obtainable by each technique and conclude that a combination of asymmetric cores and voltage/frequency scaling offers the most promising approach to designing a chip-level multiprocessor that can achieve both excellent latency performance and excellent throughput performance.
引用
收藏
页码:236 / 243
页数:8
相关论文
共 27 条
  • [1] ALBONESI DH, 2003, DYNAMICALLY TUNING P, V36
  • [2] ALBONESI DH, 1999, P 32 ANN INT S MICR
  • [3] *AMD, TECHNOLOGY
  • [4] [Anonymous], 2001, LONGRUN POWER MANAGE
  • [5] ARAGON JL, 2003, P 9 INT S HIGH PERF
  • [6] BAHAR RI, 2001, P 28 ANN INT S COMP
  • [7] BANIASADI A, 2002, P IEEE INT C COMP DE
  • [8] BARROSO L, 2000, P 27 INT S COMP ARCH
  • [9] Brooks D., 2001, 7 INT S HIGH PERF CO
  • [10] BUYUKTOSUNOGLU A, 2003, P 30 ANN INT S COMP