Fine-Grained DRAM: Energy-Efficient DRAM for Extreme Bandwidth Systems

被引:133
作者
O'Connor, Mike [1 ,2 ]
Chatterjee, Niladrish [1 ]
Lee, Donghyuk [1 ]
Wilson, John [1 ]
Agrawal, Aditya [1 ]
Keckler, Stephen W. [1 ,2 ]
Dally, William J. [1 ,3 ]
机构
[1] NVIDIA, Santa Clara, CA 95051 USA
[2] Univ Texas Austin, Austin, TX 78712 USA
[3] Stanford Univ, Stanford, CA 94305 USA
来源
50TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO) | 2017年
关键词
DRAM; Energy-Efficiency; High Bandwidth; GPU;
D O I
10.1145/3123939.3124545
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Future GPUs and other high-performance throughput processors will require multiple TB/s of bandwidth to DRAM. Satisfying this bandwidth demand within an acceptable energy budget is a challenge in these extreme bandwidth memory systems. We propose a new high-bandwidth DRAM architecture, Fine-Grained DRAM (FGDRAM), which improves bandwidth by 4x and improves the energy efficiency of DRAM by 2x relative to the highest-bandwidth, most energy-efficient contemporary DRAM, High Bandwidth Memory (HBM2). These benefits are in large measure achieved by partitioning the DRAM die into many independent units, called grains, each of which has a local, adjacent I/O. This approach unlocks the bandwidth of all the banks in the DRAM to be used simultaneously, eliminating shared buses interconnecting various banks. Furthermore, the on-DRAM data movement energy is significantly reduced due to the much shorter wiring distance between the cell array and the local I/O. This FGDRAM architecture readily lends itself to leveraging existing techniques to reducing the effective DRAM row size in an area efficient manner, reducing wasteful row activate energy in applications with low locality. In addition, when FGDRAM is paired with a memory controller optimized to exploit the additional concurrency provided by the independent grains, it improves GPU system performance by 19% over an iso-bandwidth and iso-capacity future HBM baseline. Thus, this energy-efficient, high-bandwidth FGDRAM architecture addresses the needs of future extreme-bandwidth memory systems.
引用
收藏
页码:41 / 54
页数:14
相关论文
共 44 条
  • [11] Burtscher M., 2012, 2012 IEEE International Symposium on Workload Characterization (IISWC 2012), P141, DOI 10.1109/IISWC.2012.6402918
  • [12] Cha S., 2017, P INT S HIGH PERF CO
  • [13] Chatterjee N., 2017, P INT S HIGH PERF CO
  • [14] Che SA, 2009, I S WORKL CHAR PROC, P44, DOI 10.1109/IISWC.2009.5306797
  • [15] FINE-GRAINED ACTIVATION FOR POWER REDUCTION IN DRAM
    Cooper-Balis, Elliott
    Jacob, Bruce
    [J]. IEEE MICRO, 2010, 30 (03) : 34 - 47
  • [16] Ha H, 2016, INT SYMP MICROARCH
  • [17] Harvard Q, 2011, MIDWEST SYMP CIRCUIT
  • [18] Heroux M. A, 2009, SAND20085574 SAND
  • [19] Intel, 2016, INTR MCDRAM HIGH BAN
  • [20] Recent Innovations in DRAM Manufacturing
    James, Dick
    [J]. 2010 IEEE/SEMI ADVANCED SEMICONDUCTOR MANUFACTURING CONFERENCE, 2010, : 264 - 269