Banshee: Bandwidth-Efficient DRAM Caching via Software/Hardware Cooperation

被引:41
|
作者
Yu, Xiangyao [1 ]
Hughes, Christopher J. [2 ]
Satish, Nadathur [2 ]
Mutlu, Onur [3 ]
Devadas, Srinivas [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Intel Labs, Santa Clara, CA USA
[3] Swiss Fed Inst Technol, Zurich, Switzerland
来源
50TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO) | 2017年
关键词
DRAM Cache; Main Memory; In-Package DRAM; Hybrid Memory Systems; TLB Coherence; Cache Replacement;
D O I
10.1145/3123939.3124555
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Placing the DRAM in the same package as a processor enables several times higher memory bandwidth than conventional offpackage DRAM. Yet, the latency of in-package DRAM is not appreciably lower than that of off-package DRAM. A promising use of in-package DRAM is as a large cache. Unfortunately, most previous DRAM cache designs optimize mainly for cache hit latency and do not consider bandwidth efficiency as a first-class design constraint. Hence, as we show in this paper, these designs are suboptimal for use with in-package DRAM. We propose a new DRAM cache design, Banshee, that optimizes for both in-package and off-package DRAM bandwidth efficiency without degrading access latency. Banshee is based on two key ideas. First, it eliminates the tag lookup overhead by tracking the contents of the DRAM cache using TLBs and page table entries, which is efficiently enabled by a new lightweight TLB coherence protocol we introduce. Second, it reduces unnecessary DRAM cache replacement traffic with a new bandwidth-aware frequency-based replacement policy. Our evaluations show that Banshee significantly improves performance (15% on average) and reduces DRAM traffic (35.8% on average) over the best-previous latency-optimized DRAM cache design.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 24 条
  • [21] An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design (Invited Paper)
    Liu, Mingshuo
    Luo, Shiyi
    Han, Kevin
    Yuan, Bo
    DeMara, Ronald F.
    Bai, Yu
    2021 IEEE 32ND INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2021), 2021, : 77 - 84
  • [22] Software-Hardware Co-Design for Energy-Efficient Continuous Health Monitoring via Task-Aware Compression
    Wu, Di
    Zhao, Shiqi
    Yang, Jie
    Sawan, Mohamad
    IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2023, 17 (02) : 180 - 191
  • [23] Hardware/software co-design of complex embedded systems -: An approach using efficient process models, multiple formalism specification and validation via co-simulation
    Voros, NS
    Sánchez, L
    Alonso, A
    Birbas, AN
    Birbas, M
    Jerraya, A
    DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2003, 8 (01) : 5 - 49
  • [24] Hardware/Software Co-Design of Complex Embedded Systems: An Approach Using Efficient Process Models, Multiple Formalism Specification and Validation via Co-Simulation
    Nikolaos S. Voros
    Luis Sánchez
    Alejandro Alonso
    Alexios N. Birbas
    Michael Birbas
    Ahmed Jerraya
    Design Automation for Embedded Systems, 2003, 8 : 5 - 49