Banshee: Bandwidth-Efficient DRAM Caching via Software/Hardware Cooperation

被引:41
|
作者
Yu, Xiangyao [1 ]
Hughes, Christopher J. [2 ]
Satish, Nadathur [2 ]
Mutlu, Onur [3 ]
Devadas, Srinivas [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Intel Labs, Santa Clara, CA USA
[3] Swiss Fed Inst Technol, Zurich, Switzerland
来源
50TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO) | 2017年
关键词
DRAM Cache; Main Memory; In-Package DRAM; Hybrid Memory Systems; TLB Coherence; Cache Replacement;
D O I
10.1145/3123939.3124555
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Placing the DRAM in the same package as a processor enables several times higher memory bandwidth than conventional offpackage DRAM. Yet, the latency of in-package DRAM is not appreciably lower than that of off-package DRAM. A promising use of in-package DRAM is as a large cache. Unfortunately, most previous DRAM cache designs optimize mainly for cache hit latency and do not consider bandwidth efficiency as a first-class design constraint. Hence, as we show in this paper, these designs are suboptimal for use with in-package DRAM. We propose a new DRAM cache design, Banshee, that optimizes for both in-package and off-package DRAM bandwidth efficiency without degrading access latency. Banshee is based on two key ideas. First, it eliminates the tag lookup overhead by tracking the contents of the DRAM cache using TLBs and page table entries, which is efficiently enabled by a new lightweight TLB coherence protocol we introduce. Second, it reduces unnecessary DRAM cache replacement traffic with a new bandwidth-aware frequency-based replacement policy. Our evaluations show that Banshee significantly improves performance (15% on average) and reduces DRAM traffic (35.8% on average) over the best-previous latency-optimized DRAM cache design.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 24 条
  • [1] TicToc: Enabling Bandwidth-Efficient DRAM Caching for both Hits and Misses in Hybrid Memory Systems
    Young, Vinson
    Chishti, Zeshan A.
    Qureshi, Moinuddin K.
    2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019), 2019, : 341 - 349
  • [2] To Update or Not To Update?: Bandwidth-Efficient Intelligent Replacement Policies for DRAM Caches
    Young, Vinson
    Qureshi, Moinuddin K.
    2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019), 2019, : 119 - 128
  • [3] Bandwidth-efficient code design for coordinate interleaved coded cooperation
    Oruc, O.
    Aygolu, U.
    IET COMMUNICATIONS, 2009, 3 (09) : 1509 - 1519
  • [4] Bandwidth-Efficient Multi-user Cooperation with Optimal Transmission
    Huang, Ying
    Gao, Yong
    Lei, Jing
    Wei, Jibo
    2011 7TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING (WICOM), 2011,
  • [5] Scalable and Bandwidth-Efficient Multicast for Software-Defined Networks
    Huang, Liang-Hao
    Hung, Hui-Ju
    Lin, Chih-Chung
    Yang, De-Nian
    2014 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2014), 2014, : 1890 - 1896
  • [6] Adaptively Reduced DRAM Caching for Energy-Efficient High Bandwidth Memory
    Behnam, Payman
    Bojnordi, Mahdi Nazm
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (10) : 2675 - 2686
  • [7] Bandwidth-efficient Hardware-Based Volume Rendering for Large Unstructured Meshes
    Carrard, Thierry
    Juliachs, Manuel
    WSCG 2006: FULL PAPERS PROCEEDINGS: 14TH INTERNATIONAL CONFERENCE IN CENTRAL EUROPE ON COMPUTER GRAPHICS, VISUALIZATION AND COMPUTER VISION 2006, 2006, : 169 - 176
  • [8] EFastLane: Toward Bandwidth-Efficient Flow Setup in Software-Defined Networking
    Xu, Jianfeng
    Wang, Liming
    Xu, Zhen
    2019 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2019,
  • [9] A bandwidth-efficient coded user-cooperation scheme for flat block fading channels
    Ho-Van, Khuong
    Le-Ngoc, Tho
    2007 FOURTH INTERNATIONAL SYMPOSIUM ON WIRELESS COMMUNICATION SYSTEMS, VOLS 1 AND 2, 2007, : 109 - 113
  • [10] Bandwidth-efficient space-time coded cooperation for resource-constrained networks
    Kim, Jee-Hoon
    Vasilakos, Athanasios V.
    Song, Hyoung-Kyu
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2011, 11 (10): : 1358 - 1365