Architecting Selective Refresh based Multi-Retention Cache for Heterogeneous System (ARMOUR)

被引:3
作者
Agarwal, Sukarn [1 ]
Chakraborty, Shounak [2 ]
Sjalander, Magnus [2 ]
机构
[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[2] Norwegian Univ Sci & Technol, Trondheim, Norway
来源
2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC | 2023年
基金
英国工程与自然科学研究理事会;
关键词
STT-RAM; NVM; retention time; multi-retention cache; heterogeneous system; GPU; energy efficiency;
D O I
10.1109/DAC56929.2023.10247878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing use of chiplets, and the demand for high-performance yet low-power systems, will result in heterogeneous systems that combine both CPUs and accelerators (e.g., general-purpose GPUs). Chiplet based designs also enable the inclusion of emerging memory technologies, since such technologies can reside on a separate chiplet without requiring complex integration in existing high-performance process technologies. One such emerging memory technology is spin-transfer torque (STT) memory, which has the potential to replace SRAM as the last-level cache (LLC). STT-RAM has the advantage of high density, non-volatility, and reduced leakage power, but suffers from a higher write latency and energy, as compared to SRAM. However, by relaxing the retention time, the write latency and energy can be reduced at the cost of the STT-RAM becoming more volatile. The retention time and write latency/energy can be traded against each other by creating an LLC with multiple retention zones. With a multi-retention LLC, the challenge is to direct the memory accesses to the most advantageous zone, to optimize for overall performance and energy efficiency. We propose ARMOUR, a mechanism for efficient management of memory accesses to a multi-retention LLC, where based on the initial requester (CPU or GPU) the cache blocks are allocated in the high (CPU) or low (GPU) retention zone. Furthermore, blocks that are about to expire are either refreshed (CPU) or written back (GPU). In addition, ARMOUR evicts CPU blocks with an estimated short lifetime, which further improves cache performance by reducing cache pollution. Our evaluation shows that ARMOUR improves average performance by 28.9% compared to a baseline STT-RAM based LLC and reduces the energy-delay product (EDP) by 74.5% compared to an iso-area SRAM LLC.
引用
收藏
页数:6
相关论文
共 7 条
  • [1] A Novel Hybrid Last Level Cache Based on Multi-retention STT-RAM Cells
    Zhang, Hongguang
    Zhang, Minxuan
    Zhao, Zhenyu
    Tian, Shuo
    ADVANCED COMPUTER ARCHITECTURE, ACA 2016, 2016, 626 : 28 - 39
  • [2] TEEMO: Temperature Aware Energy Efficient Multi-Retention STT-RAM Cache Architecture
    Agarwal, Sukarn
    Chakraborty, Shounak
    Sjalander, Magnus
    PROCEEDINGS 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS 2024, 2024, : 852 - 864
  • [3] ABACa: Access Based Allocation on Set Wise Multi-Retention in STT-RAM Last Level Cache
    Agarwal, Sukarn
    Chakraborty, Shounak
    2021 IEEE 32ND INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2021), 2021, : 171 - 174
  • [4] DAMUS: Dynamic Allocation based on Write Frequency in MUlti-Retention STT-RAM based Last Level Caches
    Baranwal, Mayank
    Chugh, Udbhav
    Dalal, Shivang
    Agarwal, Sukarn
    Kapoor, Hemangee K.
    PROCEEDINGS OF THE 2021 TWENTY SECOND INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2021), 2021, : 469 - 475
  • [5] Latency Sensitivity-Based Cache Partitioning for Heterogeneous Multi-core Architecture
    Wang, Po-Han
    Li, Cheng-Hsuan
    Yang, Chia-Lin
    2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,
  • [6] Pareto Front Based Realistic Soft Real-Time Task Scheduling with Multi-objective Genetic Algorithm on Arbitrary Heterogeneous Multiprocessor System
    Sedaghat, Nafiseh
    Tabatabaee-Yazdi, Hamid
    Akbarzadeh-T, Mohammad-R.
    JOURNAL OF INTERNET TECHNOLOGY, 2011, 12 (01): : 85 - 93
  • [7] Pareto Front Based Realistic Soft Real-Time Task Scheduling with Multi-objective Genetic Algorithm in Unstructured Heterogeneous Distributed System
    Sedaghat, Nafiseh
    Tabatabaee-Yazdi, Hamid
    Akbarzadeh-T, Mohammad-R
    ADVANCES IN GRID AND PERVASIVE COMPUTING, PROCEEDINGS, 2010, 6104 : 268 - +