STT-RAM-Based Hierarchical in-Memory Computing

被引:0
作者
Gajaria, Dhruv [1 ]
Gomez, Kevin Antony [2 ]
Adegbija, Tosiron [1 ]
机构
[1] Univ Arizona, Dept Elect & Comp Engn, Tucson, AZ 85721 USA
[2] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
基金
美国国家科学基金会;
关键词
Random access memory; In-memory computing; Nonvolatile memory; Computer architecture; Resistance; Microprocessors; Magnetization; In-cache computing; in-memory computing; relaxed retention time; STT-RAM; PERFORMANCE;
D O I
10.1109/TPDS.2024.3430853
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In-memory computing promises to overcome the von Neumann bottleneck in computer systems by performing computations directly within the memory. Previous research has suggested using Spin-Transfer Torque RAM (STT-RAM) for in-memory computing due to its non-volatility, low leakage power, high density, endurance, and commercial viability. This paper explores hierarchical in-memory computing, where different levels of the memory hierarchy are augmented with processing elements to optimize workload execution. The paper investigates processing in memory (PiM) using non-volatile STT-RAM and processing in cache (PiC) using volatile STT-RAM with relaxed retention, which helps mitigate STT-RAM's write latency and energy overheads. We analyze tradeoffs and overheads associated with data movement for PiC versus write overheads for PiM using STT-RAMs for various workloads. We examine workload characteristics, such as computational intensity and CPU-dependent workloads with limited instruction-level parallelism, and their impact on PiC/PiM tradeoffs. Using these workloads, we evaluate computing in STT-RAM versus SRAM at different cache hierarchy levels and explore the potential of heterogeneous STT-RAM cache architectures with various retention times for PiC and CPU-based computing. Our experiments reveal significant advantages of STT-RAM-based PiC over PiM for specific workloads. Finally, we describe open research problems in hierarchical in-memory computing architectures to further enhance this paradigm.
引用
收藏
页码:1615 / 1629
页数:15
相关论文
共 33 条
[31]   STT-RAM Cache Hierarchy With Multiretention MTJ Designs [J].
Sun, Zhenyu ;
Bi, Xiuyuan ;
Li, Hai ;
Wong, Weng-Fai ;
Zhu, Xiaochun .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2014, 22 (06) :1281-1293
[32]   Low-power non-volatile spintronic memory: STT-RAM and beyond [J].
Wang, K. L. ;
Alzate, J. G. ;
Amiri, P. Khalili .
JOURNAL OF PHYSICS D-APPLIED PHYSICS, 2013, 46 (07)
[33]   Infinity Stream: Enabling Transparent and Automated In-Memory Computing [J].
Wang, Zhengrong ;
Liu, Christopher ;
Nowatzki, Tony .
IEEE COMPUTER ARCHITECTURE LETTERS, 2022, 21 (02) :85-88