共 33 条
STT-RAM-Based Hierarchical in-Memory Computing
被引:0
作者:
Gajaria, Dhruv
[1
]
Gomez, Kevin Antony
[2
]
Adegbija, Tosiron
[1
]
机构:
[1] Univ Arizona, Dept Elect & Comp Engn, Tucson, AZ 85721 USA
[2] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
基金:
美国国家科学基金会;
关键词:
Random access memory;
In-memory computing;
Nonvolatile memory;
Computer architecture;
Resistance;
Microprocessors;
Magnetization;
In-cache computing;
in-memory computing;
relaxed retention time;
STT-RAM;
PERFORMANCE;
D O I:
10.1109/TPDS.2024.3430853
中图分类号:
TP301 [理论、方法];
学科分类号:
081202 ;
摘要:
In-memory computing promises to overcome the von Neumann bottleneck in computer systems by performing computations directly within the memory. Previous research has suggested using Spin-Transfer Torque RAM (STT-RAM) for in-memory computing due to its non-volatility, low leakage power, high density, endurance, and commercial viability. This paper explores hierarchical in-memory computing, where different levels of the memory hierarchy are augmented with processing elements to optimize workload execution. The paper investigates processing in memory (PiM) using non-volatile STT-RAM and processing in cache (PiC) using volatile STT-RAM with relaxed retention, which helps mitigate STT-RAM's write latency and energy overheads. We analyze tradeoffs and overheads associated with data movement for PiC versus write overheads for PiM using STT-RAMs for various workloads. We examine workload characteristics, such as computational intensity and CPU-dependent workloads with limited instruction-level parallelism, and their impact on PiC/PiM tradeoffs. Using these workloads, we evaluate computing in STT-RAM versus SRAM at different cache hierarchy levels and explore the potential of heterogeneous STT-RAM cache architectures with various retention times for PiC and CPU-based computing. Our experiments reveal significant advantages of STT-RAM-based PiC over PiM for specific workloads. Finally, we describe open research problems in hierarchical in-memory computing architectures to further enhance this paradigm.
引用
收藏
页码:1615 / 1629
页数:15
相关论文