Exploiting In-Memory Data Patterns for Performance Improvement on Crossbar Resistive Memory

被引：3

作者：

Wen, Wen ^{[1
]}

Zhao, Lei ^{[2
]}

Zhang, Youtao ^{[2
]}

Yang, Jun ^{[1
]}

机构：

[1] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15261 USA

[2] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA

来源：

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS | 2020年 / 39卷 / 10期

基金：

美国国家科学基金会;

关键词：

Computer architecture; Microprocessors; Resistance; Random access memory; Correlation; Switches; Nonvolatile memory; Crossbar array; data pattern; resistive memory (ReRAM); write performance; DEVICE; ENERGY; TECHNOLOGY; CHALLENGES; FUTURE; MODEL; ARRAY; WRITE;

D O I：

10.1109/TCAD.2019.2940685

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Resistive memory (ReRAM) has emerged as a promising nonvolatile memory technology that may replace a significant portion of DRAM in future computer systems. ReRAM has many advantages, such as high density, low standby power, and good scalability. When adopting crossbar architecture, ReRAM cell can achieve the smallest theoretical size in fabrication, which is ideal for constructing dense memory with large capacity. However, crossbar cell structure suffers from a variety of reliability issues, which come from large voltage drops on long wires. To ensure operation reliability, ReRAM writes conservatively use the worst-case access latency of all cells in ReRAM arrays, which leads to significant performance degradation and dynamic energy waste. In this article, we study the correlation between the ReRAM cell switching latency and the number of cells in low-resistance state (LRS) along bitlines, and propose to dynamically speed up write operations based on bitline data patterns, i.e., the number of LRS cells presented in bitlines. We leverage the intrinsic in-memory processing capability of ReRAM crossbar and propose a low-overhead runtime profiler that effectively tracks the data patterns in different bitlines. To achieve further write latency reduction, we employ data compression and row address dependent memory data layout to reduce the numbers of LRS cells on bitlines. Moreover, we further present two optimization techniques, i.e., selective profiling and fine-grained profiling, to mitigate energy overhead brought by bitline data patterns tracking. The experimental results show that, on average, our design improves system performance by 20.5% and 14.2%, and reduces memory dynamic energy by 20.3% and 12.6%, compared to the baseline and the state-of-the-art crossbar design, respectively.

引用

页码：2347 / 2360

页数：14

共 67 条

[1] RAPS: Restore-Aware Policy Selection for STT-MRAM-Based Main Memory Under Read Disturbance
Aboutalebi, Armin Haj
Duan, Lide
[J]. 2017 IEEE 35TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2017, : 625 - 632
[2] Alameldeen AlaaR., 2004, FREQUENT PATTERN COM
[3] BioBench: A benchmark suite of bioinformatics applications
Albayraktaroglu, K
Jaleel, A
Wu, X
Franklin, M
Jacob, B
Tseng, CW
Yeung, D
[J]. ISPASS 2005: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2005, : 2 - 9
[4] [Anonymous], 2017, P 2017 IEEE 6 NONVOL, DOI DOI 10.1109/NVMSA.2017.8064464
[5] [Anonymous], 2011, P 5 ANN WORKSH MOD B
[6] Thermal-aware Optimizations of ReRAM-based Neuromorphic Computing Systems
Beigi, Majed Valad
Memik, Gokhan
[J]. 2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
[7] Bojnordi MN, 2016, INT S HIGH PERF COMP, P1, DOI 10.1109/HPCA.2016.7446049
[8] Bruel P., 2017, Rebooting Computing (ICRC), 2017 IEEE International Conference on, P1
[9] Calkins H, 2017, J ARRYTHM, V33, P369, DOI 10.1016/j.joa.2017.08.001
[10] Challenges and Circuit Techniques for Energy-Efficient On-Chip Nonvolatile Memory Using Memristive Devices
Chang, Meng-Fan
Lee, Albert
Chen, Pin-Cheng
Lin, Chrong Jung
King, Ya-Chin
Sheu, Shyh-Shyuan
Ku, Tzu-Kun
[J]. IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2015, 5 (02) : 183 - 193

← 1 2 3 4 5 6 7 →