Exploiting In-Memory Data Patterns for Performance Improvement on Crossbar Resistive Memory

被引:3
作者
Wen, Wen [1 ]
Zhao, Lei [2 ]
Zhang, Youtao [2 ]
Yang, Jun [1 ]
机构
[1] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15261 USA
[2] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
基金
美国国家科学基金会;
关键词
Computer architecture; Microprocessors; Resistance; Random access memory; Correlation; Switches; Nonvolatile memory; Crossbar array; data pattern; resistive memory (ReRAM); write performance; DEVICE; ENERGY; TECHNOLOGY; CHALLENGES; FUTURE; MODEL; ARRAY; WRITE;
D O I
10.1109/TCAD.2019.2940685
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Resistive memory (ReRAM) has emerged as a promising nonvolatile memory technology that may replace a significant portion of DRAM in future computer systems. ReRAM has many advantages, such as high density, low standby power, and good scalability. When adopting crossbar architecture, ReRAM cell can achieve the smallest theoretical size in fabrication, which is ideal for constructing dense memory with large capacity. However, crossbar cell structure suffers from a variety of reliability issues, which come from large voltage drops on long wires. To ensure operation reliability, ReRAM writes conservatively use the worst-case access latency of all cells in ReRAM arrays, which leads to significant performance degradation and dynamic energy waste. In this article, we study the correlation between the ReRAM cell switching latency and the number of cells in low-resistance state (LRS) along bitlines, and propose to dynamically speed up write operations based on bitline data patterns, i.e., the number of LRS cells presented in bitlines. We leverage the intrinsic in-memory processing capability of ReRAM crossbar and propose a low-overhead runtime profiler that effectively tracks the data patterns in different bitlines. To achieve further write latency reduction, we employ data compression and row address dependent memory data layout to reduce the numbers of LRS cells on bitlines. Moreover, we further present two optimization techniques, i.e., selective profiling and fine-grained profiling, to mitigate energy overhead brought by bitline data patterns tracking. The experimental results show that, on average, our design improves system performance by 20.5% and 14.2%, and reduces memory dynamic energy by 20.3% and 12.6%, compared to the baseline and the state-of-the-art crossbar design, respectively.
引用
收藏
页码:2347 / 2360
页数:14
相关论文
共 67 条
  • [1] RAPS: Restore-Aware Policy Selection for STT-MRAM-Based Main Memory Under Read Disturbance
    Aboutalebi, Armin Haj
    Duan, Lide
    [J]. 2017 IEEE 35TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2017, : 625 - 632
  • [2] Alameldeen AlaaR., 2004, FREQUENT PATTERN COM
  • [3] BioBench: A benchmark suite of bioinformatics applications
    Albayraktaroglu, K
    Jaleel, A
    Wu, X
    Franklin, M
    Jacob, B
    Tseng, CW
    Yeung, D
    [J]. ISPASS 2005: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2005, : 2 - 9
  • [4] [Anonymous], 2017, P 2017 IEEE 6 NONVOL, DOI DOI 10.1109/NVMSA.2017.8064464
  • [5] [Anonymous], 2011, P 5 ANN WORKSH MOD B
  • [6] Thermal-aware Optimizations of ReRAM-based Neuromorphic Computing Systems
    Beigi, Majed Valad
    Memik, Gokhan
    [J]. 2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [7] Bojnordi MN, 2016, INT S HIGH PERF COMP, P1, DOI 10.1109/HPCA.2016.7446049
  • [8] Bruel P., 2017, Rebooting Computing (ICRC), 2017 IEEE International Conference on, P1
  • [9] Calkins H, 2017, J ARRYTHM, V33, P369, DOI 10.1016/j.joa.2017.08.001
  • [10] Challenges and Circuit Techniques for Energy-Efficient On-Chip Nonvolatile Memory Using Memristive Devices
    Chang, Meng-Fan
    Lee, Albert
    Chen, Pin-Cheng
    Lin, Chrong Jung
    King, Ya-Chin
    Sheu, Shyh-Shyuan
    Ku, Tzu-Kun
    [J]. IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2015, 5 (02) : 183 - 193