High-Performance STT-MRAM-Based Computing-in-Memory Scheme Utilizing Data Read Feature

被引:1
作者
Wu, Bi [1 ,2 ]
Liu, Kai [1 ,2 ]
Yu, Tianyang [1 ,2 ]
Zhu, Haonan [1 ,2 ]
Chen, Ke [1 ,2 ]
Yan, Chenggang [1 ,2 ]
Deng, Erya [1 ,2 ]
Liu, Weiqiang [1 ,2 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Coll Integrated Circuits, Nanjing 211106, Peoples R China
[2] Minist Ind & Informat Technol, Key Lab Aerosp Integrated Circuits & Microsyst, Nanjing 211106, Peoples R China
基金
中国国家自然科学基金;
关键词
Computing-in-memory; STT-MRAM; boolean logics; BNN; XNOR-PC; SPINTRONIC PROCESSING UNIT; SPIN;
D O I
10.1109/TNANO.2023.3336910
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the development of Artificial Intelligence (AI) and Binary neural networks (BNN), the computing efficiency of the computing system is expected to be much better, however, enormous amounts of data processing have caused an intolerable 'memory wall' and 'power wall' challenge for traditional Von Neumann architectures. Therefore, more advanced Computing-in-memory (CiM) architectures are proposed. The emerging non-volatile memory Spin Transfer Torque Magnetic Random Access Memory (STT-MRAM), with its fast access speed, near-zero leakage power consumption and high density is one of the most competitive carriers for CiM architectures.Due to the inherent read-write asymmetry of STT-MRAM, the read overhead is multiples smaller than the exaggerated write overhead. To better exploit the potential of MRAM, we focus on implementing an CiM based on the reading process.This work introduces the principle of CiM and proposes four basic logic operations (XNOR, XOR, AND and OR) based on STT-MRAM. Furthermore, a delay-based XNOR-popcount (XNOR-PC) operation for BNN based on XNOR operation is proposed. Incorporating the reading characteristics of STT-MRAM and slight modifications to the peripheral circuitry, these operations achieve significant optimisation in terms of performance and energy consumption. Simulation results show the proposed scheme can reduce the latency of XOR, AND, and OR operations at least by 99.3%, 82.2% and 80.2% compared with the existing design. The proposed XNOR-PC operation achieves at least a 59.4% reduction in power consumption and has a high degree of reliability. Also, Monte Carlo simulations prove the feasibility and robustness of the schemes.
引用
收藏
页码:817 / 826
页数:10
相关论文
共 27 条
  • [1] Compute Caches
    Aga, Shaizeen
    Jeloka, Supreet
    Subramaniyan, Arun
    Narayanasamy, Satish
    Blaauw, David
    Das, Reetuparna
    [J]. 2017 23RD IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2017, : 481 - 492
  • [2] In-Memory Low-Cost Bit-Serial Addition Using Commodity DRAM Technology
    Ali, Mustafa E.
    Jaiswal, Akhilesh
    Roy, Kaushik
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2020, 67 (01) : 155 - 165
  • [3] Bian ZJ., 2021, P IEEE INT C INT CIR, P141, DOI 10.1109/ICTA53157.2021.9661808
  • [4] Cai H., 2023, IEEE INT SOLID STATE, P500
  • [5] A survey of in-spin transfer torque MRAM computing
    Cai, Hao
    Liu, Bo
    Chen, Juntong
    Naviner, Lirida
    Zhou, Yongliang
    Wang, Zhen
    Yang, Jun
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (06)
  • [6] PXNOR-BNN: In/With Spin-Orbit Torque MRAM Preset-XNOR Operation-Based Binary Neural Networks
    Chang, Liang
    Ma, Xin
    Wang, Zhaohao
    Zhang, Youguang
    Xie, Yuan
    Zhao, Weisheng
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2019, 27 (11) : 2668 - 2679
  • [7] Hosomi M, 2005, INT EL DEVICES MEET, P473
  • [8] A crossbar array of magnetoresistive memory devices for in-memory computing
    Jung, Seungchul
    Lee, Hyungwoo
    Myung, Sungmeen
    Kim, Hyunsoo
    Yoon, Seung Keun
    Kwon, Soon-Wan
    Ju, Yongmin
    Kim, Minje
    Yi, Wooseok
    Han, Shinhee
    Kwon, Baeseong
    Seo, Boyoung
    Lee, Kilho
    Koh, Gwan-Hyeob
    Lee, Kangho
    Song, Yoonjong
    Choi, Changkyu
    Ham, Donhee
    Kim, Sang Joon
    [J]. NATURE, 2022, 601 (7892) : 211 - +
  • [9] A Novel Sensing Circuit for Deep Submicron Spin Transfer Torque MRAM (STT-MRAM)
    Kim, Jisu
    Ryu, Kyungho
    Kang, Seung H.
    Jung, Seong-Ook
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2012, 20 (01) : 181 - 186
  • [10] RADAR: A Fast and Energy-Efficient Programming Technique for Multiple Bits-Per-Cell RRAM Arrays
    Le, Binh Q.
    Levy, Akash
    Wu, Tony F.
    Radway, Robert M.
    Hsieh, E. Ray
    Zheng, Xin
    Nelson, Mark
    Raina, Priyanka
    Wong, H. -S. Philip
    Wong, Simon
    Mitra, Subhasish
    [J]. IEEE TRANSACTIONS ON ELECTRON DEVICES, 2021, 68 (09) : 4397 - 4403