An ADC-Less RRAM-Based Computing-in-Memory Macro With Binary CNN for Efficient Edge AI

被引:17
作者
Li, Yi [1 ,2 ]
Chen, Jia [3 ]
Wang, Linfang [1 ,2 ]
Zhang, Woyu [1 ,2 ]
Guo, Zeyu [1 ,2 ]
Wang, Jun [1 ,2 ]
Han, Yongkang [4 ]
Li, Zhi [1 ,2 ]
Wang, Fei [1 ,2 ]
Dou, Chunmeng [1 ,2 ]
Xu, Xiaoxin [1 ,2 ]
Yang, Jianguo [1 ,2 ]
Wang, Zhongrui [3 ,5 ]
Shang, Dashan [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] ACCESS AI Chip Ctr Emerging Smart Syst, InnoHK Ctr, Hong Kong Sci Pk, Hong Kong, Peoples R China
[4] Zhejiang Lab, Hangzhou 311121, Peoples R China
[5] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Inverters; Convolutional neural networks; Hardware; Software; Kernel; Resistance; Edge computing; RRAM; hardware-implementation; binary CNN; computing-in-memory; ADC free; energy-efficient;
D O I
10.1109/TCSII.2022.3233396
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Resistive random-access memory (RRAM) based non-volatile computing-in-memory (nvCIM) has been regarded as a promising solution to enable efficient data-intensive artificial intelligence (AI) applications on resource-limited edge systems. However, existing weighted-current summation-based nvCIM suffers from device non-idealities and significant time, storage, and energy overheads when processing high-precision analog signals. To address these issues, we propose a 3T2R digital nvCIM macro for a fully hardware-implemented binary convolutional neural network (HBCNN), focusing on accelerating edge AI applications at low weight precision. By quantizing the voltage-division results of RRAMs through inverters, the 3T2R macro provides a stable rail-to-rail output without analog-to-digital converters or sensing amplifiers. Moreover, both batch normalization and sign activation are integrated on-chip. The hybrid simulation results show that the proposed 3T2R digital macro achieves an 86.2% (95.6%) accuracy on the CIFAR-10 (MNIST) dataset, corresponding to a 4.7% (1.9%) accuracy loss compared to the software baselines, which also feature a peak energy efficiency of 51.3 TOPS/W and a minimum latency of 8 ns, realizing an energy-efficient, low-latency, and robust AI processor.
引用
收藏
页码:1871 / 1875
页数:5
相关论文
共 15 条
  • [1] Compact Modeling of RRAM Devices and Its Applications in 1T1R and 1S1R Array Design
    Chen, Pai-Yu
    Yu, Shimeng
    [J]. IEEE TRANSACTIONS ON ELECTRON DEVICES, 2015, 62 (12) : 4022 - 4028
  • [2] An 89TOPS/W and 16.3TOPS/mm2 All-Digital SRAM-Based Full-Precision Compute-In Memory Macro in 22nm for Machine-Learning Edge Applications
    Chih, Yu-Der
    Lee, Po-Hao
    Fujiwara, Hidehiro
    Shih, Yi-Chun
    Lee, Chia-Fu
    Naous, Rawan
    Chen, Yu-Lin
    Lo, Chieh-Pu
    Lu, Cheng-Han
    Mori, Haruki
    Zhao, Wei-Cheng
    Sun, Dar
    Sinangil, Mahmut E.
    Chen, Yen-Huei
    Chou, Tan-Li
    Akarvardar, Kerem
    Liao, Hung-Jen
    Wang, Yih
    Chang, Meng-Fan
    Chang, Tsung-Yung Jonathan
    [J]. 2021 IEEE INTERNATIONAL SOLID-STATE CIRCUITS CONFERENCE (ISSCC), 2021, 64 : 252 - +
  • [3] Courbariaux M, 2016, Arxiv, DOI arXiv:1602.02830
  • [4] In-memory computing with resistive switching devices
    Ielmini, Daniele
    Wong, H. -S. Philip
    [J]. NATURE ELECTRONICS, 2018, 1 (06): : 333 - 343
  • [5] Ioffe S, 2015, Arxiv, DOI [arXiv:1502.03167, DOI 10.48550/ARXIV.1502.03167]
  • [6] Resistive random-access memory based on ratioed memristors
    Lastras-Montano, Miguel Angel
    Cheng, Kwang-Ting
    [J]. NATURE ELECTRONICS, 2018, 1 (08): : 466 - 472
  • [7] A Review of Binarized Neural Networks
    Simons, Taylor
    Lee, Dah-Jye
    [J]. ELECTRONICS, 2019, 8 (06)
  • [8] Sun XY, 2018, DES AUT TEST EUROPE, P1423, DOI 10.23919/DATE.2018.8342235
  • [9] STT-BNN: A Novel STT-MRAM In-Memory Computing Macro for Binary Neural Networks
    Thi-Nhan Pham
    Quang-Kien Trinh
    Chang, Ik-Joon
    Alioto, Massimo
    [J]. IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2022, 12 (02) : 569 - 579
  • [10] Efficient and Robust Nonvolatile Computing-In-Memory Based on Voltage Division in 2T2R RRAM With Input-Dependent Sensing Control
    Wang, Linfang
    Ye, Wang
    Dou, Chunmeng
    Si, Xin
    Xu, Xiaoxin
    Liu, Jing
    Shang, Dashan
    Gao, Jianfeng
    Zhang, Feng
    Liu, Yongpan
    Chang, Meng-Fan
    Liu, Qi
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (05) : 1640 - 1644