A 3D MCAM architecture based on flash memory enabling binary neural network computing for edge AI

被引:0
|
作者
Maoying BAI [1 ]
Shuhao WU [1 ]
Hai WANG [1 ]
Hua WANG [1 ]
Yang FENG [1 ]
Yueran QI [1 ]
Chengcheng WANG [1 ]
Zheng CHAI [2 ]
Tai MIN [2 ]
Jixuan WU [1 ]
Xuepeng ZHAN [1 ]
Jiezhi CHEN [1 ]
机构
[1] School of Information Science and Engineering, Shandong University
[2] Center for Spintronic and Quantum Systems, State Key Laboratory for Mechanical Behavior of Materials,School of Materials Science and Engineering, Xi'an Jiaotong
关键词
D O I
暂无
中图分类号
TP333 [存贮器]; TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The in-memory computing(IMC) architecture implemented by non-volatile memory units shows great possibilities to break the traditional von Neumann bottleneck. In this paper, a 3D IMC architecture is proposed whose unit is based on a multi-bit content-addressable memory(MCAM). The MCAM unit is comprised of two 65 nm flash memory and two transistors(2Flash2T), which is reconfigurable and multifunctional for both data write/search and XNOR logic operation. Moreover, the MCAM array can also support the population count(POPCOUNT) operation, which can be beneficial for the training and inference process in binary neural network(BNN) computing. Based on the well-known MNIST dataset, the proposed 3D MCAM architecture shows a 98.63% recognition accuracy and a 300% noise-tolerant performance without significant accuracy deterioration. Our findings can provide the potential for developing highly energy-efficient BNN computing for complex artificial intelligence(AI) tasks based on flash-based MCAM units.
引用
收藏
页码:302 / 310
页数:9
相关论文
共 50 条
  • [41] Mitigation of Accuracy Degradation in 3D Flash Memory based Approximate Nearest Neighbor Search with Binary Tree Balanced Soft Clustering for Retrieval-augmented AI
    Sasaki, Shinichi
    Aiba, Yuta
    Komano, Yusuke
    Iizuka, Takahiko
    Fujimatsu, Motohiko
    Kawasumi, Atsushi
    Miyashita, Daisuke
    Deguchi, Jun
    Maeda, Takashi
    Miyano, Shinji
    Maruyama, Tooru
    2024 22ND IEEE INTERREGIONAL NEWCAS CONFERENCE, NEWCAS 2024, 2024, : 238 - 242
  • [42] TETRIS: Scalable and Efficient Neural Network Acceleration with 3D Memory
    Gao, Mingyu
    Pu, Jing
    Yang, Xuan
    Horowitz, Mark
    Kozyrakis, Christos
    OPERATING SYSTEMS REVIEW, 2017, 51 (02) : 751 - 764
  • [43] TETRIS: Scalable and Efficient Neural Network Acceleration with 3D Memory
    Gao, Mingyu
    Pu, Jing
    Yang, Xuan
    Horowitz, Mark
    Kozyrakis, Christos
    ACM SIGPLAN NOTICES, 2017, 52 (04) : 751 - 764
  • [44] In-Memory Computing Architecture for a Convolutional Neural Network Based on Spin Orbit Torque MRAM
    Huang, Jun-Ying
    Syu, Jing-Lin
    Tsou, Yao-Tung
    Kuo, Sy-Yen
    Chang, Ching-Ray
    ELECTRONICS, 2022, 11 (08)
  • [45] TETRIS: Scalable and Efficient Neural Network Acceleration with 3D Memory
    Gao, Mingyu
    Pu, Jing
    Yang, Xuan
    Horowitz, Mark
    Kozyrakis, Christos
    TWENTY-SECOND INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXII), 2017, : 751 - 764
  • [46] TETRIS: Scalable and efficient neural network acceleration with 3D memory
    Gao M.
    Pu J.
    Yang X.
    Horowitz M.
    Kozyrakis C.
    1600, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (52): : 751 - 764
  • [47] 3D reconstruction approach based on neural network
    Hu, Haifeng
    Yang, Zhi
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 630 - +
  • [48] Binocular 3D reconstruction based on neural network
    Lin, MX
    Zhao, YR
    Guan, ZG
    Ding, FH
    Xu, QX
    Wang, XH
    ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 765 - 771
  • [49] 3D Model Classification Based on Neural Architecture Search
    Zhou, Peng
    Yang, Jun
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (05): : 722 - 733
  • [50] An Approach of 3D NAND Flash Based Nonvolatile Computing-In-Memory (nvCIM) Accelerator for Deep Neural Networks (DNNs) with Calibration and Read Disturb Analysis
    Hsu, Po-Kai
    Du, Pei-Ying
    Lo, Chieh
    Lue, Hang-Ting
    Chen, Wei-Chen
    Hsu, Tzu-Hsuan
    Yeh, Teng-Hao
    Hsieh, Chih-Chang
    Wei, Ming-Liang
    Wang, Keh-Chung
    Lu, Chih-Yuan
    2020 IEEE INTERNATIONAL MEMORY WORKSHOP (IMW 2020), 2020, : 99 - 102