Training of quantized deep neural networks using a magnetic tunnel junction-based synapse

被引:2
|
作者
Greenberg-Toledo, Tzofnat [1 ]
Perach, Ben [1 ]
Hubara, Itay [1 ,2 ]
Soudry, Daniel [1 ]
Kvatinsky, Shahar [1 ]
机构
[1] Technion Israel Inst Technol, Andrew & Erna Viterbi Fac Elect Engn, IL-3200003 Haifa, Israel
[2] Habana Labs Intel Co, Intel Co, Tel Aviv, Israel
基金
欧洲研究理事会;
关键词
magnetic tunnel junction; memristor; deep neural networks; quantized neural networks; MEMORY;
D O I
10.1088/1361-6641/ac251b
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Quantized neural networks (QNNs) are being actively researched as a solution for the computational complexity and memory intensity of deep neural networks. This has sparked efforts to develop algorithms that support both inference and training with quantized weight and activation values, without sacrificing accuracy. A recent example is the GXNOR framework for stochastic training of ternary and binary neural networks (TNNs and BNNs, respectively). In this paper, we show how magnetic tunnel junction (MTJ) devices can be used to support QNN training. We introduce a novel hardware synapse circuit that uses the MTJ stochastic behaviour to support the quantize update. The proposed circuit enables processing near memory (PNM) of QNN training, which subsequently reduces data movement. We simulated MTJ-based stochastic training of a TNN over the MNIST, SVHN, and CIFAR10 datasets and achieved an accuracy of 98.61% , 93.99% 83.02% , respectively (less than 1% degradation compared to the GXNOR algorithm). We evaluated the synapse array performance potential and showed that the proposed synapse circuit can train TNNs in situ, with 18.3TOPs/W 3TOPs/W for weight update.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Unconventional computing based on magnetic tunnel junction
    Baofang Cai
    Yihan He
    Yue Xin
    Zhengping Yuan
    Xue Zhang
    Zhifeng Zhu
    Gengchiau Liang
    Applied Physics A, 2023, 129
  • [32] A new method for securing binary deep neural networks against model replication attacks using magnetic tunnel junctions
    Rezayati, Mohammad Hadi
    Amirany, Abdolah
    Moaiyeri, Mohammad Hossein
    Jafari, Kian
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2025, 24 (01)
  • [33] A Compact Memristor-Based Dynamic Synapse for Spiking Neural Networks
    Hu, Miao
    Chen, Yiran
    Yang, J. Joshua
    Wang, Yu
    Li, Hai
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2017, 36 (08) : 1353 - 1366
  • [34] Evaluations on Deep Neural Networks Training Using Posit Number System
    Lu, Jinming
    Fang, Chao
    Xu, Mingyang
    Lin, Jun
    Wang, Zhongfeng
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (02) : 174 - 187
  • [35] Embedding-Based Speaker Adaptive Training of Deep Neural Networks
    Cui, Xiaodong
    Goel, Vaibhava
    Saon, George
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 122 - 126
  • [36] A Methodology to Design Quantized Deep Neural Networks for Automatic Modulation Recognition
    Goez, David
    Soto, Paola
    Latre, Steven
    Gaviria, Natalia
    Camelo, Miguel
    ALGORITHMS, 2022, 15 (12)
  • [37] Human Activity Recognition on Microcontrollers with Quantized and Adaptive Deep Neural Networks
    Daghero, Francesco
    Burrello, Alessio
    Xie, Chen
    Castellano, Marco
    Gandolfi, Luca
    Calimera, Andrea
    Macii, Enrico
    Poncino, Massimo
    Pagliari, Daniele Jahier
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2022, 21 (04)
  • [38] An Optimization Strategy for Deep Neural Networks Training
    Wu, Tingting
    Zeng, Peng
    Song, Chunhe
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 596 - 603
  • [39] A Charge-Domain Design of Ferroelectric Tunneling Junction Synapse for Spiking Neural Networks
    Zhu, Xiaobao
    Feng, Ning
    Qiu, Jiajun
    Wang, Xianyu
    Zeng, Min
    Wu, Yanqing
    Zhang, Lining
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2025, 72 (04) : 1730 - 1737
  • [40] A Novel Compound Synapse Using Probabilistic Spin-Orbit-Torque Switching for MTJ-Based Deep Neural Networks
    Ostwal, Vaibhav
    Zand, Ramtin
    Demara, Ronald
    Appenzeller, Joerg
    IEEE JOURNAL ON EXPLORATORY SOLID-STATE COMPUTATIONAL DEVICES AND CIRCUITS, 2019, 5 (02): : 182 - 187