LDD: High-Precision Training of Deep Spiking Neural Network Transformers Guided by an Artificial Neural Network

被引:1
|
作者
Liu, Yuqian [1 ,2 ]
Zhao, Chujie [1 ,2 ]
Jiang, Yizhou [1 ,2 ]
Fang, Ying [3 ,4 ]
Chen, Feng [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] LSBDPA Beijing Key Lab, Beijing 100084, Peoples R China
[3] Fujian Normal Univ, Coll Comp & Cyber Secur, Fuzhou 350117, Peoples R China
[4] Fujian Normal Univ, Digital Fujian Internet of Thing Lab Environm Moni, Fuzhou 350117, Peoples R China
基金
中国国家自然科学基金;
关键词
spiking neural networks (SNNs); Transformer; distillation; image classification;
D O I
10.3390/biomimetics9070413
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The rise of large-scale Transformers has led to challenges regarding computational costs and energy consumption. In this context, spiking neural networks (SNNs) offer potential solutions due to their energy efficiency and processing speed. However, the inaccuracy of surrogate gradients and feature space quantization pose challenges for directly training deep SNN Transformers. To tackle these challenges, we propose a method (called LDD) to align ANN and SNN features across different abstraction levels in a Transformer network. LDD incorporates structured feature knowledge from ANNs to guide SNN training, ensuring the preservation of crucial information and addressing inaccuracies in surrogate gradients through designing layer-wise distillation losses. The proposed approach outperforms existing methods on the CIFAR10 (96.1%), CIFAR100 (82.3%), and ImageNet (80.9%) datasets, and enables training of the deepest SNN Transformer network using ImageNet.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Spikeformer: Training high-performance spiking neural network with transformer
    Li, Yudong
    Lei, Yunlin
    Yang, Xu
    NEUROCOMPUTING, 2024, 574
  • [2] High-Precision Microseismic Source Localization Using a Fusion Network Combining Convolutional Neural Network and Transformer
    Feng, Qiang
    Han, Liguo
    Ma, Liyun
    Li, Qiang
    SURVEYS IN GEOPHYSICS, 2024, 45 (05) : 1527 - 1560
  • [3] Bio-Inspired Deep Spiking Neural Network for Image Classification
    Li, Jingling
    Hu, Weitai
    Yuan, Ye
    Huo, Hong
    Fang, Tao
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 294 - 304
  • [4] AUTOST: TRAINING-FREE NEURAL ARCHITECTURE SEARCH FOR SPIKING TRANSFORMERS
    Wang, Ziqing
    Zhao, Qidong
    Cui, Jinku
    Liu, Xu
    Xu, Dongkuan
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3455 - 3459
  • [5] Spike-Event-Driven Deep Spiking Neural Network With Temporal Encoding
    Zhang, Zhixuan
    Liu, Qi
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 484 - 488
  • [6] A new approach to online training for the Fuzzy ARTMAP artificial neural network
    Santos-Junior, Carlos R.
    Abreu, Thays
    Lopes, Mara L. M.
    Lotufo, Anna D. P.
    APPLIED SOFT COMPUTING, 2021, 113
  • [7] On the Application of Artificial Neural Network for Classification of Incipient Faults in Dissolved Gas Analysis of Power Transformers
    Thango, Bonginkosi A.
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2022, 4 (04): : 839 - 851
  • [8] Discriminative Region Guided Deep Neural Network Towards Food Image Classification
    Chen, Yali
    Yang, Yanping
    Fang, Qing
    Yao, Xiaoyu
    COMPUTER VISION, PT II, 2017, 772 : 577 - 587
  • [9] Hardware implementation of the simplified digital spiking neural network on FPGA
    Lee K.
    Kim Y.
    IEIE Transactions on Smart Processing and Computing, 2019, 8 (05): : 405 - 414
  • [10] A Progressive Deep Neural Network Training Method for Image Classification with Noisy Labels
    Yan, Xuguo
    Xia, Xuhui
    Wang, Lei
    Zhang, Zelin
    APPLIED SCIENCES-BASEL, 2022, 12 (24):