LDD: High-Precision Training of Deep Spiking Neural Network Transformers Guided by an Artificial Neural Network

Cited by: 1
Authors
Liu, Yuqian [1, 2]
Zhao, Chujie [1, 2]
Jiang, Yizhou [1, 2]
Fang, Ying [3, 4]
Chen, Feng [1, 2]
Affiliations
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] LSBDPA Beijing Key Lab, Beijing 100084, Peoples R China
[3] Fujian Normal Univ, Coll Comp & Cyber Secur, Fuzhou 350117, Peoples R China
[4] Fujian Normal Univ, Digital Fujian Internet of Thing Lab Environm Moni, Fuzhou 350117, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
spiking neural networks (SNNs); Transformer; distillation; image classification
DOI
10.3390/biomimetics9070413
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
The rise of large-scale Transformers has led to challenges regarding computational costs and energy consumption. In this context, spiking neural networks (SNNs) offer potential solutions due to their energy efficiency and processing speed. However, the inaccuracy of surrogate gradients and the quantization of the feature space make it difficult to train deep SNN Transformers directly. To tackle these challenges, we propose a method (called LDD) that aligns ANN and SNN features across different abstraction levels in a Transformer network. LDD uses structured feature knowledge from ANNs to guide SNN training through layer-wise distillation losses, which helps preserve crucial information and compensates for inaccuracies in surrogate gradients. The proposed approach outperforms existing methods on the CIFAR10 (96.1%), CIFAR100 (82.3%), and ImageNet (80.9%) datasets, and enables training of the deepest SNN Transformer network using ImageNet.
Pages: 15