ACE-SNN: Algorithm-Hardware Co-design of Energy-Efficient & Low-Latency Deep Spiking Neural Networks for 3D Image Recognition

被引:15
作者
Datta, Gourav [1 ]
Kundu, Souvik [1 ]
Jaiswal, Akhilesh R. [1 ]
Beerel, Peter A. [1 ]
机构
[1] Univ Southern Calif, Ming Hsieh Dept Elect & Comp Engn, Los Angeles, CA 90007 USA
关键词
hyperspectral images; spiking neural networks; quantization-aware; gradient descent; processing-in-memory; SRAM; CLASSIFICATION; ARCHITECTURE; ACCELERATOR; POWER;
D O I
10.3389/fnins.2022.815258
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
High-quality 3D image recognition is an important component of many vision and robotics systems. However, the accurate processing of these images requires the use of compute-expensive 3D Convolutional Neural Networks (CNNs). To address this challenge, we propose the use of Spiking Neural Networks (SNNs) that are generated from iso-architecture CNNs and trained with quantization-aware gradient descent to optimize their weights, membrane leak, and firing thresholds. During both training and inference, the analog pixel values of a 3D image are directly applied to the input layer of the SNN without the need to convert to a spike-train. This significantly reduces the training and inference latency and results in high degree of activation sparsity, which yields significant improvements in computational efficiency. However, this introduces energy-hungry digital multiplications in the first layer of our models, which we propose to mitigate using a processing-in-memory (PIM) architecture. To evaluate our proposal, we propose a 3D and a 3D/2D hybrid SNN-compatible convolutional architecture and choose hyperspectral imaging (HSI) as an application for 3D image recognition. We achieve overall test accuracy of 98.68, 99.50, and 97.95% with 5 time steps (inference latency) and 6-bit weight quantization on the Indian Pines, Pavia University, and Salinas Scene datasets, respectively. In particular, our models implemented using standard digital hardware achieved accuracies similar to state-of-the-art (SOTA) with ~560.6x and ~44.8x less average energy than an iso-architecture full-precision and 6-bit quantized CNN, respectively. Adopting the PIM architecture in the first layer, further improves the average energy, delay, and energy-delay-product (EDP) by 30, 7, and 38%, respectively.
引用
收藏
页数:21
相关论文
共 76 条
[1]   Xcel-RAM: Accelerating Binary Neural Networks in High-Throughput SRAM Compute Arrays [J].
Agrawal, Amogh ;
Jaiswal, Akhilesh ;
Roy, Deboleena ;
Han, Bing ;
Srinivasan, Gopalakrishnan ;
Ankit, Aayush ;
Roy, Kaushik .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2019, 66 (08) :3064-3076
[2]   X-SRAM: Enabling In-Memory Boolean Computations in CMOS Static Random Access Memories [J].
Agrawal, Amogh ;
Jaiswal, Akhilesh ;
Lee, Chankyu ;
Roy, Kaushik .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2018, 65 (12) :4219-4232
[3]   A 35.5-127.2 TOPS/W Dynamic Sparsity-Aware Reconfigurable-Precision Compute-in-Memory SRAM Macro for Machine Learning [J].
Ali, Mustafa ;
Chakraborty, Indranil ;
Saxena, Utkarsh ;
Agrawal, Amogh ;
Ankit, Aayush ;
Roy, Kaushik .
IEEE SOLID-STATE CIRCUITS LETTERS, 2021, 4 :129-132
[4]   IMAC: In-Memory Multi-Bit Multiplication and ACcumulation in 6T SRAM Array [J].
Ali, Mustafa ;
Jaiswal, Akhilesh ;
Kodge, Sangamesh ;
Agrawal, Amogh ;
Chakraborty, Indranil ;
Roy, Kaushik .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2020, 67 (08) :2521-2531
[5]   Multibranch Selective Kernel Networks for Hyperspectral Image Classification [J].
Alipour-Fard, T. ;
Paoletti, M. E. ;
Haut, Juan M. ;
Arefi, H. ;
Plaza, J. ;
Plaza, A. .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (06) :1089-1093
[6]   A comparative study on spiking neural network encoding schema: implemented with cloud computing [J].
Almomani, Ammar ;
Alauthman, Mohammad ;
Alweshah, Mohammed ;
Dorgham, O. ;
Albalas, Firas .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (02) :419-433
[7]   PANTHER: A Programmable Architecture for Neural Network Training Harnessing Energy-Efficient ReRAM [J].
Ankit, Aayush ;
El Hajj, Izzat ;
Chalamalasetti, Sai Rahul ;
Agarwal, Sapan ;
Marinella, Matthew ;
Foltin, Martin ;
Strachan, John Paul ;
Milojicic, Dejan ;
Hwu, Wen-Mei ;
Roy, Kaushik .
IEEE TRANSACTIONS ON COMPUTERS, 2020, 69 (08) :1128-1142
[8]  
[Anonymous], 2015, Advances in neural information processing systems
[9]  
Bellec G, 2018, 32 C NEURAL INFORM P
[10]   3-D Deep Learning Approach for Remote Sensing Image Classification [J].
Ben Hamida, Amina ;
Benoit, Alexandre ;
Lambert, Patrick ;
Ben Amar, Chokri .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (08) :4420-4434