ACE-SNN: Algorithm-Hardware Co-design of Energy-Efficient & Low-Latency Deep Spiking Neural Networks for 3D Image Recognition

Cited by: 11
Authors:
Datta, Gourav [1 ]
Kundu, Souvik [1 ]
Jaiswal, Akhilesh R. [1 ]
Beerel, Peter A. [1 ]
Affiliation:
[1] Univ Southern Calif, Ming Hsieh Dept Elect & Comp Engn, Los Angeles, CA 90007 USA
Keywords:
hyperspectral images; spiking neural networks; quantization-aware; gradient descent; processing-in-memory; SRAM; classification; architecture; accelerator; power
DOI:
10.3389/fnins.2022.815258
Chinese Library Classification:
Q189 [Neuroscience]
Discipline code:
071006
Abstract:
High-quality 3D image recognition is an important component of many vision and robotics systems. However, accurately processing these images requires compute-expensive 3D Convolutional Neural Networks (CNNs). To address this challenge, we propose the use of Spiking Neural Networks (SNNs) that are generated from iso-architecture CNNs and trained with quantization-aware gradient descent to optimize their weights, membrane leak, and firing thresholds. During both training and inference, the analog pixel values of a 3D image are applied directly to the input layer of the SNN without conversion to a spike train. This significantly reduces the training and inference latency and results in a high degree of activation sparsity, which yields significant improvements in computational efficiency. However, it also introduces energy-hungry digital multiplications in the first layer of our models, which we propose to mitigate using a processing-in-memory (PIM) architecture. To evaluate our proposal, we present a 3D and a 3D/2D hybrid SNN-compatible convolutional architecture and choose hyperspectral imaging (HSI) as an application for 3D image recognition. We achieve overall test accuracies of 98.68%, 99.50%, and 97.95% with 5 time steps (inference latency) and 6-bit weight quantization on the Indian Pines, Pavia University, and Salinas Scene datasets, respectively. In particular, our models implemented on standard digital hardware achieve accuracies similar to state-of-the-art (SOTA) with ~560.6x and ~44.8x less average energy than an iso-architecture full-precision and 6-bit quantized CNN, respectively. Adopting the PIM architecture in the first layer further improves the average energy, delay, and energy-delay product (EDP) by 30%, 7%, and 38%, respectively.
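To make the abstract's key ingredients concrete (direct analog input applied at every time step, a leaky integrate-and-fire layer with trainable leak and firing threshold, and 6-bit weight quantization), here is a minimal NumPy sketch. All function names, the soft-reset choice, and the parameter values are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def quantize(w, bits=6):
    """Uniform symmetric fake-quantization of weights to `bits` bits,
    a common stand-in for quantization-aware training."""
    scale = np.abs(w).max() / (2 ** (bits - 1) - 1)
    return np.round(w / scale) * scale

def lif_forward(x_analog, w, leak, v_th, T=5):
    """Run a toy LIF layer for T time steps.

    The analog input x_analog is applied directly at every step
    (no spike-train conversion), as in the direct-encoding scheme;
    `leak` and `v_th` stand in for the trainable membrane leak and
    firing threshold described in the abstract.
    """
    v = np.zeros(w.shape[0])               # membrane potentials
    spikes = np.zeros((T, w.shape[0]))
    for t in range(T):
        v = leak * v + w @ x_analog        # leaky integration of the MAC result
        fired = v >= v_th                  # spike where threshold is crossed
        spikes[t] = fired.astype(float)
        v = np.where(fired, v - v_th, v)   # soft reset by threshold subtraction
    return spikes

rng = np.random.default_rng(0)
x = rng.random(8)                          # analog pixel values in [0, 1)
w = quantize(rng.standard_normal((4, 8)) * 0.5, bits=6)
out = lif_forward(x, w, leak=0.9, v_th=1.0)
print(out.shape)                           # one binary spike map per time step
```

After the first layer, downstream layers would receive only the binary spike maps, so their multiply-accumulates reduce to sparse additions; the dense analog multiplications are confined to this first layer, which is why the paper targets it with PIM.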
Pages: 21
Related papers (11):
  • [1] STELLAR: Energy-Efficient and Low-Latency SNN Algorithm and Hardware Co-design with Spatiotemporal Computation
    Mao, Ruixin
    Tang, Lin
    Yuan, Xingyu
    Liu, Ye
    Zhou, Jun
    2024 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA 2024, 2024, : 172 - 185
  • [2] Algorithm-hardware co-design for Energy-Efficient A/D conversion in ReRAM-based accelerators
    Zhang, Chenguang
    Yuan, Zhihang
    Li, Xingchen
    Sun, Guangyu
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024
  • [3] Scalable Energy-Efficient, Low-Latency Implementations of Trained Spiking Deep Belief Networks on SpiNNaker
    Stromatias, Evangelos
    Neil, Daniel
    Galluppi, Francesco
    Pfeiffer, Michael
    Liu, Shih-Chii
    Furber, Steve
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015
  • [4] A Low-Latency Framework With Algorithm-Hardware Co-Optimization for 3-D Point Cloud
    Yu, Yue
    Mao, Wendong
    Luo, Jiapeng
    Wang, Zhongfeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (11) : 4221 - 4225
  • [5] Algorithm and Hardware Co-Design of Energy-Efficient LSTM Networks for Video Recognition With Hierarchical Tucker Tensor Decomposition
    Gong, Yu
    Yin, Miao
    Huang, Lingyi
    Deng, Chunhua
    Yuan, Bo
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (12) : 3101 - 3114
  • [6] Reinforcement co-Learning of Deep and Spiking Neural Networks for Energy-Efficient Mapless Navigation with Neuromorphic Hardware
    Tang, Guangzhi
    Kumar, Neelesh
    Michmizos, Konstantinos P.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 6090 - 6097
  • [7] IMG-SMP: Algorithm and Hardware Co-Design for Real-time Energy-efficient Neural Motion Planning
    Huang, Lingyi
    Zang, Xiao
    Gong, Yu
    Deng, Chunhua
    Yi, Jingang
    Yuan, Bo
    PROCEEDINGS OF THE 32ND GREAT LAKES SYMPOSIUM ON VLSI 2022, GLSVLSI 2022, 2022, : 373 - 377
  • [8] Fixed-Point Quantization of 3D Convolutional Neural Networks for Energy-Efficient Action Recognition
    Lee, Hyunhoon
    Byun, Younghoon
    Hwang, Seokha
    Lee, Sunggu
    Lee, Youngjoo
    2018 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2018, : 129 - 130
  • [9] An Energy-Efficient 3D Cross-Ring Accelerator With 3D-SRAM Cubes for Hybrid Deep Neural Networks
    Lu, Wei
    Huang, Po-Tsang
    Chen, Hung-Ming
    Hwang, Wei
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2021, 11 (04) : 776 - 788
  • [10] Hyperdimensional Computing with 3D VRRAM In-Memory Kernels: Device-Architecture Co-Design for Energy-Efficient, Error-Resilient Language Recognition
    Li, Haitong
    Wu, Tony F.
    Rahimi, Abbas
    Li, Kai-Shin
    Rusch, Miles
    Lin, Chang-Hsien
    Hsu, Juo-Luen
    Sabry, Mohamed M.
    Eryilmaz, S. Burc
    Sohn, Joon
    Chiu, Wen-Cheng
    Chen, Min-Cheng
    Wu, Tsung-Ta
    Shieh, Jia-Min
    Yeh, Wen-Kuan
    Rabaey, Jan M.
    Mitra, Subhasish
    Wong, H.-S. Philip
    2016 IEEE INTERNATIONAL ELECTRON DEVICES MEETING (IEDM), 2016