Efficient Spike Encoding Algorithms for Neuromorphic Speech Recognition

被引：2

作者：

Yarga, Sidi Yaya Arnaud ^{[1
]}

Rouat, Jean ^{[1
]}

Wood, Sean U. N. ^{[1
]}

机构：

[1] Univ Sherbrooke, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada

来源：

PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NEUROMORPHIC SYSTEMS 2022, ICONS 2022 | 2022年

基金：

加拿大自然科学与工程研究理事会;

关键词：

Spiking Neural Networks; Spike Encoding; Neuromorphic Computing; Speech Processing; Speech Recognition; OPTIMIZATION;

D O I：

10.1145/3546790.3546803

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Spiking Neural Networks are known to be very effective for neuromorphic processor implementations, achieving orders of magnitude improvements in energy efficiency and computational latency over traditional deep learning approaches. Comparable algorithmic performance was recently made possible as well with the adaptation of supervised training algorithms to the context of spiking neural networks. However, information including audio, video, and other sensor-derived data are typically encoded as real-valued signals that are not well-suited to spiking neural networks, preventing the network from leveraging spike timing information. Efficient encoding from real-valued signals to spikes is therefore critical and significantly impacts the performance of the overall system. To efficiently encode signals into spikes, both the preservation of information relevant to the task at hand as well as the density of the encoded spikes must be considered. In this paper, we study four spike encoding methods in the context of a speaker independent digit classification system: Send on Delta, Time to First Spike, Leaky Integrate and Fire Neuron and Bens Spiker Algorithm. We first show that all encoding methods yield higher classification accuracy using significantly fewer spikes when encoding a bio-inspired cochleagram as opposed to a traditional short-time Fourier transform. We then show that two Send On Delta variants result in classification results comparable with a state of the art deep convolutional neural network baseline, while simultaneously reducing the encoded bit rate. Finally, we show that several encoding methods result in improved performance over the conventional deep learning baseline in certain cases, further demonstrating the power of spike encoding algorithms in the encoding of real-valued signals and that neuromorphic implementation has the potential to outperform state of the art techniques.

引用

页数：8

共 50 条

[41] A Comparative Study of Dictionary Learning Algorithms on Speech Recognition Task
Kiran, Kadambari Sai
Mandal, Anupam
Kumar, K. R. Prasanna
Mitra, Pabitra
Veni, S.
2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 588 - 594
[42] Spike-time encoding as a data compression technique for pattern recognition of temporal data
Sengupta, Neelava
Kasabov, Nikola
INFORMATION SCIENCES, 2017, 406 : 133 - 145
[43] Research on Robust Audio-Visual Speech Recognition Algorithms
Yang, Wenfeng
Li, Pengyi
Yang, Wei
Liu, Yuxing
He, Yulong
Petrosian, Ovanes
Davydenko, Aleksandr
MATHEMATICS, 2023, 11 (07)
[44] Feature Selection Using Various Hybrid Algorithms for Speech Recognition
Pacharne, Manisha
Nayak, Vidyavati S.
COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 652 - +
[45] Efficient Sparse Banded Acoustic Models for Speech Recognition
Zhang, Weibin
Fung, Pascale
IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (03) : 280 - 283
[46] Efficient Neuromorphic Signal Processing with Loihi 2
Orchard, Garrick
Frady, E. Paxon
Rubin, Daniel Ben Dayan
Sanborn, Sophia
Shrestha, Sumit Bam
Sommer, Friedrich T.
Davies, Mike
2021 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2021), 2021, : 254 - 259
[47] Resistive memories for spike-based neuromorphic circuits
Vianello, E.
Werner, T.
Bichler, O.
Valentian, A.
Molas, G.
Yvert, B.
De Salvo, B.
Perniola, L.
2017 IEEE 9TH INTERNATIONAL MEMORY WORKSHOP (IMW), 2017, : 135 - 140
[48] Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition
Xie, Xurong
Ruzi, Rukiye
Liu, Xunying
Wang, Lan
INTERSPEECH 2021, 2021, : 4808 - 4812
[49] Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning
Yu, Qiang
Yao, Yanli
Wang, Longbiao
Tang, Huajin
Dang, Jianwu
Tan, Kay Chen
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (02) : 625 - 638
[50] Lateral inhibition net and weighted matching algorithms for speech recognition in noise
Yoma, NB
McInnes, F
Jack, M
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1996, 143 (05): : 324 - 330

← 1 2 3 4 5 →