Efficient Spike Encoding Algorithms for Neuromorphic Speech Recognition

Cited by: 2

Authors:

Yarga, Sidi Yaya Arnaud [1]

Rouat, Jean [1]

Wood, Sean U. N. [1]

Affiliations:

[1] Univ Sherbrooke, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada

Source:

PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NEUROMORPHIC SYSTEMS 2022, ICONS 2022 | 2022

Funding:

Natural Sciences and Engineering Research Council of Canada;

Keywords:

Spiking Neural Networks; Spike Encoding; Neuromorphic Computing; Speech Processing; Speech Recognition; OPTIMIZATION;

DOI:

10.1145/3546790.3546803

Chinese Library Classification number:

TP301 [Theory and methods];

Discipline classification code:

081202 ;

Abstract:

Spiking Neural Networks are known to be very effective for neuromorphic processor implementations, achieving orders of magnitude improvements in energy efficiency and computational latency over traditional deep learning approaches. Comparable algorithmic performance was recently made possible as well with the adaptation of supervised training algorithms to the context of spiking neural networks. However, information such as audio, video, and other sensor-derived data is typically encoded as real-valued signals that are not well suited to spiking neural networks, preventing the network from leveraging spike timing information. Efficient encoding from real-valued signals to spikes is therefore critical and significantly impacts the performance of the overall system. To efficiently encode signals into spikes, both the preservation of information relevant to the task at hand and the density of the encoded spikes must be considered. In this paper, we study four spike encoding methods in the context of a speaker-independent digit classification system: Send on Delta, Time to First Spike, Leaky Integrate-and-Fire Neuron, and Ben's Spiker Algorithm. We first show that all encoding methods yield higher classification accuracy using significantly fewer spikes when encoding a bio-inspired cochleagram as opposed to a traditional short-time Fourier transform. We then show that two Send on Delta variants yield classification results comparable with a state-of-the-art deep convolutional neural network baseline, while simultaneously reducing the encoded bit rate. Finally, we show that several encoding methods result in improved performance over the conventional deep learning baseline in certain cases, further demonstrating the power of spike encoding algorithms in the encoding of real-valued signals and that neuromorphic implementation has the potential to outperform state-of-the-art techniques.
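
As a rough illustration of the first method named in the abstract, the following is a minimal sketch of a generic Send on Delta encoder: an event is emitted whenever the signal has moved at least a threshold away from the value at the last event. This is the textbook form of the idea, not necessarily either of the two variants evaluated in the paper; the function name `send_on_delta`, the parameter `delta`, and the signed-polarity output are assumptions made for illustration only.

```python
import numpy as np


def send_on_delta(signal, delta):
    """Generic Send on Delta sketch (hypothetical helper, not the paper's code).

    Emits an event at sample t when |signal[t] - reference| >= delta,
    where reference is the signal value at the previous event
    (initially the first sample). Returns event indices and polarities.
    """
    signal = np.asarray(signal, dtype=float)
    reference = signal[0]
    times, polarities = [], []
    for t in range(1, len(signal)):
        diff = signal[t] - reference
        if abs(diff) >= delta:
            times.append(t)
            # +1 for an upward change, -1 for a downward change
            polarities.append(1 if diff > 0 else -1)
            # Assumption: the reference resets to the current sample value
            reference = signal[t]
    return np.array(times, dtype=int), np.array(polarities, dtype=int)


# Usage: a larger delta produces fewer events for the same signal.
times, polarities = send_on_delta([0.0, 0.2, 0.6, 0.7, 0.0, 0.0], delta=0.5)
```

In this sketch a larger `delta` yields a sparser event stream at the cost of coarser signal tracking, which mirrors the trade-off between spike density and preserved information that the abstract describes.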

Pages: 8
