Efficient Spike Encoding Algorithms for Neuromorphic Speech Recognition

被引：2

作者：

Yarga, Sidi Yaya Arnaud ^{[1
]}

Rouat, Jean ^{[1
]}

Wood, Sean U. N. ^{[1
]}

机构：

[1] Univ Sherbrooke, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada

来源：

PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NEUROMORPHIC SYSTEMS 2022, ICONS 2022 | 2022年

基金：

加拿大自然科学与工程研究理事会;

关键词：

Spiking Neural Networks; Spike Encoding; Neuromorphic Computing; Speech Processing; Speech Recognition; OPTIMIZATION;

D O I：

10.1145/3546790.3546803

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Spiking Neural Networks are known to be very effective for neuromorphic processor implementations, achieving orders of magnitude improvements in energy efficiency and computational latency over traditional deep learning approaches. Comparable algorithmic performance was recently made possible as well with the adaptation of supervised training algorithms to the context of spiking neural networks. However, information including audio, video, and other sensor-derived data are typically encoded as real-valued signals that are not well-suited to spiking neural networks, preventing the network from leveraging spike timing information. Efficient encoding from real-valued signals to spikes is therefore critical and significantly impacts the performance of the overall system. To efficiently encode signals into spikes, both the preservation of information relevant to the task at hand as well as the density of the encoded spikes must be considered. In this paper, we study four spike encoding methods in the context of a speaker independent digit classification system: Send on Delta, Time to First Spike, Leaky Integrate and Fire Neuron and Bens Spiker Algorithm. We first show that all encoding methods yield higher classification accuracy using significantly fewer spikes when encoding a bio-inspired cochleagram as opposed to a traditional short-time Fourier transform. We then show that two Send On Delta variants result in classification results comparable with a state of the art deep convolutional neural network baseline, while simultaneously reducing the encoded bit rate. Finally, we show that several encoding methods result in improved performance over the conventional deep learning baseline in certain cases, further demonstrating the power of spike encoding algorithms in the encoding of real-valued signals and that neuromorphic implementation has the potential to outperform state of the art techniques.

引用

页数：8

共 50 条

[31] Neuromorphic crossbar circuit with nanoscale filamentary-switching binary memristors for speech recognition
Son Ngoc Truong
Ham, Seok-Jin
Min, Kyeong-Sik
NANOSCALE RESEARCH LETTERS, 2014, 9 : 1 - 9
[32] Neuromorphic crossbar circuit with nanoscale filamentary-switching binary memristors for speech recognition
Son Ngoc Truong
Seok-Jin Ham
Kyeong-Sik Min
Nanoscale Research Letters, 9
[33] LEARNING EFFICIENT SPARSE STRUCTURES IN SPEECH RECOGNITION
Zhang, Jingchi
Wen, Wei
Deisher, Michael
Cheng, Hsin-Pai
Li, Hai
Chen, Yiran
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2717 - 2721
[34] Neuromorphic detection of speech dynamics
Gomez-Vilda, Pedro
Ferrandez-Vicente, Jose M.
Rodellar-Biarge, Victoria
Alvarez-Marquina, Agustin
Miguel Mazaira-Fernandez, Luis
Martinez Olalla, Rafael
Munoz-Mulas, Cristina
NEUROCOMPUTING, 2011, 74 (08) : 1191 - 1202
[35] Compact Convolutional SNN Architecture for the Neuromorphic Speech Denoising
Dorzhigulov, Anuar
Saxena, Vishal
2024 IEEE 67TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, MWSCAS 2024, 2024, : 1191 - 1195
[36] Using Speech Recognition Algorithms to Improve Listening Training in College English Listening Instruction
Chen, Xiaohua
JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (06) : 1775 - 1785
[37] Performance Analysis of Various Single Channel Speech Enhancement Algorithms for Automatic Speech Recognition
Song, Myung-Suk
Lee, Chang-Heon
Kang, Hong-Goo
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1451 - 1454
[38] Criteria for the Evaluation of Automated Speech-Recognition Scoring Algorithms
Dobrisek, Simon
ELEKTROTEHNISKI VESTNIK-ELECTROCHEMICAL REVIEW, 2008, 75 (04): : 229 - 234
[39] Algorithms for Vowel Recognition in Fluent Speech Based on Formant Positions
Stanek, Miroslav
Polak, Ladislav
2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2013, : 521 - 525
[40] A Family of Discriminative Manifold Learning Algorithms and Their Application to Speech Recognition
Tomar, Vikrant Singh
Rose, Richard C.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (01) : 161 - 171

← 1 2 3 4 5 →