Efficient Spike Encoding Algorithms for Neuromorphic Speech Recognition

Cited by: 2
Authors
Yarga, Sidi Yaya Arnaud [1]
Rouat, Jean [1]
Wood, Sean U. N. [1]
Affiliation
[1] Univ Sherbrooke, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada
Source
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NEUROMORPHIC SYSTEMS 2022, ICONS 2022 | 2022
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
Spiking Neural Networks; Spike Encoding; Neuromorphic Computing; Speech Processing; Speech Recognition; Optimization
DOI
10.1145/3546790.3546803
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Spiking Neural Networks are known to be very effective for neuromorphic processor implementations, achieving orders of magnitude improvements in energy efficiency and computational latency over traditional deep learning approaches. Comparable algorithmic performance was recently made possible as well with the adaptation of supervised training algorithms to the context of spiking neural networks. However, information, including audio, video, and other sensor-derived data, is typically encoded as real-valued signals that are not well-suited to spiking neural networks, preventing the network from leveraging spike timing information. Efficient encoding from real-valued signals to spikes is therefore critical and significantly impacts the performance of the overall system. To efficiently encode signals into spikes, both the preservation of information relevant to the task at hand and the density of the encoded spikes must be considered. In this paper, we study four spike encoding methods in the context of a speaker-independent digit classification system: Send On Delta, Time to First Spike, Leaky Integrate-and-Fire Neuron, and Ben's Spiker Algorithm. We first show that all encoding methods yield higher classification accuracy using significantly fewer spikes when encoding a bio-inspired cochleagram as opposed to a traditional short-time Fourier transform. We then show that two Send On Delta variants result in classification results comparable with a state-of-the-art deep convolutional neural network baseline, while simultaneously reducing the encoded bit rate. Finally, we show that several encoding methods result in improved performance over the conventional deep learning baseline in certain cases, further demonstrating the power of spike encoding algorithms in the encoding of real-valued signals and that neuromorphic implementation has the potential to outperform state-of-the-art techniques.
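As an illustration of what encoding a real-valued signal into spikes can look like, the following is a minimal sketch of a generic Send On Delta encoder for a single channel. It is not taken from the paper: the function name `send_on_delta`, the `threshold` parameter, the choice of the first sample as the initial reference, and the rule of resetting the reference to the current sample after each spike are all assumptions, and the two variants evaluated by the authors may differ.

```python
def send_on_delta(signal, threshold):
    """Generic Send On Delta sketch (hypothetical helper, not the paper's code).

    Emits an event (sample_index, polarity) whenever the signal has moved at
    least `threshold` away from the value that triggered the previous event.
    """
    if threshold <= 0:
        raise ValueError("threshold must be positive")
    if not signal:
        return []

    events = []
    reference = signal[0]  # assumption: the first sample is the initial reference
    for i, x in enumerate(signal[1:], start=1):
        delta = x - reference
        if delta >= threshold:
            events.append((i, +1))  # upward change: positive spike
            reference = x           # assumption: reset reference to current sample
        elif delta <= -threshold:
            events.append((i, -1))  # downward change: negative spike
            reference = x
    return events


# Toy channel, e.g. one frequency band of a spectrogram over six frames.
samples = [0.0, 0.2, 0.6, 0.7, 0.0, 0.1]
events = send_on_delta(samples, threshold=0.5)
print(events)
```

In this sketch a larger `threshold` yields fewer spikes at the cost of discarding small fluctuations, which is the trade-off between spike density and preserved information that the abstract describes.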
Pages: 8
Related papers
50 in total
  • [31] Neuromorphic crossbar circuit with nanoscale filamentary-switching binary memristors for speech recognition
    Son Ngoc Truong
    Ham, Seok-Jin
    Min, Kyeong-Sik
    NANOSCALE RESEARCH LETTERS, 2014, 9 : 1 - 9
  • [33] LEARNING EFFICIENT SPARSE STRUCTURES IN SPEECH RECOGNITION
    Zhang, Jingchi
    Wen, Wei
    Deisher, Michael
    Cheng, Hsin-Pai
    Li, Hai
    Chen, Yiran
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2717 - 2721
  • [34] Neuromorphic detection of speech dynamics
    Gomez-Vilda, Pedro
    Ferrandez-Vicente, Jose M.
    Rodellar-Biarge, Victoria
    Alvarez-Marquina, Agustin
    Miguel Mazaira-Fernandez, Luis
    Martinez Olalla, Rafael
    Munoz-Mulas, Cristina
    NEUROCOMPUTING, 2011, 74 (08) : 1191 - 1202
  • [35] Compact Convolutional SNN Architecture for the Neuromorphic Speech Denoising
    Dorzhigulov, Anuar
    Saxena, Vishal
    2024 IEEE 67TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, MWSCAS 2024, 2024, : 1191 - 1195
  • [36] Using Speech Recognition Algorithms to Improve Listening Training in College English Listening Instruction
    Chen, Xiaohua
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (06) : 1775 - 1785
  • [37] Performance Analysis of Various Single Channel Speech Enhancement Algorithms for Automatic Speech Recognition
    Song, Myung-Suk
    Lee, Chang-Heon
    Kang, Hong-Goo
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1451 - 1454
  • [38] Criteria for the Evaluation of Automated Speech-Recognition Scoring Algorithms
    Dobrisek, Simon
    ELEKTROTEHNISKI VESTNIK-ELECTROTECHNICAL REVIEW, 2008, 75 (04) : 229 - 234
  • [39] Algorithms for Vowel Recognition in Fluent Speech Based on Formant Positions
    Stanek, Miroslav
    Polak, Ladislav
    2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2013, : 521 - 525
  • [40] A Family of Discriminative Manifold Learning Algorithms and Their Application to Speech Recognition
    Tomar, Vikrant Singh
    Rose, Richard C.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (01) : 161 - 171