Efficient Spike Encoding Algorithms for Neuromorphic Speech Recognition

被引:2
|
作者
Yarga, Sidi Yaya Arnaud [1 ]
Rouat, Jean [1 ]
Wood, Sean U. N. [1 ]
机构
[1] Univ Sherbrooke, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada
来源
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NEUROMORPHIC SYSTEMS 2022, ICONS 2022 | 2022年
基金
加拿大自然科学与工程研究理事会;
关键词
Spiking Neural Networks; Spike Encoding; Neuromorphic Computing; Speech Processing; Speech Recognition; OPTIMIZATION;
D O I
10.1145/3546790.3546803
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Spiking Neural Networks are known to be very effective for neuromorphic processor implementations, achieving orders of magnitude improvements in energy efficiency and computational latency over traditional deep learning approaches. Comparable algorithmic performance was recently made possible as well with the adaptation of supervised training algorithms to the context of spiking neural networks. However, information including audio, video, and other sensor-derived data are typically encoded as real-valued signals that are not well-suited to spiking neural networks, preventing the network from leveraging spike timing information. Efficient encoding from real-valued signals to spikes is therefore critical and significantly impacts the performance of the overall system. To efficiently encode signals into spikes, both the preservation of information relevant to the task at hand as well as the density of the encoded spikes must be considered. In this paper, we study four spike encoding methods in the context of a speaker independent digit classification system: Send on Delta, Time to First Spike, Leaky Integrate and Fire Neuron and Bens Spiker Algorithm. We first show that all encoding methods yield higher classification accuracy using significantly fewer spikes when encoding a bio-inspired cochleagram as opposed to a traditional short-time Fourier transform. We then show that two Send On Delta variants result in classification results comparable with a state of the art deep convolutional neural network baseline, while simultaneously reducing the encoded bit rate. Finally, we show that several encoding methods result in improved performance over the conventional deep learning baseline in certain cases, further demonstrating the power of spike encoding algorithms in the encoding of real-valued signals and that neuromorphic implementation has the potential to outperform state of the art techniques.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] A Comparative Study of Dictionary Learning Algorithms on Speech Recognition Task
    Kiran, Kadambari Sai
    Mandal, Anupam
    Kumar, K. R. Prasanna
    Mitra, Pabitra
    Veni, S.
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 588 - 594
  • [42] Spike-time encoding as a data compression technique for pattern recognition of temporal data
    Sengupta, Neelava
    Kasabov, Nikola
    INFORMATION SCIENCES, 2017, 406 : 133 - 145
  • [43] Research on Robust Audio-Visual Speech Recognition Algorithms
    Yang, Wenfeng
    Li, Pengyi
    Yang, Wei
    Liu, Yuxing
    He, Yulong
    Petrosian, Ovanes
    Davydenko, Aleksandr
    MATHEMATICS, 2023, 11 (07)
  • [44] Feature Selection Using Various Hybrid Algorithms for Speech Recognition
    Pacharne, Manisha
    Nayak, Vidyavati S.
    COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 652 - +
  • [45] Efficient Sparse Banded Acoustic Models for Speech Recognition
    Zhang, Weibin
    Fung, Pascale
    IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (03) : 280 - 283
  • [46] Efficient Neuromorphic Signal Processing with Loihi 2
    Orchard, Garrick
    Frady, E. Paxon
    Rubin, Daniel Ben Dayan
    Sanborn, Sophia
    Shrestha, Sumit Bam
    Sommer, Friedrich T.
    Davies, Mike
    2021 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2021), 2021, : 254 - 259
  • [47] Resistive memories for spike-based neuromorphic circuits
    Vianello, E.
    Werner, T.
    Bichler, O.
    Valentian, A.
    Molas, G.
    Yvert, B.
    De Salvo, B.
    Perniola, L.
    2017 IEEE 9TH INTERNATIONAL MEMORY WORKSHOP (IMW), 2017, : 135 - 140
  • [48] Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition
    Xie, Xurong
    Ruzi, Rukiye
    Liu, Xunying
    Wang, Lan
    INTERSPEECH 2021, 2021, : 4808 - 4812
  • [49] Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning
    Yu, Qiang
    Yao, Yanli
    Wang, Longbiao
    Tang, Huajin
    Dang, Jianwu
    Tan, Kay Chen
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (02) : 625 - 638
  • [50] Lateral inhibition net and weighted matching algorithms for speech recognition in noise
    Yoma, NB
    McInnes, F
    Jack, M
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1996, 143 (05): : 324 - 330