Efficient Spike Encoding Algorithms for Neuromorphic Speech Recognition

Cited by: 2
Authors
Yarga, Sidi Yaya Arnaud [1]
Rouat, Jean [1]
Wood, Sean U. N. [1]
Affiliations
[1] Univ Sherbrooke, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada
Source
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NEUROMORPHIC SYSTEMS 2022, ICONS 2022 | 2022
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
Spiking Neural Networks; Spike Encoding; Neuromorphic Computing; Speech Processing; Speech Recognition; OPTIMIZATION;
DOI
10.1145/3546790.3546803
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Spiking Neural Networks are known to be very effective for neuromorphic processor implementations, achieving orders of magnitude improvements in energy efficiency and computational latency over traditional deep learning approaches. Comparable algorithmic performance was recently made possible as well with the adaptation of supervised training algorithms to the context of spiking neural networks. However, information including audio, video, and other sensor-derived data is typically encoded as real-valued signals that are not well suited to spiking neural networks, preventing the network from leveraging spike timing information. Efficient encoding from real-valued signals to spikes is therefore critical and significantly impacts the performance of the overall system. To efficiently encode signals into spikes, both the preservation of information relevant to the task at hand and the density of the encoded spikes must be considered. In this paper, we study four spike encoding methods in the context of a speaker-independent digit classification system: Send On Delta, Time to First Spike, Leaky Integrate and Fire Neuron, and Ben's Spiker Algorithm. We first show that all encoding methods yield higher classification accuracy using significantly fewer spikes when encoding a bio-inspired cochleagram as opposed to a traditional short-time Fourier transform. We then show that two Send On Delta variants result in classification results comparable with a state-of-the-art deep convolutional neural network baseline, while simultaneously reducing the encoded bit rate. Finally, we show that several encoding methods result in improved performance over the conventional deep learning baseline in certain cases, further demonstrating the power of spike encoding algorithms in the encoding of real-valued signals and that neuromorphic implementation has the potential to outperform state-of-the-art techniques.
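As an illustration of one of the four encoders named in the abstract, the following is a minimal sketch of Send On Delta in its generic textbook form: an event is emitted whenever the signal has moved at least a fixed threshold away from its value at the previous event. The function name `send_on_delta`, the `threshold` parameter, and the signed-polarity output are assumptions made for this example; the record does not describe the specific variants evaluated in the paper, so this should not be read as the authors' implementation.

```python
import numpy as np

def send_on_delta(signal, threshold):
    """Generic Send On Delta sketch (hypothetical helper, not the paper's code).

    Returns the sample indices at which events fire and their polarities
    (+1 for an upward change, -1 for a downward change).
    """
    signal = np.asarray(signal, dtype=float)
    reference = signal[0]            # signal value at the last emitted event
    indices, polarities = [], []
    for i, x in enumerate(signal[1:], start=1):
        diff = x - reference
        if abs(diff) >= threshold:   # change since last event is large enough
            indices.append(i)
            polarities.append(1 if diff > 0 else -1)
            reference = x            # reset the reference to the current sample
    return indices, polarities

# Usage on one hypothetical channel of a time-frequency representation
idx, pol = send_on_delta([0.0, 0.05, 0.2, 0.25, 0.0], threshold=0.1)
```

A larger threshold yields fewer events and a coarser representation of the signal, which is the trade-off between spike density and preserved information that the abstract refers to.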
Pages: 8
Related papers
50 in total
  • [21] Relative Positional Encoding for Speech Recognition and Direct Translation
    Pham, Ngoc-Quan
    Ha, Thanh-Le
    Nguyen, Tuan-Nam
    Nguyen, Thai-Son
    Salesky, Elizabeth
    Stuker, Sebastian
    Niehues, Jan
    Waibel, Alex
    INTERSPEECH 2020, 2020, : 31 - 35
  • [22] Algorithms and Methods for the Automatic Speech Recognition in Spanish Language using Syllables
    Oropeza Rodriguez, Jose Luis
    Suarez Guerra, Sergio
    COMPUTACION Y SISTEMAS, 2006, 9 (03): : 270 - 286
  • [23] Spike-based information encoding in vertical cavity surface emitting lasers for neuromorphic photonic systems
    Hejda, Matej
    Robertson, Joshua
    Bueno, Julian
    Hurtado, Antonio
    JOURNAL OF PHYSICS-PHOTONICS, 2020, 2 (04):
  • [24] NBSSN: A Neuromorphic Binary Single-Spike Neural Network for Efficient Edge Intelligence
    Shen, Ziyang
    Tian, Fengshi
    Jiang, Jingwen
    Fang, Chaoming
    Xue, Xiaoyong
    Yang, Jie
    Sawan, Mohamad
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [25] Evaluating Encoding and Decoding Approaches for Spiking Neuromorphic Systems
    Schuman, Catherine D.
    Rizzo, Charles
    McDonald-Carmack, John
    Skuda, Nicholas
    Plank, James S.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NEUROMORPHIC SYSTEMS 2022, ICONS 2022, 2022,
  • [26] Review of spike-based neuromorphic computing for brain-inspired vision: biology, algorithms, and hardware
    Hendy, Hagar
    Merkel, Cory
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)
  • [27] Development of sEMG sensors and algorithms for silent speech recognition
    Meltzner, Geoffrey S.
    Heaton, James T.
    Deng, Yunbin
    De Luca, Gianluca
    Roy, Serge H.
    Kline, Joshua C.
    JOURNAL OF NEURAL ENGINEERING, 2018, 15 (04)
  • [28] Evolution of the performance of automatic speech recognition algorithms in transcribing conversational telephone speech
    Padmanabhan, M
    Saon, G
    Zweig, G
    Huang, J
    Kingsbury, B
    Mangu, L
    IMTC/2001: PROCEEDINGS OF THE 18TH IEEE INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, VOLS 1-3: REDISCOVERING MEASUREMENT IN THE AGE OF INFORMATICS, 2001, : 1926 - 1931
  • [29] Efficient Weight factorization for Multilingual Speech Recognition
    Ngoc-Quan Pham
    Tuan-Nam Nguyen
    Stuker, Sebastian
    Waibel, Alex
    INTERSPEECH 2021, 2021, : 2421 - 2425
  • [30] Efficient, low latency adaptation for speech recognition
    Kozat, Suleyman S.
    Visweswariah, Karthik
    Gopinath, Ramesh
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 777 - +