Speech Command Recognition Based on Convolutional Spiking Neural Networks

被引:5
作者
Sadovsky, Erik [1 ]
Jakubec, Maros [1 ]
Jarina, Roman [1 ]
机构
[1] Univ Zilina, Dept Multimedia & Informat Commun Technol, FEIT, Zilina, Slovakia
来源
2023 33RD INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, RADIOELEKTRONIKA | 2023年
关键词
spiking neural network; spiking speech commands; command recognition; convolutional spiking neural network;
D O I
10.1109/RADIOELEKTRONIKA57919.2023.10109082
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article presents a new technique for speech recognition that combines Convolutional Neural Networks (CNNs) with Spiking Neural Networks (SNNs) to create an SNN-CNN model. The model is tested on the Google Speech Command Dataset and achieves an accuracy of 72.03%, which is similar to the current state-of-the-art speech recognition methods. The study also compares the performance of the SNN-CNN model with other SNN models that use Multi-Layer Perceptrons (MLPs) and traditional Artificial Neural Networks (ANNs). The results show that the CNN-based SNNs outperform both MLPs and ANNs, demonstrating the superiority of the proposed model. The approach presented in this study can potentially be applied to other speech recognition tasks and could lead to further improvements in the field.
引用
收藏
页数:5
相关论文
共 31 条
  • [1] Bellec G, 2018, Arxiv, DOI arXiv:1803.09574
  • [2] A surrogate gradient spiking baseline for speech command recognition
    Bittar, Alexandre
    Garner, Philip N.
    [J]. FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [3] Spiking Deep Convolutional Neural Networks for Energy-Efficient Object Recognition
    Cao, Yongqiang
    Chen, Yang
    Khosla, Deepak
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 113 (01) : 54 - 66
  • [4] Spike timing-dependent plasticity: A Hebbian learning rule
    Caporale, Natalia
    Dan, Yang
    [J]. ANNUAL REVIEW OF NEUROSCIENCE, 2008, 31 : 25 - 46
  • [5] The Heidelberg Spiking Data Sets for the Systematic Evaluation of Spiking Neural Networks
    Cramer, Benjamin
    Stradmann, Yannik
    Schemmel, Johannes
    Zenke, Friedemann
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) : 2744 - 2757
  • [6] Diehl PU, 2015, IEEE IJCNN
  • [7] SPIKING NEURAL NETWORKS
    Ghosh-Dastidar, Samanwoy
    Adeli, Hojjat
    [J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2009, 19 (04) : 295 - 308
  • [8] Holmberg M., 2005, Eurospeech, P1253
  • [9] Comparison of a spiking neural network and an MLP for robust identification of generator dynamics in a multimachine power system
    Johnson, Cameron
    Venayagamoorthy, Ganesh Kumar
    Mitra, Pinaki
    [J]. NEURAL NETWORKS, 2009, 22 (5-6) : 833 - 841
  • [10] Towards a neurocomputational model of speech production and perception
    Kroeger, Bernd J.
    Kannampuzha, Jim
    Neuschaefer-Rube, Christiane
    [J]. SPEECH COMMUNICATION, 2009, 51 (09) : 793 - 809