Utilizing the Neuronal Behavior of Spiking Neurons to Recognize Music Signals Based on Time Coding Features

Cited by: 2
Authors
Shah, Dhvani [1]
Narayanan, Ajit [1]
Espinosa-Ramos, Josafath Israel [1]
Affiliations
[1] Auckland Univ Technol, Sch Engn Comp & Math Sci, Auckland 1142, New Zealand
Source
IEEE ACCESS | 2022 / Vol. 10
Keywords
Instruments; Music; Feature extraction; Neurons; Membrane potentials; Biological neural networks; Encoding; Classification; music; spiking neurons; spiking neural networks; STDP; temporal data; unsupervised learning; CLASSIFICATION; IDENTIFICATION; OPTIMIZATION; INTEGRATION; NETWORKS;
DOI
10.1109/ACCESS.2022.3164440
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper presents a Spiking Neural Network (SNN) architecture to distinguish two musical instruments: piano and violin. The acoustic characteristics of music, such as frequency and time, convey a lot of information that helps humans distinguish musical instruments within a few seconds. SNNs are neural networks that work effectively with temporal data. In this study, a 2-layer, temporally based SNN architecture is implemented for instrument (piano and violin) recognition. Further, this research investigates the behaviour of spiking neurons for piano and violin samples through different spike-based statistics. Additionally, a Gamma metric that utilises spike time information and the Root Mean Square Error (RMSE) from the membrane potential are used for classification and recognition. The SNN achieved overall classification accuracies of 92.38% and 93.19%, indicating the potential of SNNs in this inherently temporal recognition and classification domain. For comparison, we implemented rate-coding techniques using machine learning (ML) methods. Through this research, we demonstrated that SNNs are more effective than conventional ML methods for capturing the important acoustic characteristics of music, such as frequency and time. Overall, this research showed the potential capability of temporal coding over rate-coding techniques when processing spatial and temporal data.
Pages: 37317-37329
Number of pages: 13