Pattern recognition study of musical rhythm perception based on multimodal deep learning

被引:0
|
作者
Bai W. [1 ]
机构
[1] Jingchu University of Technology, Hubei, Jingmen
关键词
Deep learning; Feature fusion; Hidden Markov; Music perception; Rhythm recognition;
D O I
10.2478/amns-2024-0549
中图分类号
学科分类号
摘要
Rhythm perception is becoming more and more important in the field of music information processing and music understanding.The study first adopts signal processing methods to extract musical features, then uses feature fusion techniques to integrate features of different modalities into a single feature vector.Based on this model, the study identifies the rhythmic activation function of music and combines it with the hidden Markov model to infer the rhythm of the music.One of the key points of the study is to perform rhythm recognition on music containing drums, to explore the recognition effect. One of the focuses of the study is to recognize the rhythm of music containing drums to explore the recognition effect.In addition, the study also analyzes the Softmax output values of the music and compares the recognition effect of different models.The results show that the rhythm recognition of music using the multimodal deep learning method performs the best in terms of the F-Measure value, the Cemgil value, the Goto value, and the P-score value, with the respective 65.65%, 66.76%, 36.75%, and 36.75%. 66.76%, 36.75%, and 75.68%.Especially in the drum music recognition, the position of each drum music is accurately recognized, proving the model’s effectiveness in this paper.The research provides a new feasible method for the recognition and understanding of music rhythms and a valuable reference for the research in this field. © 2023 Wen Bai, published by Sciendo.
引用
收藏
相关论文
共 50 条
  • [31] Closing Editorial for Computer Vision and Pattern Recognition Based on Deep Learning
    Yuan, Hui
    APPLIED SCIENCES-BASEL, 2024, 14 (09):
  • [32] Deep Imbalanced Learning for Multimodal Emotion Recognition in Conversations
    Meng, Tao
    Shou, Yuntao
    Ai, Wei
    Yin, Nan
    Li, Keqin
    IEEE Transactions on Artificial Intelligence, 2024, 5 (12): : 6472 - 6487
  • [33] Deep learning based spraying pattern recognition and prediction for electrohydrodynamic system
    Wang, Jin-Xin
    Wang, Xiao
    Ran, Xiong
    Cheng, Yongpan
    Yan, Wei-Cheng
    CHEMICAL ENGINEERING SCIENCE, 2024, 295
  • [34] DEEP MULTIMODAL LEARNING FOR EMOTION RECOGNITION IN SPOKEN LANGUAGE
    Gu, Yue
    Chen, Shuhong
    Marsic, Ivan
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5079 - 5083
  • [35] Multimodal Arabic emotion recognition using deep learning
    Al Roken, Noora
    Barlas, Gerassimos
    SPEECH COMMUNICATION, 2023, 155
  • [36] Multimodal Deep Learning Framework for Mental Disorder Recognition
    Zhang, Ziheng
    Lin, Weizhe
    Liu, Mingyu
    Mahmoud, Marwa
    2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 344 - 350
  • [37] A Multimodal Deep Learning Network for Group Activity Recognition
    Rossi, Silvia
    Capasso, Roberto
    Acampora, Giovanni
    Staffa, Mariacarla
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [38] Multimodal Emotion Recognition using Deep Learning Architectures
    Ranganathan, Hiranmayi
    Chakraborty, Shayok
    Panchanathan, Sethuraman
    2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016,
  • [39] Annotation Efficiency in Multimodal Emotion Recognition with Deep Learning
    Zhu, Lili
    Spachos, Petros
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 560 - 565
  • [40] The Nature and Nurture of Melody: A Twin Study of Musical Pitch and Rhythm Perception
    Seesjarvi, Erik
    Sarkamo, Teppo
    Vuoksimaa, Eero
    Tervaniemi, Mari
    Peretz, Isabelle
    Kaprio, Jaakko
    BEHAVIOR GENETICS, 2016, 46 (04) : 506 - 515