Pattern recognition study of musical rhythm perception based on multimodal deep learning

被引:0
|
作者
Bai W. [1 ]
机构
[1] Jingchu University of Technology, Hubei, Jingmen
关键词
Deep learning; Feature fusion; Hidden Markov; Music perception; Rhythm recognition;
D O I
10.2478/amns-2024-0549
中图分类号
学科分类号
摘要
Rhythm perception is becoming more and more important in the field of music information processing and music understanding.The study first adopts signal processing methods to extract musical features, then uses feature fusion techniques to integrate features of different modalities into a single feature vector.Based on this model, the study identifies the rhythmic activation function of music and combines it with the hidden Markov model to infer the rhythm of the music.One of the key points of the study is to perform rhythm recognition on music containing drums, to explore the recognition effect. One of the focuses of the study is to recognize the rhythm of music containing drums to explore the recognition effect.In addition, the study also analyzes the Softmax output values of the music and compares the recognition effect of different models.The results show that the rhythm recognition of music using the multimodal deep learning method performs the best in terms of the F-Measure value, the Cemgil value, the Goto value, and the P-score value, with the respective 65.65%, 66.76%, 36.75%, and 36.75%. 66.76%, 36.75%, and 75.68%.Especially in the drum music recognition, the position of each drum music is accurately recognized, proving the model’s effectiveness in this paper.The research provides a new feasible method for the recognition and understanding of music rhythms and a valuable reference for the research in this field. © 2023 Wen Bai, published by Sciendo.
引用
收藏
相关论文
共 50 条
  • [41] The Nature and Nurture of Melody: A Twin Study of Musical Pitch and Rhythm Perception
    Erik Seesjärvi
    Teppo Särkämö
    Eero Vuoksimaa
    Mari Tervaniemi
    Isabelle Peretz
    Jaakko Kaprio
    Behavior Genetics, 2016, 46 : 506 - 515
  • [42] Prompt Learning for Multimodal Intent Recognition with Modal Alignment Perception
    Chen, Yuzhao
    Zhu, Wenhua
    Yu, Weilun
    Xue, Hongfei
    Fu, Hao
    Lin, Jiali
    Jiang, Dazhi
    COGNITIVE COMPUTATION, 2024, 16 (06) : 3417 - 3428
  • [43] Research on motion pattern recognition of exoskeleton robot based on multimodal machine learning model
    Zheng, Yi
    Song, Qingjun
    Liu, Jixin
    Song, Qinghui
    Yue, Qingchao
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (07): : 1869 - 1877
  • [44] Research on motion pattern recognition of exoskeleton robot based on multimodal machine learning model
    Yi Zheng
    Qingjun Song
    Jixin Liu
    Qinghui Song
    Qingchao Yue
    Neural Computing and Applications, 2020, 32 : 1869 - 1877
  • [45] Research on Music Emotion Recognition Model of Deep Learning Based on Musical Stage Effect
    Huang, Cuiqing
    Zhang, Qiang
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [46] An Emergency Application for Smartphones Based on Rhythm Pattern Recognition
    Niwa, Yoshino
    Inamura, Mayu
    Kaji, Katsuhiko
    ADJUNCT PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING NETWORKING AND SERVICES (MOBIQUITOUS 2016), 2016, : 112 - 117
  • [47] Deep Learning for Advanced Similar Musical Instrument Detection and Recognition
    Dewi, Christine
    Chen, Rung-Ching
    IAENG International Journal of Computer Science, 2022, 49 (03)
  • [48] Deep Learning Based Intelligent Voiceprint Recognition, Positioning, and Perception in Cable Monitoring
    Huo, Yajun
    Sun, Kai
    Du, Juan
    Liu, Jun
    Wang, Yong
    Wang, Chun
    Guo, Liang
    Cheng, Xu
    Duan, Shangxiang
    IEEE ACCESS, 2025, 13 : 44928 - 44935
  • [49] A Survey of Deep Learning-Based Multimodal Emotion Recognition: Speech, Text, and Face
    Lian, Hailun
    Lu, Cheng
    Li, Sunan
    Zhao, Yan
    Tang, Chuangao
    Zong, Yuan
    ENTROPY, 2023, 25 (10)
  • [50] Audio-Video Based Multimodal Emotion Recognition Using SVMs and Deep Learning
    Sun, Bo
    Xu, Qihua
    He, Jun
    Yu, Lejun
    Li, Liandong
    Wei, Qinglan
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 621 - 631