A real-time system for online learning-based visual transcription of piano music

Authors
Mohammad Akbari
Jie Liang
Howard Cheng
Affiliations
[1] Simon Fraser University, School of Engineering Science
[2] University of Lethbridge, Department of Mathematics and Computer Science
Keywords
Music information retrieval; Real-time piano music transcription; Image and video processing; Convolutional neural networks; Support vector machines; Online learning
DOI
Not available
Abstract
Acoustic-based music information retrieval tasks such as automatic music transcription pose challenges that video of the musical performance can help address. This paper presents a new real-time, learning-based system that visually transcribes piano music using CNN-SVM classification of the pressed black and white keys. The whole process is based on visual analysis of the piano keyboard and the pianist's hands and fingers. The system achieves high accuracy, with an average F1 score of 0.95, even under non-ideal camera view, hand coverage, and lighting conditions, and has a low latency (about 20 ms) in real-time music transcription. In addition, a new dataset for visual transcription of piano music is created and made available to researchers in this area. Because not all possible variations of the data are available in advance, an online learning approach is applied to efficiently update the original model as new data are added to the training dataset.
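The online-learning idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a linear SVM updated one sample at a time by stochastic gradient descent on the hinge loss, and the class name `OnlineLinearSVM`, the 2-D toy vectors (standing in for CNN features of a key region), and the learning-rate and regularization values are all hypothetical.

```python
class OnlineLinearSVM:
    """Linear SVM updated incrementally via SGD on the hinge loss.

    Illustrative assumption only; the paper's actual update rule is not
    given in the abstract.
    """

    def __init__(self, n_features, eta=0.1, lam=0.01):
        self.w = [0.0] * n_features  # weight vector
        self.b = 0.0                 # bias term
        self.eta = eta               # learning rate (hypothetical value)
        self.lam = lam               # L2 regularization strength (hypothetical)

    def decision(self, x):
        return sum(wi * xi for wi, xi in zip(self.w, x)) + self.b

    def predict(self, x):
        # +1 = key pressed, -1 = key not pressed
        return 1 if self.decision(x) >= 0 else -1

    def partial_fit(self, x, y):
        """Update the model with a single labeled sample (y is +1 or -1)."""
        margin = y * self.decision(x)
        # Regularization step: shrink the weights slightly on every update.
        self.w = [(1 - self.eta * self.lam) * wi for wi in self.w]
        # Hinge-loss step: only samples inside the margin move the boundary.
        if margin < 1:
            self.w = [wi + self.eta * y * xi for wi, xi in zip(self.w, x)]
            self.b += self.eta * y


# Toy "feature vectors" standing in for CNN features (made-up values).
initial = [((2.0, 1.5), 1), ((-2.0, -1.5), -1),
           ((1.5, 2.0), 1), ((-1.5, -2.0), -1)]

model = OnlineLinearSVM(n_features=2)
for x, y in initial:
    model.partial_fit(x, y)

# A new pattern that was not covered by the initial training data.
new_x, new_y = (3.0, -1.0), -1
before = model.predict(new_x)      # prediction of the original model
model.partial_fit(new_x, new_y)    # update in place, without retraining
after = model.predict(new_x)       # prediction of the updated model
```

In this toy run the original model misclassifies the new sample and the updated model classifies it correctly, while the four initial samples remain correctly classified. A real system would update on batches of new frames and monitor accuracy on held-out data.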
Pages: 25513-25535
Number of pages: 22