Real-Time Piano Music Transcription Based on Computer Vision

Cited by: 17
Authors
Akbari, Mohammad [1]
Cheng, Howard [1]
Affiliations
[1] Univ Lethbridge, Dept Math & Comp Sci, Lethbridge, AB T1K 3M4, Canada
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
Keywords
Automatic music transcription; claVision; computer vision; multipitch estimation; piano
DOI
10.1109/TMM.2015.2473702
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
One important problem in musical information retrieval is automatic music transcription, the automated conversion of played music into a symbolic notation such as a MIDI file. Since the accuracy of previous audio-based transcription systems is not satisfactory, we propose an innovative computer vision-based automatic music transcription system named claVision to perform piano music transcription. Instead of processing the music audio, the system performs the transcription only from the video of the performance, captured by a camera mounted over the piano keyboard. In this paper, we describe the architecture and the algorithms used in claVision. The claVision system achieves high accuracy (F1 score over 0.95) and very low latency (about 7.0 ms) in real-time music transcription, even under different illumination conditions. This technology can also be used for other musical keyboard instruments.
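The abstract reports accuracy as an F1 score, the harmonic mean of precision and recall. The following is a minimal sketch of how such a score could be computed for a transcription, assuming each note event is represented as a (MIDI pitch, frame index) pair and that a detection counts only on an exact match. The function name `f1_score`, the event representation, and the sample data are all hypothetical illustrations; this is not the evaluation code or the matching rule used in the paper.

```python
def f1_score(reference, detected):
    """Return (precision, recall, F1) for two collections of note events.

    Assumption: an event is any hashable value, here (midi_pitch, frame_index),
    and a detection is correct only if it matches a reference event exactly.
    """
    ref, det = set(reference), set(detected)
    true_pos = len(ref & det)  # events present in both sets
    precision = true_pos / len(det) if det else 0.0
    recall = true_pos / len(ref) if ref else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical data, not taken from the paper: a C major chord, then one note.
reference = [(60, 0), (64, 0), (67, 0), (72, 5)]
detected = [(60, 0), (64, 0), (67, 0), (71, 5)]  # last pitch is off by a semitone

precision, recall, f1 = f1_score(reference, detected)
print(f"precision={precision:.2f} recall={recall:.2f} F1={f1:.2f}")
```

With three of four detected events correct and three of four reference events found, precision, recall, and F1 all come out to 0.75 in this toy case. A practical evaluation would usually allow a timing tolerance rather than requiring exact frame matches.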
Pages: 2113-2121
Page count: 9