Musigraph: Optical Music Recognition Through Object Detection and Graph Neural Network

被引:4
作者
Baro, Arnau [1 ,2 ]
Riba, Pau [3 ]
Fornes, Alicia [1 ,2 ]
机构
[1] Univ Autonoma Barcelona, Comp Vis Ctr, Bellaterra, Spain
[2] Univ Autonoma Barcelona, Comp Sci Dept, Bellaterra, Spain
[3] Helsing AI, Berlin, Germany
来源
FRONTIERS IN HANDWRITING RECOGNITION, ICFHR 2022 | 2022年 / 13639卷
关键词
Object detection; Optical music recognition; Graph neural network; LINE;
D O I
10.1007/978-3-031-21648-0_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
During the last decades, the performance of optical music recognition has been increasingly improving. However, and despite the 2dimensional nature of music notation (e.g. notes have rhythm and pitch), most works treat musical scores as a sequence of symbols in one dimension, which make their recognition still a challenge. Thus, in this work we explore the use of graph neural networks for musical score recognition. First, because graphs are suited for n-dimensional representations, and second, because the combination of graphs with deep learning has shown a great performance in similar applications. Our methodology consists of: First, we will detect each isolated/atomic symbols (those that can not be decomposed in more graphical primitives) and the primitives that form a musical symbol. Then, we will build the graph taking as root node the notehead and as leaves those primitives or symbols that modify the note's rhythm (stem, beam, flag) or pitch (flat, sharp, natural). Finally, the graph is translated into a human-readable character sequence for a final transcription and evaluation. Our method has been tested on more than five thousand measures, showing promising results.
引用
收藏
页码:171 / 184
页数:14
相关论文
共 31 条
  • [1] [Anonymous], 2018, P ISMIR
  • [2] Handwritten Historical Music Recognition by Sequence-to-Sequence with Attention Mechanism
    Baro, Arnau
    Badal, Carles
    Fornes, Alicia
    [J]. 2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 205 - 210
  • [3] From Optical Music Recognition to Handwritten Music Recognition: A baseline
    Baro, Arnau
    Riba, Pau
    Calvo-Zaragoza, Jorge
    Fornes, Alicia
    [J]. PATTERN RECOGNITION LETTERS, 2019, 123 : 1 - 8
  • [4] Baró A, 2016, INT CONF FRONT HAND, P465, DOI [10.1109/ICFHR.2016.0092, 10.1109/ICFHR.2016.116]
  • [5] Soft-NMS - Improving Object Detection With One Line of Code
    Bodla, Navaneeth
    Singh, Bharat
    Chellappa, Rama
    Davis, Larry S.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5562 - 5570
  • [6] Bruna J, 2014, Arxiv, DOI arXiv:1312.6203
  • [7] End-to-End Neural Optical Music Recognition of Monophonic Scores
    Calvo-Zaragoza, Jorge
    Rizo, David
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (04):
  • [8] Staff-line detection and removal using a convolutional neural network
    Calvo-Zaragoza, Jorge
    Pertusa, Antonio
    Oncina, Jose
    [J]. MACHINE VISION AND APPLICATIONS, 2017, 28 (5-6) : 665 - 674
  • [9] Couasnon B., 1995, Using a grammar for a reliable full score recognition system
  • [10] Defferrard M, 2016, ADV NEUR IN, V29