End-to-end optical music recognition for pianoform sheet music

被引:8
作者
Rios-Vila, Antonio [1 ]
Rizo, David [1 ,2 ]
Inesta, Jose M. [1 ]
Calvo-Zaragoza, Jorge [1 ]
机构
[1] Univ Alicante, UI Comp Res, Alicante, Spain
[2] Inst Super Ensenanzas Artist Comun Valenciana ISEA, Alicante, Spain
关键词
Optical music recognition; Polyphonic music scores; GrandStaff; Neural networks; REMOVAL; NETWORK; IMAGE;
D O I
10.1007/s10032-023-00432-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
End-to-end solutions have brought about significant advances in the field of Optical Music Recognition. These approaches directly provide the symbolic representation of a given image of a musical score. Despite this, several documents, such as pianoform musical scores, cannot yet benefit from these solutions since their structural complexity does not allow their effective transcription. This paper presents a neural method whose objective is to transcribe these musical scores in an end-to-end fashion. We also introduce the GrandStaff dataset, which contains 53,882 single-system piano scores in common western modern notation. The sources are encoded in both a standard digital music representation and its adaptation for current transcription technologies. The method proposed in this paper is trained and evaluated using this dataset. The results show that the approach presented is, for the first time, able to effectively transcribe pianoform notation in an end-to-end manner.
引用
收藏
页码:347 / 362
页数:16
相关论文
共 46 条
  • [1] Decoupling music notation to improve end-to-end Optical Music Recognition
    Alfaro-Contreras, Maria
    Rios-Vila, Antonio
    Valero-Mas, Jose J.
    Inesta, Jose M.
    Calvo-Zaragoza, Jorge
    [J]. PATTERN RECOGNITION LETTERS, 2022, 158 : 157 - 163
  • [2] Approaching End-to-End Optical Music Recognition for Homophonic Scores
    Alfaro-Contreras, Maria
    Calvo-Zaragoza, Jorge
    Inesta, Jose M.
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, IBPRIA 2019, PT II, 2019, 11868 : 147 - 158
  • [3] A set of benchmarks for Handwritten Text Recognition on historical documents
    Andreu Sanchez, Joan
    Romero, Veronica
    Toselli, Alejandro H.
    Villegas, Mauricio
    Vidal, Enrique
    [J]. PATTERN RECOGNITION, 2019, 94 : 122 - 134
  • [4] From Optical Music Recognition to Handwritten Music Recognition: A baseline
    Baro, Arnau
    Riba, Pau
    Calvo-Zaragoza, Jorge
    Fornes, Alicia
    [J]. PATTERN RECOGNITION LETTERS, 2019, 123 : 1 - 8
  • [5] Bluche T, 2016, ADV NEUR IN, V29
  • [6] Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention
    Bluche, Theodore
    Louradour, Jerome
    Messina, Ronaldo
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1050 - 1055
  • [7] Towards a Standard Testbed for Optical Music Recognition: Definitions, Metrics, and Page Images
    Byrd, Donald
    Simonsen, Jakob Grue
    [J]. JOURNAL OF NEW MUSIC RESEARCH, 2015, 44 (03) : 169 - 195
  • [8] Calvo-Zaragoza J., 2018, P 19 INT SOC MUS INF, P248
  • [9] Understanding Optical Music Recognition
    Calvo-Zaragoza, Jorge
    Hajic, Jan, Jr.
    Pacha, Alexander
    [J]. ACM COMPUTING SURVEYS, 2020, 53 (04)
  • [10] Handwritten Music Recognition for Mensural notation with convolutional recurrent neural networks
    Calvo-Zaragoza, Jorge
    Toselli, Alejandro H.
    Vidal, Enrique
    [J]. PATTERN RECOGNITION LETTERS, 2019, 128 : 115 - 121