Optical Music Recognition by Recurrent Neural Networks

被引:3
作者
Baro, Arnau [1 ]
Riba, Pau [1 ]
Calvo-Zaragoza, Jorge [2 ]
Fornes, Alicia [1 ]
机构
[1] Univ Autonoma Barcelona, Comp Vis Ctr, Comp Sci Dept, Bellaterra, Catalonia, Spain
[2] McGill Univ, Schulich Sch Mus, Montreal, PQ, Canada
来源
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 2 | 2017年
关键词
Optical Music Recognition; Recurrent Neural Network; Long Short-Term Memory;
D O I
10.1109/ICDAR.2017.260
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical Music Recognition is the task of transcribing a music score into a machine readable format. Many music scores are written in a single staff, and therefore, they could be treated as a sequence. Therefore, this work explores the use of Long Short Term Memory (LSTM) Recurrent Neural Networks for reading the music score sequentially, where the LSTM helps in keeping the context. For training, we have used a synthetic dataset of more than 40000 images, labeled at primitive level.
引用
收藏
页码:25 / 26
页数:2
相关论文
共 50 条
  • [21] A new optical music recognition system based on combined neural network
    Wen, Cuihong
    Rebelo, Ana
    Zhang, Jing
    Cardoso, Jaime
    PATTERN RECOGNITION LETTERS, 2015, 58 : 1 - 7
  • [22] End-to-End Neural Optical Music Recognition of Monophonic Scores
    Calvo-Zaragoza, Jorge
    Rizo, David
    APPLIED SCIENCES-BASEL, 2018, 8 (04):
  • [23] A Novel FPGA-Based Intent Recognition System Utilizing Deep Recurrent Neural Networks
    Tsantikidou, Kyriaki
    Tampouratzis, Nikolaos
    Papaefstathiou, Ioannis
    ELECTRONICS, 2021, 10 (20)
  • [24] Handwriting Recognition with Large Multidimensional Long Short-Term Memory Recurrent Neural Networks
    Voigtlaender, Paul
    Doetsch, Patrick
    Ney, Hermann
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 228 - 233
  • [25] Biomedical Named Entity Recognition Based on Extended Recurrent Neural Networks
    Li, Lishuang
    Jin, Liuke
    Jiang, Zhenchao
    Song, Dingxin
    Huang, Degen
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 649 - 652
  • [26] Audio Visual Speech Recognition Using Deep Recurrent Neural Networks
    Thanda, Abhinav
    Venkatesan, Shankar M.
    MULTIMODAL PATTERN RECOGNITION OF SOCIAL SIGNALS IN HUMAN-COMPUTER-INTERACTION, MPRSS 2016, 2017, 10183 : 98 - 109
  • [27] JOINT SPEAKER DIARIZATION AND RECOGNITION USING CONVOLUTIONAL AND RECURRENT NEURAL NETWORKS
    Zhou, Zhihan
    Zhang, Yichi
    Duan, Zhiyao
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2496 - 2500
  • [28] SPEECH RECOGNITION WITH PREDICTION-ADAPTATION-CORRECTION RECURRENT NEURAL NETWORKS
    Zhang, Yu
    Yu, Dong
    Seltzer, Michael L.
    Droppo, Jasha
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5004 - 5008
  • [29] Exploiting the Two-Dimensional Nature of Agnostic Music Notation for Neural Optical Music Recognition
    Alfaro-Contreras, Maria
    Valero-Mas, Jose J.
    APPLIED SCIENCES-BASEL, 2021, 11 (08):
  • [30] From Optical Music Recognition to Handwritten Music Recognition: A baseline
    Baro, Arnau
    Riba, Pau
    Calvo-Zaragoza, Jorge
    Fornes, Alicia
    PATTERN RECOGNITION LETTERS, 2019, 123 : 1 - 8