Music symbol recognition by a LAG-based combination model

被引:3
|
作者
Na, In Seop [1 ]
Kim, Soo Hyung [1 ]
机构
[1] Chonnam Natl Univ, Sch Elect & Comp Engn, 77 Yongbong Ro, Gwangju 61186, South Korea
基金
新加坡国家研究基金会;
关键词
Optical music recognition; Line adjacency graph; Run length encoding; Graph model; Set model;
D O I
10.1007/s11042-016-4170-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of optical music recognition (OMR) systems work under the assumption that the input image is scanner-based. However, we propose in this paper, camera based OMR system. Camera based OMR has a challengeable work in un-controlled environment such as a light, perspective, curved, transparency distortions and uneven staff-lines which tend to incur more frequently. In addition, the loss in performance of binarization methods, line thickness variation and space variation between lines are inevitable. In order to solve these problems, we propose a novel and effective staff-line removal method based on following three main ideas. First, a state-of-the-art staff-line detection method, Stable Path, is used to extract staff-line skeletons of the music score. Second, a line adjacency graph (LAG) model is exploited in a different manner over segmentation to cluster pixel runs generated from the run-length encoding (RLE) of an music score image. Third, a two-pass staff-line removal pipeline called filament filtering is applied to remove clusters lying on the staff-line. A music symbol is comprised of several parts so-called primitives, but the combination of these parts to form music symbol is unlimited. It causes difficulty applying the state-of-the-art method for music symbol recognition. To overcome these challenges and deal with primitive parts separately, we proposed a combination model which consists of LAG model, Graph model, and Set model as a framework for music symbol recognition. Our method shows impressive results on music score images captured from cameras, and gives high performance when applied to the ICDAR/GREC 2013 database, and a Gamera synthetic database. We have compared to some commercial software and proved the expediency and efficiency of the proposed method.
引用
收藏
页码:25563 / 25579
页数:17
相关论文
共 50 条
  • [21] Recognition of Useful Music for Emotion Enhancement Based on Dimensional Model
    Nawaz, Rab
    Nisar, Humaira
    Yap, Vooi Voon
    2018 2ND INTERNATIONAL CONFERENCE ON BIOSIGNAL ANALYSIS, PROCESSING AND SYSTEMS (ICBAPS 2018), 2018, : 176 - 180
  • [22] A combination of features for symbol-independent writer identification in old music scores
    Fornes, Alicia
    Llados, Josep
    Sanchez, Gemma
    Otazu, Xavier
    Bunke, Horst
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2010, 13 (04) : 243 - 259
  • [23] A combination of features for symbol-independent writer identification in old music scores
    Alicia Fornés
    Josep Lladós
    Gemma Sánchez
    Xavier Otazu
    Horst Bunke
    International Journal on Document Analysis and Recognition (IJDAR), 2010, 13 : 243 - 259
  • [24] Offline music symbol recognition using Daisy feature and quantum Grey wolf optimization based feature selection
    Malakar, Samir
    Ghosh, Manosij
    Chaterjee, Agneet
    Bhowmik, Showmik
    Sarkar, Ram
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) : 32011 - 32036
  • [25] Visual recognition method based on symbol features
    Liu, Xilong
    Qian, Hanbo
    Cao, Zhiqiang
    Tan, Min
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2011, 39 (SUPPL. 2): : 120 - 123
  • [26] Offline music symbol recognition using Daisy feature and quantum Grey wolf optimization based feature selection
    Samir Malakar
    Manosij Ghosh
    Agneet Chaterjee
    Showmik Bhowmik
    Ram Sarkar
    Multimedia Tools and Applications, 2020, 79 : 32011 - 32036
  • [27] Photoacoustic tomography reconstruction using lag-based delay multiply and sum with a coherence factor improves in vivo ovarian cancer diagnosis
    Yang, Guang
    Amidi, Eghbal
    Zhu, Quing
    BIOMEDICAL OPTICS EXPRESS, 2021, 12 (04) : 2250 - 2263
  • [28] Multimodal Music Emotion Recognition Method Based on the Combination of Knowledge Distillation and Transfer Learning
    Tong, Guiying
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [29] Handwritten Symbol Recognition Using Hierarchical Shape Representation Model Based on Shape Signature
    Babu, M. Raja
    Gokaramaiah, T.
    Reddy, A. Vishnuvardhan
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING, 2018, 9 : 293 - 300
  • [30] Domain Adaptation for Handwritten Symbol Recognition: A Case of Study in Old Music Manuscripts
    Mateiu, Tudor N.
    Gallego, Antonio-Javier
    Calvo-Zaragoza, Jorge
    PATTERN RECOGNITION AND IMAGE ANALYSIS, IBPRIA 2019, PT II, 2019, 11868 : 135 - 146