Music symbol recognition by a LAG-based combination model

被引：3

作者：

Na, In Seop ^{[1
]}

Kim, Soo Hyung ^{[1
]}

机构：

[1] Chonnam Natl Univ, Sch Elect & Comp Engn, 77 Yongbong Ro, Gwangju 61186, South Korea

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2017年 / 76卷 / 24期

基金：

新加坡国家研究基金会;

关键词：

Optical music recognition; Line adjacency graph; Run length encoding; Graph model; Set model;

D O I：

10.1007/s11042-016-4170-z

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Most of optical music recognition (OMR) systems work under the assumption that the input image is scanner-based. However, we propose in this paper, camera based OMR system. Camera based OMR has a challengeable work in un-controlled environment such as a light, perspective, curved, transparency distortions and uneven staff-lines which tend to incur more frequently. In addition, the loss in performance of binarization methods, line thickness variation and space variation between lines are inevitable. In order to solve these problems, we propose a novel and effective staff-line removal method based on following three main ideas. First, a state-of-the-art staff-line detection method, Stable Path, is used to extract staff-line skeletons of the music score. Second, a line adjacency graph (LAG) model is exploited in a different manner over segmentation to cluster pixel runs generated from the run-length encoding (RLE) of an music score image. Third, a two-pass staff-line removal pipeline called filament filtering is applied to remove clusters lying on the staff-line. A music symbol is comprised of several parts so-called primitives, but the combination of these parts to form music symbol is unlimited. It causes difficulty applying the state-of-the-art method for music symbol recognition. To overcome these challenges and deal with primitive parts separately, we proposed a combination model which consists of LAG model, Graph model, and Set model as a framework for music symbol recognition. Our method shows impressive results on music score images captured from cameras, and gives high performance when applied to the ICDAR/GREC 2013 database, and a Gamera synthetic database. We have compared to some commercial software and proved the expediency and efficiency of the proposed method.

引用

页码：25563 / 25579

页数：17

共 50 条

[21] Recognition of Useful Music for Emotion Enhancement Based on Dimensional Model
Nawaz, Rab
Nisar, Humaira
Yap, Vooi Voon
2018 2ND INTERNATIONAL CONFERENCE ON BIOSIGNAL ANALYSIS, PROCESSING AND SYSTEMS (ICBAPS 2018), 2018, : 176 - 180
[22] A combination of features for symbol-independent writer identification in old music scores
Fornes, Alicia
Llados, Josep
Sanchez, Gemma
Otazu, Xavier
Bunke, Horst
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2010, 13 (04) : 243 - 259
[23] A combination of features for symbol-independent writer identification in old music scores
Alicia Fornés
Josep Lladós
Gemma Sánchez
Xavier Otazu
Horst Bunke
International Journal on Document Analysis and Recognition (IJDAR), 2010, 13 : 243 - 259
[24] Offline music symbol recognition using Daisy feature and quantum Grey wolf optimization based feature selection
Malakar, Samir
Ghosh, Manosij
Chaterjee, Agneet
Bhowmik, Showmik
Sarkar, Ram
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) : 32011 - 32036
[25] Visual recognition method based on symbol features
Liu, Xilong
Qian, Hanbo
Cao, Zhiqiang
Tan, Min
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2011, 39 (SUPPL. 2): : 120 - 123
[26] Offline music symbol recognition using Daisy feature and quantum Grey wolf optimization based feature selection
Samir Malakar
Manosij Ghosh
Agneet Chaterjee
Showmik Bhowmik
Ram Sarkar
Multimedia Tools and Applications, 2020, 79 : 32011 - 32036
[27] Photoacoustic tomography reconstruction using lag-based delay multiply and sum with a coherence factor improves in vivo ovarian cancer diagnosis
Yang, Guang
Amidi, Eghbal
Zhu, Quing
BIOMEDICAL OPTICS EXPRESS, 2021, 12 (04) : 2250 - 2263
[28] Multimodal Music Emotion Recognition Method Based on the Combination of Knowledge Distillation and Transfer Learning
Tong, Guiying
SCIENTIFIC PROGRAMMING, 2022, 2022
[29] Handwritten Symbol Recognition Using Hierarchical Shape Representation Model Based on Shape Signature
Babu, M. Raja
Gokaramaiah, T.
Reddy, A. Vishnuvardhan
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA ENGINEERING, 2018, 9 : 293 - 300
[30] Domain Adaptation for Handwritten Symbol Recognition: A Case of Study in Old Music Manuscripts
Mateiu, Tudor N.
Gallego, Antonio-Javier
Calvo-Zaragoza, Jorge
PATTERN RECOGNITION AND IMAGE ANALYSIS, IBPRIA 2019, PT II, 2019, 11868 : 135 - 146

← 1 2 3 4 5 →