Staff-line removal with selectional auto-encoders

被引:25
作者
Gallego, Antonio-Javier [1 ]
Calvo-Zaragoza, Jorge [1 ]
机构
[1] Univ Alicante, Dept Software & Computing Syst, Carretera San Vicente Raspeig S-N, Alicante 03690, Spain
关键词
Staff-line removal; Optical music recognition; Auto-encoders; Convolutional networks; MUSIC; RECOGNITION;
D O I
10.1016/j.eswa.2017.07.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Staff-line removal is, an important preprocessing stage as regards most Optical Music Recognition systems. The common procedures employed to carry out this task involve image processing techniques. In contrast to these traditional methods, which are based on hand-engineered transformations, the problem can also be approached from a machine learning point of view if representative examples of the task are provided. We propose doing this through the use of a new approach involving auto-encoders, which select the appropriate features of an input feature set (Selectional Auto-Encoders). Within the context of the problem at hand, the model is trained to select those pixels of a given image that belong to a musical symbol, thus removing the lines of the staves. Our results show that the proposed technique is quite competitive and significantly outperforms the other state-of-art strategies considered, particularly when dealing with grayscale input images. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:138 / 148
页数:11
相关论文
共 38 条
[1]  
[Anonymous], STRUCTURED DOCUMENT
[2]  
[Anonymous], 2013, P 27 ANN C NEUR INF, DOI DOI 10.48550/ARXIV.1305.6663
[3]  
[Anonymous], 2 INT C PERS TECHN
[4]  
[Anonymous], P INT C DOC AN REC
[5]  
Bainbridge D., 1997, Sixth International Conference on Image Processing and its Applications (Conf. Publ. No.443), P756, DOI 10.1049/cp:19970997
[6]  
Bolan Su, 2012, Proceedings of the 10th IAPR International Workshop on Document Analysis Systems (DAS 2012), P160, DOI 10.1109/DAS.2012.16
[7]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[8]   Music staff removal with supervised pixel classification [J].
Calvo-Zaragoza, Jorge ;
Mico, Luisa ;
Oncina, Jose .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2016, 19 (03) :211-219
[9]   Avoiding staff removal stage in optical music recognition: application to scores written in white mensural notation [J].
Calvo-Zaragoza, Jorge ;
Barbancho, Isabel ;
Tardon, Lorenzo J. ;
Barbancho, Ana M. .
PATTERN ANALYSIS AND APPLICATIONS, 2015, 18 (04) :933-943
[10]   Staff Detection with Stable Paths [J].
Cardoso, Jaime dos Santos ;
Capela, Artur ;
Rebelo, Ana ;
Guedes, Carlos ;
da Costa, Joaquim Pinto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (06) :1134-1139