Staff-line detection and removal using a convolutional neural network

被引:13
作者
Calvo-Zaragoza, Jorge [1 ]
Pertusa, Antonio [1 ]
Oncina, Jose [1 ]
机构
[1] Univ Alicante, Dept Software & Comp Syst, Carretera San Vicente Raspeig S-N, Alicante 03690, Spain
关键词
Music staff-line removal; Optical music recognition; Pixel classification; Convolutional neural networks; RECOGNITION;
D O I
10.1007/s00138-017-0844-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Staff-line removal is an important preprocessing stage for most optical music recognition systems. Common procedures to solve this task involve image processing techniques. In contrast to these traditional methods based on hand-engineered transformations, the problem can also be approached as a classification task in which each pixel is labeled as either staff or symbol, so that only those that belong to symbols are kept in the image. In order to perform this classification, we propose the use of convolutional neural networks, which have demonstrated an outstanding performance in image retrieval tasks. The initial features of each pixel consist of a square patch from the input image centered at that pixel. The proposed network is trained by using a dataset which contains pairs of scores with and without the staff lines. Our results in both binary and grayscale images show that the proposed technique is very accurate, outperforming both other classifiers and the state-of-the-art strategies considered. In addition, several advantages of the presented methodology with respect to traditional procedures proposed so far are discussed.
引用
收藏
页码:665 / 674
页数:10
相关论文
共 29 条
[1]  
[Anonymous], ARXIV1212 CORR
[2]  
[Anonymous], PROC CVPR IEEE
[3]  
[Anonymous], 2 INT C PERS TECHN
[4]  
[Anonymous], 2005, P INT C MUS INF RETR
[5]  
Bolan Su, 2012, Proceedings of the 10th IAPR International Workshop on Document Analysis Systems (DAS 2012), P160, DOI 10.1109/DAS.2012.16
[6]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[7]   Towards a Standard Testbed for Optical Music Recognition: Definitions, Metrics, and Page Images [J].
Byrd, Donald ;
Simonsen, Jakob Grue .
JOURNAL OF NEW MUSIC RESEARCH, 2015, 44 (03) :169-195
[8]   Music staff removal with supervised pixel classification [J].
Calvo-Zaragoza, Jorge ;
Mico, Luisa ;
Oncina, Jose .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2016, 19 (03) :211-219
[9]   Avoiding staff removal stage in optical music recognition: application to scores written in white mensural notation [J].
Calvo-Zaragoza, Jorge ;
Barbancho, Isabel ;
Tardon, Lorenzo J. ;
Barbancho, Ana M. .
PATTERN ANALYSIS AND APPLICATIONS, 2015, 18 (04) :933-943
[10]   Staff Detection with Stable Paths [J].
Cardoso, Jaime dos Santos ;
Capela, Artur ;
Rebelo, Ana ;
Guedes, Carlos ;
da Costa, Joaquim Pinto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (06) :1134-1139