Hybrid hidden Markov models and artificial neural networks for handwritten music recognition in mensural notation

Cited by: 13
Authors
Calvo-Zaragoza, Jorge [1]
Toselli, Alejandro H. [1]
Vidal, Enrique [1]
Affiliations
[1] Univ Politecn Valencia, PRHLT Res Ctr, Valencia, Spain
Funding
EU Horizon 2020;
Keywords
Handwritten music recognition; Mensural notation; Hidden Markov models; Artificial neural networks; N-gram Language Models; REMOVAL;
DOI
10.1007/s10044-019-00807-1
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
In this paper, we present a hybrid approach using hidden Markov models (HMMs) and artificial neural networks to deal with the task of handwritten music recognition in mensural notation. Previous works have shown that the task can be addressed with Gaussian density HMMs that can be trained and used in an end-to-end manner, that is, without prior segmentation of the symbols. However, the results achieved using that approach are not sufficiently accurate to be useful in practice. In this work, we hybridize HMMs with deep multilayer perceptrons (MLPs), which lead to remarkable improvements in optical symbol modeling. Moreover, this hybrid architecture maintains important advantages of HMMs, such as the ability to properly model variable-length symbol sequences through segmentation-free training, and the simplicity and robustness of combining optical models with N-gram language models, which provide statistical a priori information about regularities in musical symbol concatenation observed in the training data. The results obtained with the proposed hybrid MLP-HMM approach outperform previous works by a wide margin, achieving symbol-level error rates around 26%, as compared with about 40% reported in previous works.
Pages: 1573-1584
Page count: 12