Attribute CNNs for word spotting in handwritten documents

被引:0
作者
Sebastian Sudholt
Gernot A. Fink
机构
[1] TU Dortmund University,
来源
International Journal on Document Analysis and Recognition (IJDAR) | 2018年 / 21卷
关键词
Attribute CNN; PHOCNet; TPP layer; Word spotting; Deep learning; Handwritten documents; Historical documents;
D O I
暂无
中图分类号
学科分类号
摘要
Word spotting has become a field of strong research interest in document image analysis over the last years. Recently, AttributeSVMs were proposed which predict a binary attribute representation (Almazán et al. in IEEE Trans Pattern Anal Mach Intell 36(12):2552–2566, 2014). At their time, this influential method defined the state of the art in segmentation-based word spotting. In this work, we present an approach for learning attribute representations with convolutional neural networks(CNNs). By taking a probabilistic perspective on training CNNs, we derive two different loss functions for binary and real-valued word string embeddings. In addition, we propose two different CNN architectures, specifically designed for word spotting. These architectures are able to be trained in an end-to-end fashion. In a number of experiments, we investigate the influence of different word string embeddings and optimization strategies. We show our attribute CNNs to achieve state-of-the-art results for segmentation-based word spotting on a large variety of data sets.
引用
收藏
页码:199 / 218
页数:19
相关论文
共 70 条
[1]  
Almazán J(2014)Word spotting and recognition with embedded attributes IEEE Trans. Pattern Anal. Mach. Intell. 36 2552-2566
[2]  
Gordo A(2013)Multivariate Bernoulli distribution Bernoulli 19 1465-1483
[3]  
Fornés A(2012)A few useful things to know about machine learning Commun. ACM 55 78-2159
[4]  
Valveny E(2011)Adaptive subgradient methods for online learning and stochastic optimization J. Mach. Learn. Res. 12 2121-224
[5]  
Dai B(2012)A novel word spotting method based on recurrent neural networks IEEE Trans. Pattern Anal. Mach. Intell. 34 211-332
[6]  
Ding S(2017)A survey of document image word spotting techniques Pattern Recogn. 68 310-168
[7]  
Wahba G(2000)A line-oriented approach to word spotting in handwritten documents Pattern Anal. Appl. 3 154-465
[8]  
Domingos P(2014)Attribute-based classification for zero-shot visual object categorization IEEE Trans. Pattern Anal. Mach. Intell. 36 453-46
[9]  
Duchi J(2002)The IAM-database: an English sentence database for offline handwriting recognition Int. J. Doc. Anal. Recognit. 5 39-1863
[10]  
Hazan E(2010)Permutation tests for studying classifier performance J. Mach. Learn. Res. 11 1833-152