Handwriting Recognition in Low-resource Scripts using Adversarial Learning

被引:36
作者
Bhunia, Ayan Kumar [1 ]
Das, Abhirup [2 ]
Bhunia, Ankan Kumar [3 ]
Kishore, Perla Sai Raj [2 ]
Roy, Partha Pratim [4 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Inst Engn & Management, Kolkata, India
[3] Jadavpur Univ, Kolkata, India
[4] Indian Inst Technol Roorkee, Roorkee, Uttar Pradesh, India
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
WORD RECOGNITION;
D O I
10.1109/CVPR.2019.00490
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten Word Recognition and Spotting is a challenging field dealing with handwritten text possessing irregular and complex shapes. The design of deep neural network models makes it necessary to extend training datasets in order to introduce variations and increase the number of samples; word-retrieval is therefore very difficult in low-resource scripts. Much of the existing literature comprises preprocessing strategies which are seldom sufficient to cover all possible variations. We propose an Adversarial Feature Deformation Module (AFDM) that learns ways to elastically warp extracted features in a scalable manner. The AFDM is inserted between intermediate layers and trained alternatively with the original framework, boosting its capability to better learn highly informative features rather than trivial ones. We test our meta-framework, which is built on top of popular word-spotting and word-recognition frameworks and enhanced by AFDM, not only on extensive Latin word datasets but also on sparser Indic scripts. We record results for varying sizes of training data, and observe that our enhanced network generalizes much better in the low-data regime; the overall word-error rates and mAP scores are observed to improve as well.
引用
收藏
页码:4762 / 4771
页数:10
相关论文
共 52 条
[1]   Word Spotting and Recognition with Embedded Attributes [J].
Almazan, Jon ;
Gordo, Albert ;
Fornes, Alicia ;
Valveny, Ernest .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (12) :2552-2566
[2]  
Antoniou A., 2017, ARXIV171104340
[3]   Cross-language framework for word recognition and spotting of Indic scripts [J].
Bhunia, Ayan Kumar ;
Roy, Partha Pratim ;
Mohta, Akash ;
Pal, Umapada .
PATTERN RECOGNITION, 2018, 79 :12-31
[4]   Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention [J].
Bluche, Theodore ;
Louradour, Jerome ;
Messina, Ronaldo .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :1050-1055
[6]  
Ciresan D, 2015, IEEE IJCNN
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]  
Dong H., 2017, UNSUPERVISED IMAGE T
[9]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[10]  
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]