Conv-transformer architecture for unconstrained off-line Urdu handwriting recognition

被引:7
|
作者
Riaz, Nauman [1 ,2 ]
Arbab, Haziq [1 ]
Maqsood, Arooba [1 ,2 ]
Nasir, Khuzaeymah [1 ]
Ul-Hasan, Adnan [2 ]
Shafait, Faisal [1 ,2 ]
机构
[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad, Pakistan
[2] Natl Ctr Artificial Intelligence NCAI, Deep Learning Lab, Islamabad, Pakistan
关键词
Urdu handwriting recognition; Urdu language; OCR; Transformer; Beam search;
D O I
10.1007/s10032-022-00416-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unconstrained off-line handwriting text recognition in general and for Arabic-like scripts in particular is a challenging task and is still an active research area. Transformer-based models for English handwriting recognition have recently shown promising results. In this paper, we have explored the use of transformer architecture for Urdu handwriting recognition. The use of a convolution neural network before a Vanilla full transformer and using Urdu printed text-lines along with handwritten text lines during the training are the highlights of the proposed work. The convolution layers act to reduce the spatial resolutions and compensate for the n(2) complexity of transformer multi-head attention layers. Moreover, the printed text images in the training phase help the model in learning a greater number of ligatures (a prominent feature of Arabic-like scripts) and a better language model. Our model achieved state-of-the-art accuracy (CER of 5.31%) on publicly available NUST-UHWR dataset (Zia et al. in Neural Comput Appl 34:1-14, 2021).
引用
收藏
页码:373 / 384
页数:12
相关论文
共 50 条
  • [1] Conv-transformer architecture for unconstrained off-line Urdu handwriting recognition
    Nauman Riaz
    Haziq Arbab
    Arooba Maqsood
    Khuzaeymah Nasir
    Adnan Ul-Hasan
    Faisal Shafait
    International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 373 - 384
  • [2] A New Large Urdu Database for Off-Line Handwriting Recognition
    Sagheer, Malik Wagas
    He, Chun Lei
    Nobile, Nicola
    Suen, Ching Y.
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2009, PROCEEDINGS, 2009, 5716 : 538 - 546
  • [3] A convolutional recursive deep architecture for unconstrained Urdu handwriting recognition
    Zia, Noor ul Sehr
    Naeem, Muhammad Ferjad
    Raza, Syed Muhammad Kumail
    Khan, Muhammad Mubasher
    Ul-Hasan, Adnan
    Shafait, Faisal
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (02): : 1635 - 1648
  • [4] A convolutional recursive deep architecture for unconstrained Urdu handwriting recognition
    Noor ul Sehr Zia
    Muhammad Ferjad Naeem
    Syed Muhammad Kumail Raza
    Muhammad Mubasher Khan
    Adnan Ul-Hasan
    Faisal Shafait
    Neural Computing and Applications, 2022, 34 : 1635 - 1648
  • [5] An off-line handwritten Thai alphabet recognition system for unconstrained handwriting styles
    Tanprasert, T
    Jitwongtrakul, K
    Taechotanon, S
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2001, 9 (06) : 791 - 801
  • [6] An off-line handwritten Thai alphabet recognition system for unconstrained handwriting styles
    Tanprasert, Thitipong
    Jitwongtrakul, Kamolchol
    Taechotanon, Sarachai
    2001, World Scientific Publishing Co. Pte. Ltd (09):
  • [7] Recognition of off-line cursive handwriting
    Abuhaiba, ISI
    Holt, MJJ
    Datta, S
    COMPUTER VISION AND IMAGE UNDERSTANDING, 1998, 71 (01) : 19 - 38
  • [8] Imbalanced Conditional Conv-Transformer for Mathematical Expression Recognition
    Ji, Shuaijian
    Zhou, Zhaokun
    Wang, Yuqing
    Duan, Baishan
    Weng, Zhenyu
    Xu, Liang
    Zhu, Yuesheng
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 446 - 458
  • [9] Off-line unconstrained handwritten word recognition
    Cai, JH
    Liu, ZQ
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2000, 14 (03) : 259 - 280
  • [10] Off-line unconstrained handwritten word recognition
    Cai, JH
    Liu, ZQ
    ANZIIS 96 - 1996 AUSTRALIAN NEW ZEALAND CONFERENCE ON INTELLIGENT INFORMATION SYSTEMS, PROCEEDINGS, 1996, : 199 - 202