Persian handwritten digit, character and word recognition using deep learning

被引:16
作者
Bonyani, Mahdi [1 ]
Jahangard, Simindokht [2 ]
Daneshmand, Morteza [3 ]
机构
[1] Univ Tabriz, Dept Comp Engn, Tabriz, Iran
[2] Amirkabir Univ Technol, Dept Robot Engn, Tehran, Iran
[3] Univ Tartu, Inst Technol, Tartu, Estonia
关键词
Optical character recognition (OCR); Persian characters and words; Deep neural networks; DenseNet; Xception;
D O I
10.1007/s10032-021-00368-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In spite of various applications of digit, letter and word recognition, only a few studies have dealt with Persian scripts. In this paper, deep neural networks are utilized through different DenseNet and Xception architectures, being further boosted by means of data augmentation and test time augmentation. Dividing the datasets to training, validation and test sets, and utilizing k-fold cross-validation, the comparison of the proposed method with various state-of-the-art alternatives is performed. Three datasets: HODA, Sadri and Iranshahr are used, which offer the most comprehensive collections of samples in terms of handwriting styles and the forms each letter may take depending on its position within a word. On the HODA dataset, we achieve recognition rates of 99.49% and 98.10% for digits and characters, being 99.72%, 89.99% and 98.82% for digits, characters and words from the Sadri dataset, respectively, as well as 98.99% for words from the Iranshahr dataset, each of which outperforms the performances achieved by the most advanced alternative networks, namely ResNet50 and VGG16. An additional contribution of the paper arises from its capability of words recognition as a holistic image classification. This improves the resulting speed and versatility significantly, as it does not require explicit character models, unlike earlier alternatives such as hidden Markov models and convolutional recursive neural networks. In addition, computation times have been compared with alternative state-of-the-art models and better performance has been observed.
引用
收藏
页码:133 / 143
页数:11
相关论文
共 50 条
  • [1] Persian handwritten digit, character and word recognition using deep learning
    Mahdi Bonyani
    Simindokht Jahangard
    Morteza Daneshmand
    International Journal on Document Analysis and Recognition (IJDAR), 2021, 24 : 133 - 143
  • [2] A Roadmap on Handwritten Gujarati Digit Recognition using Machine Learning
    Bharvad, Janardan
    Garg, Dweepna
    Ribadiya, Shivam
    2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [3] Optical Handwritten with Character Recognition
    Zahra, Syeda Binish
    Moaen, Shanza
    Munir, Sundus
    Hassan, Arfa
    Nadeem, Afrozah
    Farooq, Muhammad Sajid
    4TH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING (IC)2, 2021, : 562 - 569
  • [4] Persian Optical Character Recognition Using Deep Bidirectional Long Short-Term Memory
    Khosrobeigi, Zohreh
    Veisi, Hadi
    Hoseinzade, Ehsan
    Shabanian, Hanieh
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [5] An Intelligent Telugu Handwritten Character Recognition Using Multi-Objective Mayfly Optimization with Deep Learning-Based DenseNet Model
    Sonthi, Vijaya Krishna
    Nagarajan, S.
    Krishnaraj, N.
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (03)
  • [6] Handwritten Hindi character recognition: a review
    Yadav, Madhuri
    Purwar, Ravindra Kumar
    Mittal, Mamta
    IET IMAGE PROCESSING, 2018, 12 (11) : 1919 - 1933
  • [7] Handwritten Character Recognition-An Analysis
    Tiwari, Usha
    Jain, Monika
    Mehfuz, Shabana
    ADVANCES IN SYSTEM OPTIMIZATION AND CONTROL, 2019, 509 : 207 - 212
  • [8] Handwritten Character Generation using Y-Autoencoder for Character Recognition Model Training
    Kitagawa, Tomoki
    Leow, Chee Siang
    Nishizaki, Hiromitsu
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 7344 - 7351
  • [9] Character Recognition of Components Mounted on Printed Circuit Board Using Deep Learning
    Gang, Sumyung
    Fabrice, Ndayishimiye
    Chung, Daewon
    Lee, Joonjae
    SENSORS, 2021, 21 (09)
  • [10] CArDIS: A Swedish Historical Handwritten Character and Word Dataset
    Yavariabdi, Amir
    Kusetogullari, Huseyin
    Celik, Turgay
    Thummanapally, Shivani
    Rijwan, Sakib
    Hall, Johan
    IEEE ACCESS, 2022, 10 : 55338 - 55349