Persian handwritten digit, character and word recognition using deep learning

被引:16
作者
Bonyani, Mahdi [1 ]
Jahangard, Simindokht [2 ]
Daneshmand, Morteza [3 ]
机构
[1] Univ Tabriz, Dept Comp Engn, Tabriz, Iran
[2] Amirkabir Univ Technol, Dept Robot Engn, Tehran, Iran
[3] Univ Tartu, Inst Technol, Tartu, Estonia
关键词
Optical character recognition (OCR); Persian characters and words; Deep neural networks; DenseNet; Xception;
D O I
10.1007/s10032-021-00368-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In spite of various applications of digit, letter and word recognition, only a few studies have dealt with Persian scripts. In this paper, deep neural networks are utilized through different DenseNet and Xception architectures, being further boosted by means of data augmentation and test time augmentation. Dividing the datasets to training, validation and test sets, and utilizing k-fold cross-validation, the comparison of the proposed method with various state-of-the-art alternatives is performed. Three datasets: HODA, Sadri and Iranshahr are used, which offer the most comprehensive collections of samples in terms of handwriting styles and the forms each letter may take depending on its position within a word. On the HODA dataset, we achieve recognition rates of 99.49% and 98.10% for digits and characters, being 99.72%, 89.99% and 98.82% for digits, characters and words from the Sadri dataset, respectively, as well as 98.99% for words from the Iranshahr dataset, each of which outperforms the performances achieved by the most advanced alternative networks, namely ResNet50 and VGG16. An additional contribution of the paper arises from its capability of words recognition as a holistic image classification. This improves the resulting speed and versatility significantly, as it does not require explicit character models, unlike earlier alternatives such as hidden Markov models and convolutional recursive neural networks. In addition, computation times have been compared with alternative state-of-the-art models and better performance has been observed.
引用
收藏
页码:133 / 143
页数:11
相关论文
共 50 条
  • [21] TeluguScriptify: A Custom Deep Learning Model for Handwritten Telugu Text Recognition and Tool Development
    S. Thara
    Abhiram Gaddam
    Chandra Siddartha Ramakurthi
    Vara Prasad Basava
    Siddartha Thupakala
    S. Dhanya
    SN Computer Science, 6 (2)
  • [22] Handwritten Bangla Numeral Recognition Using Deep Long Short Term Memory
    Ahmed, Mahtab
    Akhand, M. A. H.
    Rahman, M. M. Hafizur
    2016 6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR THE MUSLIM WORLD (ICT4M), 2016, : 310 - 315
  • [23] Support Vector Machine Based Handwritten Hindi Character Recognition and Summarization
    Dhankhar, Sunil
    Gupta, Mukesh Kumar
    Memon, Fida Hussain
    Bhatia, Surbhi
    Dadheech, Pankaj
    Mashat, Arwa
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 43 (01): : 397 - 412
  • [24] A pragmatic convolutional bagging ensemble learning for recognition of Farsi handwritten digits
    Y. A. Nanehkaran
    Junde Chen
    Soheil Salimi
    Defu Zhang
    The Journal of Supercomputing, 2021, 77 : 13474 - 13493
  • [25] A pragmatic convolutional bagging ensemble learning for recognition of Farsi handwritten digits
    Nanehkaran, Y. A.
    Chen, Junde
    Salimi, Soheil
    Zhang, Defu
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (11) : 13474 - 13493
  • [26] Food State Recognition Using Deep Learning
    Alahmari, Saeed. S. S.
    Salem, Tawfiq
    IEEE ACCESS, 2022, 10 : 130048 - 130057
  • [27] Isolated Word Speech Recognition System Using Deep Neural Networks
    Dhanashri, Dhavale
    Dhonde, S. B.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 1, 2017, 468 : 9 - 17
  • [28] Analysis and comparison of machine learning classifiers and deep neural networks techniques for recognition of Farsi handwritten digits
    Nanehkaran, Y. A.
    Zhang, Defu
    Salimi, S.
    Chen, Junde
    Tian, Yuan
    Al-Nabhan, Najla
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (04) : 3193 - 3222
  • [29] Intelligent Tool For Malayalam Cursive Handwritten Character Recognition Using Artificial Neural Network And Hidden Markov Model
    Kishna, Thulasi N. P.
    Francis, Seenia
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS (ICICI 2017), 2017, : 595 - 598
  • [30] Analysis and comparison of machine learning classifiers and deep neural networks techniques for recognition of Farsi handwritten digits
    Y. A. Nanehkaran
    Defu Zhang
    S. Salimi
    Junde Chen
    Yuan Tian
    Najla Al-Nabhan
    The Journal of Supercomputing, 2021, 77 : 3193 - 3222