A deep learning model for Ottoman OCR

Cited by: 5
Authors
Dolek, Ishak [1]
Kurt, Atakan [1]
Affiliations
[1] Istanbul Univ Cerrahpasa, Engn Sch, Comp Engn Dept, Istanbul, Turkey
Keywords
CNN; CTC; deep neural networks; LSTM; OCR; Ottoman; printed naksh font; RNN; NEURAL-NETWORK; RECOGNITION; SEGMENTATION; RETRIEVAL
DOI
10.1002/cpe.6937
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Ottoman OCR remains an open problem: OCR models for Arabic do not perform well on Ottoman text, and models trained specifically on Ottoman documents have not produced satisfactory results either. We present a deep learning model, and an OCR tool built on it, for printed Ottoman documents in the naksh font. We propose an end-to-end trainable CRNN architecture consisting of CNN, RNN (LSTM), and CTC layers. This model, referred to below as the Hybrid model, was compared experimentally with the Tesseract Arabic, Tesseract Persian, ABBYY FineReader, Miletos, and Google Docs OCR tools or models on a test data set of 21 pages of original documents. With character recognition accuracies of 88.86% on raw text, 96.12% on normalized text, and 97.37% on joined text, the Hybrid model outperforms the others by a marked difference. It outperforms the next best model by a clear margin of 4%, a significant improvement considering the difficulty of the Ottoman OCR problem and the huge size of the Ottoman archives to be processed. The Hybrid model also achieves 58% word recognition accuracy on normalized text, the only rate above 50%.
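As an illustration of two ideas named in the abstract, the following minimal Python sketch shows how the per-frame output of a CTC layer is commonly collapsed into a label sequence (best-path decoding), and one common way to score character recognition accuracy from edit distance. The abstract does not state the exact accuracy formula or the decoding strategy used by the authors, so the blank index, the function names, and the definition `1 - edit_distance / len(reference)` are assumptions made for this example only.

```python
from itertools import groupby

BLANK = 0  # assumption: index 0 is the CTC blank label


def ctc_greedy_decode(frame_labels, blank=BLANK):
    """Best-path CTC decoding: merge repeated labels, then drop blanks."""
    return [label for label, _ in groupby(frame_labels) if label != blank]


def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences, computed row by row."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def char_accuracy(ref, hyp):
    """Assumed metric: 1 - edit_distance / len(ref), clipped at 0."""
    if not ref:
        return 1.0 if not hyp else 0.0
    return max(0.0, 1.0 - edit_distance(ref, hyp) / len(ref))
```

Under this assumed definition, a hypothesis with one wrong character out of four reference characters scores 0.75. Scoring "normalized" or "joined" text would apply the same function after whatever text normalization the evaluation defines, which is not specified in the abstract.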
Pages: 17