A deep learning model for Ottoman OCR

被引:5
作者
Dolek, Ishak [1 ]
Kurt, Atakan [1 ]
机构
[1] Istanbul Univ Cerrahpasa, Engn Sch, Comp Engn Dept, Istanbul, Turkey
关键词
CNN; CTC; deep neural networks; LSTM; OCR; Ottoman; printed naksh font; RNN; NEURAL-NETWORK; RECOGNITION; SEGMENTATION; RETRIEVAL;
D O I
10.1002/cpe.6937
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The Ottoman OCR is an open problem because the OCR models for Arabic do not perform well on Ottoman. The models specifically trained with Ottoman documents have not produced satisfactory results either. We present a deep learning model and an OCR tool using that model for the OCR of printed Ottoman documents in the naksh font. We propose an end-to-end trainable CRNN architecture consisting of CNN, RNN (LSTM), and CTC layers for the Ottoman OCR problem. An experimental comparison of this model, called , with the Tesseract Arabic, the Tesseract Persian, Abby Finereader, Miletos, and Google Docs OCR tools or models was performed using a test data set of 21 pages of original documents. With 88.86% raw text, 96.12% normalized text, and 97.37% joined text character recognition accuracy, the Hybrid model outperforms the others with a marked difference. Our model outperforms the next best model by a clear margin of 4% which is a significant improvement considering the difficulty of the Ottoman OCR problem, and the huge size of the Ottoman archives to be processed. The hybrid model also achieves 58% word recognition accuracy on normalized text which is the only rate above 50%.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] An Efficient Deep Learning based Hybrid Model Image Caption Generation for
    Kaur, Mehzabeen
    Kaur, Harpreet
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (03) : 231 - 237
  • [22] Internet of Things attack detection using hybrid Deep Learning Model
    Sahu, Amiya Kumar
    Sharma, Suraj
    Tanveer, M.
    Raja, Rohit
    COMPUTER COMMUNICATIONS, 2021, 176 : 146 - 154
  • [23] Deep Transfer Learning Based on LSTM Model for Reservoir Flood Forecasting
    Zhu, Qiliang
    Wang, Changsheng
    Jin, Wenchao
    Ren, Jianxun
    Yu, Xueting
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2024, 20 (01)
  • [24] A Review on Multiscale-Deep-Learning Applications
    Elizar, Elizar
    Zulkifley, Mohd Asyraf
    Muharar, Rusdha
    Zaman, Mohd Hairi Mohd
    Mustaza, Seri Mastura
    SENSORS, 2022, 22 (19)
  • [25] Review of Text Classification Methods on Deep Learning
    Wu, Hongping
    Liu, Yuling
    Wang, Jingwen
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (03): : 1309 - 1321
  • [26] Deep Learning Model for Global Spatio-Temporal Image Prediction
    Nikezic, Dusan P.
    Ramadani, Uzahir R.
    Radivojevic, Dusan S.
    Lazovic, Ivan M.
    Mirkov, Nikola S.
    MATHEMATICS, 2022, 10 (18)
  • [27] A Novel Deep Learning based IoT Enabled Computerized Vehicle Number Plate Recognition System using OCR Principles
    Bhuvaneswari, J.
    Gireesh, N.
    Srimathy, G.
    Nandigam, Subrahmanyam
    Nanammal, V.
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [28] A Generic OCR Using Deep Siamese Convolution Neural Networks
    Sokar, Ghada
    Hemayed, Elsayed E.
    Rehan, Mohamed
    2018 IEEE 9TH ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), 2018, : 1238 - 1244
  • [29] Deep Learning-based Transfer Learning Model in Diagnosis of Diseases with Brain Magnetic Resonance Imaging
    Chandaran, Suganthe Ravi
    Muthusamy, Geetha
    Sevalaiappan, Latha Rukmani
    Senthilkumaran, Nivetha
    ACTA POLYTECHNICA HUNGARICA, 2022, 19 (05) : 127 - 147
  • [30] BLPnet: A new DNN model and Bengali OCR engine for Automatic Licence Plate
    Onim, Md. Saif Hassan
    Nyeem, Hussain
    Roy, Koushik
    Hasan, Mahmudul
    Ishmam, Abtahi
    Akif, Md. Akiful Hoque
    Ovi, Tareque Bashar
    ARRAY, 2022, 15