Script identification in handwritten and printed documents using convolutional recurrent connection

被引:0
|
作者
Jindal A. [1 ]
机构
[1] School of Computer Science, UPES, Bidholi, Uttarakhand, Dehradun
关键词
Bayesian optimization; CNN-LSTM; Deep learning; Script identification;
D O I
10.1007/s11042-024-19106-x
中图分类号
学科分类号
摘要
Identification of the script in multi-script handwritten or printed documents is one of the essential component to recognize the text. The script identification module helps Optical Character Recognition (OCR) to digitize the text present in the multi-script handwritten or printed documents. The similarity of characters between two or more scripts create this task tedious. The factors such as noise and writing style creates identification of the script more tedious. The present research work has proposed a deep learning method having a set of optimized convolutional layers followed by recurrently connected layers to identify the script of any word sample present in the handwritten or printed document. The proposed method has two components to extract deep hierarchical features and identify the temporal features. The experiments have been carried out on MDIW-13 and PHDIndic_11 datasets having handwritten and printed documents. The experimental results from the proposed method has improved the performance over existing methods in this regard. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:5549 / 5563
页数:14
相关论文
共 50 条
  • [1] Script identification in printed bilingual documents
    D. Dhanya
    A. G. Ramakrishnan
    Peeta Basa Pati
    Sadhana, 2002, 27 : 73 - 82
  • [2] Script identification in printed bilingual documents
    Dhanya, D
    Ramakrishnan, AG
    Pati, PB
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2002, 27 (1): : 73 - 82
  • [3] Script Identification for Printed and Handwritten Indian Documents: An Empirical Study of Different Feature Classifier Combinations
    Rani, Rajneesh
    Dhir, Renu
    Kakkar, Deepti
    Sharma, Nonita
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2022, 22 (03)
  • [4] Segmentation of Merged Lines and Script Identification in Handwritten Bilingual Documents
    Zinjore, Ranjana S.
    Ramteke, R. J.
    Pathak, Varsha M.
    PROCEEDINGS OF THE 9TH ANNUAL MEETING OF THE FORUM FOR INFORMATION RETRIEVAL EVALUATION (FIRE 2017), 2017, : 29 - 32
  • [5] Statistical comparison of classifiers for script identification from multi-script handwritten documents
    Singh, Pawan Kumar
    Sarkar, Ram
    Das, Nibaran
    Basu, Subhadip
    Nasipuri, Mita
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2014, 1 (02) : 152 - 172
  • [6] Word-Level Script Identification from Handwritten Multi-script Documents
    Singh, Pawan Kumar
    Mondal, Arafat
    Bhowmik, Showmik
    Sarkar, Ram
    Nasipuri, Mita
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 551 - 558
  • [7] Hybrid HMM/BLSTM system for multi-script keyword spotting in printed and handwritten documents with identification stage
    Cheikhrouhou, Ahmed
    Kessentini, Yousri
    Kanoun, Slim
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13): : 9201 - 9215
  • [8] Hybrid HMM/BLSTM system for multi-script keyword spotting in printed and handwritten documents with identification stage
    Ahmed Cheikhrouhou
    Yousri Kessentini
    Slim Kanoun
    Neural Computing and Applications, 2020, 32 : 9201 - 9215
  • [9] A Texture based approach to Word-level Script Identification from Multi-script Handwritten Documents
    Singh, Pawan Kumar
    Khan, Aparajita
    Sarkar, Ram
    Nasipuri, Mita
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 228 - 232
  • [10] Script Identification of Multi-Script Documents: A Survey
    Ubul, Kurban
    Tursun, Gulzira
    Aysa, Alimjan
    Impedovo, Donato
    Pirlo, Giuseppe
    Yibulayin, Tuergen
    IEEE ACCESS, 2017, 5 : 6546 - 6559