TeluguScriptify: A Custom Deep Learning Model for Handwritten Telugu Text Recognition and Tool Development

被引:0
作者
S. Thara [1 ]
Abhiram Gaddam [2 ]
Chandra Siddartha Ramakurthi [3 ]
Vara Prasad Basava [1 ]
Siddartha Thupakala [4 ]
S. Dhanya [1 ]
机构
[1] Department of Computer Science and Engineering, Amrita School of Computing, Amrita Vishwa Vidyapeetham, Amritapuri
[2] Department of Computer Science and Engineering, Georgia State University, Atlanta, GA
[3] Accenture, Pune
[4] Master’s in Business Analytics and Project Management, University of Connecticut, Hartford, CT
关键词
Alex-Net; Convolutional blocks; Cosine similarity; Dense-Net; Telugu;
D O I
10.1007/s42979-025-03677-z
中图分类号
学科分类号
摘要
This paper proposes a new method for transforming handwritten Telugu text into editable electronic documents. Handwritten documents in the Telugu language are of very high historical value. There is, however, very high difficulty in recognizing the various types of style of handwriting since there are no very successful Telugu recognitions to compare with efficiency in comparison with Chinese and Arabic. To solve this problem, the recognition for Telugu script was designed with a deep learning custom model, along with a novel cosine similarity-based pre-processing technique to further enhance the model’s performance. The model architecture contains 3 convolutional blocks, 3 max-pooling layers, 4 dropout layers, 2 skip layers, and 2 dense layers. The experimental results show that the custom deep learning model is the frontrunner, achieving 97.3% testing accuracy with a corresponding validation loss of 0.116, the model beats well-known models such as Dense-Net and Alex-Net. The models were trained and validated on 52 Telugu characters from ‘ (Figure presented.) ’ (aa) to ‘ (Figure presented.) ’ (bandira). The state-of-the-art object detection model, YOLO v8n (8.2 nano) model, is used to detect the text regions in handwritten documents. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2025.
引用
收藏
相关论文
共 18 条
  • [1] Cheekati B.M., Rajeti R.S., Telugu handwritten character recognition using deep residual learning, 2020 4th international conference on I-SMAC (IoT in social, mobile, analytics and cloud) (I-SMAC). 2020. p, pp. 788-796
  • [2] Soujanya B., Chittineni S., Sitamahalakshmi T., Srinivas G., A cnn based approach for handwritten character identification of telugu guninthalu using various optimizers, Int J Adv Comput Sci Appl, (2022)
  • [3] Ansari A., Kaur B., Rakhra M., Singh A., Singh D., Handwritten text recognition using deep learning algorithms, pp. 1-6, (2022)
  • [4] Angadi A., Vatsavayi V.K., Gorripati S.K., A deep learning approach to recognize handwritten Telugu character using convolution neural networks, Int J Inf Syst Manag Sci. 2018. Available at SSRN
  • [5] Ashlin Deepa R.N., Sai Jayanth Kalluri G., Mohammed Z., Sai Sushank M.P., Raju V., Negi A., A novel approach to recognize handwritten Telugu words using character level cnn, 4th international conference for emerging technology (INCET). 2023. p, pp. 1-5, (2023)
  • [6] Chauhan V.K., Singh S., Sharma A., Hcr-net: a deep learning based script independent handwritten character recognition network, Multimed Tools Appl, 83, 32, pp. 78433-78467, (2024)
  • [7] Lecun Y., Bottou L., Bengio Y., Haffner P., Gradient-based learning applied to document recognition, Proc IEEE, 86, 11, pp. 2278-2324, (1998)
  • [8] Samala S., Prasad C.R., Kollem S., Rao P.R., Yalabaka S., Moola R., Handwritten Telugu vowel character classification using modified 25-layer alexnet with transfer learning, international conference on smart generation computing, communication and networking (SMART GENCON). 2022. p, pp. 1-4, (2022)
  • [9] Vijaya Lakshmi T.R., Sastry P.N., Rajinikanth T.V., A novel 3d approach to recognize Telugu palm leaf text, Eng Sci Technol Int J, 20, 1, pp. 143-150, (2017)
  • [10] Mathew M., Jawahar C., An Empirical Study of CTC Based Models for OCR of Indian Languages, (2022)