DeepNetDevanagari: a deep learning model for Devanagari ancient character recognition

被引:40
作者
Narang, Sonika Rani [1 ]
Kumar, Munish [2 ]
Jindal, M. K. [3 ]
机构
[1] DAV Coll, Dept Comp Sci, Abohar, Punjab, India
[2] Maharaja Ranjit Singh Punjab Tech Univ, Dept Computat Sci, Bathinda, India
[3] Panjab Univ Reg Ctr, Dept Comp Sci & Applicat, Muktsar, Punjab, India
关键词
Devanagari handwritten character dataset; Devanagari ancient; Deep learning; Deep convolutional neural network; Optical character recognition; NEURAL-NETWORKS; PERFORMANCE; TEXT;
D O I
10.1007/s11042-021-10775-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Devanagari script is the most widely used script in India and other Asian countries. There is a rich collection of ancient Devanagari manuscripts, which is a wealth of knowledge. To make these manuscripts available to people, efforts are being done to digitize these documents. Optical Character Recognition (OCR) plays an important role in recognizing these documents. Convolutional Neural Network (CNN) is a powerful model that is giving very promising results in the field of character recognition, pattern recognition etc. CNN has never been used for the recognition of the Devanagari ancient manuscripts. Our aim in the proposed work is to use the power of CNN for extracting the wealth of knowledge from Devanagari handwritten ancient manuscripts. In addition, we aim is to experiment with various design options like number of layes, stride size, number of filters, kenel size and different functions in various layers and to select the best of these. In this paper, the authors have proposed to use deep learning model as a feature extractor as well as a classifier for the recognition of 33 classes of basic characters of Devanagari ancient manuscripts. A dataset containing 5484 characters has been used for the experimental work. Various experiments show that the accuracy achieved using CNN as a feature extractor is better than other state-of-the-art techniques. The recognition accuracy of 93.73% has been achieved by using the model proposed in this paper for Devanagari ancient character recognition.
引用
收藏
页码:20671 / 20686
页数:16
相关论文
共 44 条
[1]  
Acharya J, 2015, 2015 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), P1, DOI 10.1109/ICCNC.2015.7069284
[2]   A deep network model for paraphrase detection in short text messages [J].
Agarwal, Basant ;
Ramampiaro, Heri ;
Langseth, Helge ;
Ruocco, Massimiliano .
INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (06) :922-937
[3]   Improved Handwritten Digit Recognition Using Convolutional Neural Networks (CNN) [J].
Ahlawat, Savita ;
Choudhary, Amit ;
Nayyar, Anand ;
Singh, Saurabh ;
Yoon, Byungun .
SENSORS, 2020, 20 (12) :1-18
[4]   Machine Learning from Theory to Algorithms: An Overview [J].
Alzubi, Jafar ;
Nayyar, Anand ;
Kumar, Akshi .
SECOND NATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE (NCCI 2018), 2018, 1142
[5]  
[Anonymous], 2018, INT J COMPUTER SCI E, DOI DOI 10.26438/IJCSE/V6I6.909914
[6]   Optical Character Recognition for Sanskrit using Convolution Neural Networks [J].
Avadesh, Meduri ;
Goyal, Navneet .
2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, :447-452
[7]   A survey on optical character recognition for Bangla and Devanagari scripts [J].
Bag, Soumen ;
Harit, Gaurav .
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2013, 38 (01) :133-168
[8]   Hybrid OCR combination approach complemented by a specialized ICR applied on ancient documents [J].
Cecotti, H ;
Belaïd, A .
EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, :1045-1049
[9]  
Ciresan D.C., 2011, P 22 INT JOINT C ART, DOI DOI 10.5591/978-1-57735-516-8/IJCAI11-210
[10]   Recognizing Characters of Ancient Manuscripts [J].
Diem, Markus ;
Sablatnig, Robert .
COMPUTER VISION AND IMAGE ANALYSIS OF ART, 2010, 7531