Bagging: An Ensemble Approach for Recognition of Handwritten Place Names in Gurumukhi Script

被引:2
作者
Kaur, Harmandeep [1 ]
Kumar, Munish [2 ]
Gupta, Aastha [3 ]
Sachdeva, Monika [4 ]
Mittal, Ajay [5 ]
Kumar, Krishan [5 ]
机构
[1] Akal Univ, Dept Comp Sci & Engn, Bathinda, Punjab, India
[2] Maharaja Ranjit Singh Punjab Tech Univ, Dept Computat Sci, Bathinda, Punjab, India
[3] PEC Univ Technol, Dept Appl Sci, Chandigarh, India
[4] IKG Punjab Tech Univ, Mohali Campus 2, Mohali, India
[5] Panjab Univ, Univ Inst Engn & Technol, Chandigarh, India
关键词
Postal automation; Gurumukhi words; place names; feature extraction; feature selection; classification; Bagging; WORD RECOGNITION; HOLISTIC APPROACH;
D O I
10.1145/3593024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, the authors present an effort to recognize handwritten Gurumukhi place names for use in postal automation. Five feature extraction techniques (zoning, horizontal peak extent, vertical peak extent, diagonal, and centroid) have been analyzed and optimized using Principal Component Analysis (PCA). Four classification methods (k-Nearest Neighbor (k-NN), decision tree, random forest, and Convolutional Neural Network (CNN)) have been utilized to classify the handwritten word images. To enhance the recognition results, the authors have employed Bootstrap Aggregation (Bagging) with a majority voting scheme. The authors used a public benchmark dataset of 40,000 handwritten place-name samples in the Punjabi language for their experimental work. The experiments were conducted using a 70:30 partitioning approach, where 70% of the data was utilized for training and the remaining 30% for testing. The system achieved a maximum recognition accuracy of 96.98% by utilizing a combination of zoning, vertical peak extent, and diagonal features, and a minimum Mean Squared Error (MSE) of 0.86% based on a combination of zoning and horizontal peak extent features with a majority voting scheme through ensemble (Bagging) methodology.
引用
收藏
页数:25
相关论文
共 54 条
[1]  
[Anonymous], 2019, ICLR, DOI DOI 10.1080/09593985.2019.1709234
[2]   Bangla Handwritten City Name Recognition Using Gradient-Based Feature [J].
Barua, Shilpi ;
Malakar, Samir ;
Bhowmik, Showmik ;
Sarkar, Ram ;
Nasipuri, Mita .
PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS, FICTA 2016, VOL 1, 2017, 515 :343-352
[3]   Off-line Bangla handwritten word recognition: a holistic approach [J].
Bhowmik, Showmik ;
Malakar, Samir ;
Sarkar, Ram ;
Basu, Subhadip ;
Kundu, Mahantapas ;
Nasipuri, Mita .
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (10) :5783-5798
[4]   Handwritten Bangla Word Recognition using Elliptical Features [J].
Bhowmik, Showmik ;
Malakar, Samir ;
Sarkar, Ram ;
Nasipuri, Mita .
2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, :257-261
[5]   Cross-language framework for word recognition and spotting of Indic scripts [J].
Bhunia, Ayan Kumar ;
Roy, Partha Pratim ;
Mohta, Akash ;
Pal, Umapada .
PATTERN RECOGNITION, 2018, 79 :12-31
[6]   Feature design for offline Arabic handwriting recognition: handcrafted vs automated? [J].
Chherawala, Youssouf ;
Roy, Partha Pratim ;
Cheriet, Mohamed .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :290-294
[7]   Ensemble classifier-based off-line handwritten word recognition system in holistic approach [J].
Das Gupta, Jija ;
Samanta, Soumitra ;
Chanda, Bhabatosh .
IET IMAGE PROCESSING, 2018, 12 (08) :1467-1474
[8]   A holistic approach for Off-line handwritten cursive word recognition using directional feature based on Arnold transform [J].
Dasgupta, Jija ;
Bhattacharya, Kallol ;
Chanda, Bhabatosh .
PATTERN RECOGNITION LETTERS, 2016, 79 :73-79
[9]   Handwritten Farsi (Arabic) word recognition: a holistic approach using discrete HMM [J].
Dehghan, M ;
Faez, K ;
Ahmadi, M ;
Shridhar, M .
PATTERN RECOGNITION, 2001, 34 (05) :1057-1065
[10]  
google, US