A squeeze and excitation ResNeXt-based deep learning model for Bangla handwritten compound character recognition

被引:19
作者
Khan, Mohammad Meraj [1 ]
Uddin, Mohammad Shorif [2 ]
Parvez, Mohammad Zavid [1 ]
Nahar, Lutfur [2 ]
机构
[1] BRAC Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Jahangirnagar Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
关键词
Bangla compound characters; Handwritten character recognition; OCR; Deep learning; Squeeze and excitation model;
D O I
10.1016/j.jksuci.2021.01.021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In today's digital era, the recognition of handwritten documents is highly demanding due to the widespread applications. Bangla is one of the most used languages in the world, which consists of 50 basic alphabets. Many compound characters exist in Bangla, which is the combination of two or more basic characters. The recognition of Bangla handwritten characters is a challenging task due to their various sizes, sheer, diversity, lots of turns, similarities between alphabets, and different writing patterns. This paper presents a deep CNN (Convolutional Neural Network) model using the SE-ResNeXt. The squeeze and excitation (SE) blocks are added with existing ResNeXt to address the Bangla handwritten compound character recognition. Usually, CNN extracted the spatial features in the lower-layers and complex features in the upper layers. The SE blocks are added to improve the performance of usual deep CNN by automatically fusing the channel-wise spatial information and inter-channel dependencies through squeezing and excitation within local receptive fields, respectively. To validate the performance of the proposed model, we have utilized Mendeley BanglaLekha-Isolated 2 dataset. The experimental results demonstrate that the proposed model exhibits an average accuracy of 99.82% in recognizing Bangla handwritten compound characters. In addition, the proposed model outperforms the state-of-the-art models by demonstrating higher results. (c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:3356 / 3364
页数:9
相关论文
共 38 条
[1]  
Alif MA, 2017, 2017 20TH INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATION TECHNOLOGY (ICCIT)
[2]   Handwritten Bangla Character Recognition Using the State-of-the-Art Deep Convolutional Neural Networks [J].
Alom, Md Zahangir ;
Sidike, Paheding ;
Hasan, Mahmudul ;
Taha, Tarek M. ;
Asari, Vijayan K. .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
[3]   Recognition of Pashto Handwritten Characters Based on Deep Learning [J].
Amin, Muhammad Sadiq ;
Yasir, Siddiqui Muhammad ;
Ahn, Hyunsik .
SENSORS, 2020, 20 (20) :1-23
[4]  
[Anonymous], 2017, IEEE INT C IM VIS PA, DOI DOI 10.1007/S11760-016-1052-9
[5]  
Ashiquzzaman A.K.M., 2017, INT C RES COMP INT C
[6]  
Basri R., 2020, INT C COMP ADV ICCA
[7]  
Bhagyasree P.V., 2019, 2019 1 INT C INN COM, DOI 10.1109/ICIICT1.2019.8741412
[8]  
Chatterjee S., 2019, INT C INT HUM COMP I
[9]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[10]  
Clanuwat T., 2019, INT C DOC AN REC ICD