Handwritten alphabet classification in Tamil language using convolution neural network

被引:2
作者
Ravi J. [1 ]
机构
[1] Department of Computer Science, SVKM's Mithibai College of Arts, Chauhan Institute of Science, Amrutben Jivanlal College of Commerce And Economics (AUTONOMOUS), Vile Parle(W), Maharashtra, Mumbai
来源
International Journal of Cognitive Computing in Engineering | 2024年 / 5卷
关键词
Convolution neural network; Data augmentation; Deep learning; Tamil character recognition;
D O I
10.1016/j.ijcce.2024.03.001
中图分类号
学科分类号
摘要
Handwritten Alphabet Recognition can be defined as the way of detecting characters from images of Handwritten language alphabets. This is one of the important problems that can be solved by Convolution Neural Networks (CNN). Recent developments in CNN have made it possible to expand this problem area from English character recognition or Numbers recognition to Regional Languages character recognition, there has not been sufficient studies conducted in the domain of regional languages. This study has attempted to give deep learning approach to Tamil Handwritten Alphabets classification. This article aims to develop 3 models of CNN – THAC-CNN1, THAC-CNN2 and THAC-CNN3 to recognize Tamil Handwritten Alphabets and classify them based on its category. Our proposed models use a combination of benchmark dataset and a customized dataset which totals to over 2800 images of different Tamil alphabets after various data augmentation techniques. The proposed models are compared with a popular image classification pre-trained models - VGG-11 and VGG-16. We use the standard classification metric - accuracy to measure the performance of our proposed models. With our dataset and augmentation techniques, one of our models THAC-CNN1 achieves 97% accuracy on the training dataset and 92.5% accuracy on test dataset as opposed to 72% and 73.5% accuracy on training dataset and test dataset by pre-trained models. © 2024 The Author(s)
引用
收藏
页码:132 / 139
页数:7
相关论文
共 29 条
[1]  
Ahlawat S., Choudhary A., International conference on computational intelligence and data science. Hybrid cnn-svm classifier for handwritten digit recognition, Procedia Computer Science, 167, pp. 2554-2560, (2020)
[2]  
Ahmed R.M., Rashid T.A., Fatah P., Alsadoon A., Mirjalili S., An extensive dataset of handwritten central Kurdish isolated characters, Data in Brief, 39, (2021)
[3]  
Akashkumar S., Dyaram A.N., Anand M., Identification of tamil characters using deep learning, Machine learning and autonomous systems, pp. 223-237, (2022)
[4]  
Babitha Lincy R., Rubia J.J., Sherin Shibi C., Kavitha M., An enhanced deep learning model for handwritten tamil character identification, 2023 5th international conference on smart systems and inventive technology (ICSSIT), pp. 1518-1523, (2023)
[5]  
Das S., Imtiaz M.S., Neom N.H., Siddique N., Wang H., A hybrid approach for bangla sign language recognition using deep transfer learning model with random forest classifier, Expert Systems with Applications, 213, (2023)
[6]  
Gnanasivam P., Bharath G., Karthikeyan V., Dhivya V., Handwritten tamil character recognition using convolutional neural network, 2021 sixth international conference on wireless communications, signal processing and networking (WiSPNET), pp. 84-88, (2021)
[7]  
Haque R., Islam N., Tasneem M., Das A.K., Multi-class sentiment classification on Bengali social media comments using machine learning, International Journal of Cognitive Computing in Engineering, 4, pp. 21-35, (2023)
[8]  
Hidayat A.A., Purwandari K., Cenggoro T.W., Pardamean B., 5th international conference on computer science and computational intelligence 2020. A convolutional neural network-based ancient sundanese character classifier with data augmentation, Procedia Computer Science, 179, pp. 195-201, (2021)
[9]  
India H.L., hpl handwriting datasets, (2013)
[10]  
Karthi M., Priscilla R., Jafer K.S., 9th world engineering education forum (WEEF 2019) proceedings: Disruptive engineering education for sustainable development. A novel content detection approach for handwritten English letters, Procedia Computer Science, 172, pp. 1016-1025, (2020)