Parts of Speech Tagging for Kannada and Hindi Languages using ML and DL models

被引:1
作者
Advaith, V [1 ]
Shivkumar, Anushka [1 ]
Lakshmi, Sowmya B. S. [1 ]
机构
[1] Visvesvaraya Technol Univ, BMS Coll Engn, Dept Machine Learning, Bengaluru, India
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT | 2022年
关键词
Natural Language Processing; Machine Learning; Deep Learning; Part of Speech tagging;
D O I
10.1109/CONECCT55679.2022.9865745
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Part-of-speech (POS) tagging is one of the vital Natural Language Processing (NLP) tasks that entails categorising words in a text (corpus) in accordance with a specific part of the speech, based on the word's context. POS tagging for Indian Languages is not widely explored. Kannada is extremely inflectional and contains one of the most complex and richest collections of linguistic traits. Hence, developing a POS tagger for a resource-poor language such as Kannada is difficult. The morphological complexity of Hindi becomes a challenge despite there having been numerous attempts of building a POS tagger for the language. The proposed work deals with the development of a POS tagger for both Kannada and Hindi by employing Machine Learning (ML) and Deep Learning (DL) algorithms. The results obtained are based on experiments conducted on a corpus consisting of around 3 lakh unique words for Kannada and Hindi combined. The 17 POS tags have been taken from the BIS tag set.
引用
收藏
页数:5
相关论文
共 13 条
[1]  
Agrawal H., 2006, P NLPAI ML CONT WORK
[2]  
Ananth Alaka, 2021, 2021 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER), P57, DOI 10.1109/DISCOVER52564.2021.9663430
[3]  
Antony P. J., 2010, 2010 International Conference on Machine Learning and Cybernetics (ICMLC 2010), P2139, DOI 10.1109/ICMLC.2010.5580488
[4]  
Awasthi P., 2006, P NLP ASS IND NLPAI
[5]  
BR S., 2012, international journal of computer applications, V48, P26
[6]   Text Document Summarization Using POS tagging for Kannada Text Documents [J].
Jayashree, R. ;
Anami, Basavaraj S. ;
Poornima, B. K. .
2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, :423-426
[7]  
Todi KK, 2018, Arxiv, DOI arXiv:1808.03175
[8]  
Manju K, 2009, 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), P709, DOI 10.1109/ARTCom.2009.98
[9]  
Pallavi A. S. P., 2014, P NAT C IND LANG COM
[10]  
Pranckevicius T, 2017, BALT J MOD COMPUT, V5, P221, DOI 10.22364/bjmc.2017.5.2.05