Context based NLP framework of textual tagging for low resource language

被引:4
作者
Mishra, Atul [1 ]
Shaikh, Soharab Hossain [1 ]
Sanyal, Ratna [2 ]
机构
[1] BML Munjal Univ, Kapriwas, Haryana, India
[2] NIIT Univ, Comp Sci & Engn, Neemrana, Rajasthan, India
关键词
Part of speech; Software framework; HMM ANN; RNN; Information retrieval;
D O I
10.1007/s11042-021-11884-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding the context of any phrase or extracting relationships requires part of speech tagging (POS). This article proposes an RNN-based POS tagger and compares its performance with some of the existing POS tagging methods. We present novel LSTM-based RNN architecture for POS tagging. The study attempts to determine the usefulness of machine learning and deep learning techniques for tagging part-of-speech of words for the low-resource Hindi language, which is an Indo-Aryan language spoken mostly in India. During the experiments, different deep learning architecture (ANN and RNN) and machine learning methods (HMM, SVM, DT) have been used. A multi-representational treebank and an open-source dataset have been used for the performance analysis of the proposed framework. The experimental results in terms of macro-measured variables have shown better results compared to some state-of-the-art methods.
引用
收藏
页码:35655 / 35670
页数:16
相关论文
共 52 条
[1]  
Abdulkareem Mustafa, 2017, Journal of Theoretical and Applied Information Technology, V95, P403
[2]  
Akbik A., 2018, COLING 2018, 27th International Conference on Computational Linguistics, P1638
[3]   An efficient employment of internet of multimedia things in smart and future agriculture [J].
AlZu'bi, Shadi ;
Hawashin, Bilal ;
Mujahed, Muhannad ;
Jararweh, Yaser ;
Gupta, Brij B. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (20) :29581-29605
[4]  
[Anonymous], 2017, P 21 C COMP NAT LANG
[5]  
[Anonymous], 2003, Treebanks: building and using parsed corpora
[6]  
Ball K, 2020, P 2018 C EMP METH NA, P3084, DOI [10.18653/v1/d18-1347, DOI 10.18653/V1/D18-1347]
[7]  
Bandyopadhyay S, 2006, ADV PATTERN RECOGNIT, P384, DOI [10.1142/9789812772381_0065, DOI 10.1142/9789812772381_0065]
[8]  
Baskaran, 2006, P NLPAI MACH LEARN C, P6
[9]  
Bharati A., 2006, DIM 06, P1, DOI DOI 10.1145/1179529.1179531
[10]  
Bhat Irshad, 2018, P 2018 C N AM CHAPTE, V1, P987, DOI [DOI 10.18653/V1/N18-1090, 10.18653/v1/N18-1090]