Context based NLP framework of textual tagging for low resource language

被引:4
作者
Mishra, Atul [1 ]
Shaikh, Soharab Hossain [1 ]
Sanyal, Ratna [2 ]
机构
[1] BML Munjal Univ, Kapriwas, Haryana, India
[2] NIIT Univ, Comp Sci & Engn, Neemrana, Rajasthan, India
关键词
Part of speech; Software framework; HMM ANN; RNN; Information retrieval;
D O I
10.1007/s11042-021-11884-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding the context of any phrase or extracting relationships requires part of speech tagging (POS). This article proposes an RNN-based POS tagger and compares its performance with some of the existing POS tagging methods. We present novel LSTM-based RNN architecture for POS tagging. The study attempts to determine the usefulness of machine learning and deep learning techniques for tagging part-of-speech of words for the low-resource Hindi language, which is an Indo-Aryan language spoken mostly in India. During the experiments, different deep learning architecture (ANN and RNN) and machine learning methods (HMM, SVM, DT) have been used. A multi-representational treebank and an open-source dataset have been used for the performance analysis of the proposed framework. The experimental results in terms of macro-measured variables have shown better results compared to some state-of-the-art methods.
引用
收藏
页码:35655 / 35670
页数:16
相关论文
共 52 条
[31]  
LTRC, LANG TECHN RES CTR H
[32]  
Magerman D.M., 1995, P 33 ANN M ASS COMP, P276, DOI DOI 10.3115/981658.981695
[33]  
Mishra P., 2017, P 14 INT C NATURAL L, P50
[34]  
Narayan R., 2014, IFAC Proceedings, V47, P519
[35]   Audio-visual speech recognition using deep learning [J].
Noda, Kuniaki ;
Yamaguchi, Yuki ;
Nakadai, Kazuhiro ;
Okuno, Hiroshi G. ;
Ogata, Tetsuya .
APPLIED INTELLIGENCE, 2015, 42 (04) :722-737
[36]  
Opitz Juri, 2019, arXiv preprint arXiv:1911.03347
[37]  
Palmer M., 2009, 7 INT C NATURAL LANG, P14
[38]  
Patel, 2021, ARXIV210106949
[39]  
Plank B, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, P412
[40]   Multilingual POS tagging by a composite deep architecture based on character-level features and on-the-fly enriched Word Embeddings [J].
Pota, Marco ;
Marulli, Fiammetta ;
Esposito, Massimo ;
De Pietro, Giuseppe ;
Fujita, Hamido .
KNOWLEDGE-BASED SYSTEMS, 2019, 164 :309-323