Part-Of-Speech Tagger in Malayalam Using Bi-directional LSTM

被引:0
作者
Rajan, Rajeev [1 ]
Joseph, Anna J. [1 ]
Robin, Elizabeth K. [1 ]
Nishma, Fathima T. K. [1 ]
机构
[1] Coll Engn, Trivandrum, Kerala, India
来源
PROCEEDINGS OF 2020 23RD CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (ORIENTAL-COCOSDA 2020) | 2020年
关键词
POS tagging; Malayalam; NLP; Decision tree; BLSTM; Stochastic process;
D O I
10.1109/o-cocosda50338.2020.9295018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The majority of activities performed by humans are done through language, whether communicated directly or reported using natural language. As technology is increasingly making the methods and platforms on which we communicate ever more accessible, there is a great need to understand the languages we use to communicate. By combining the power of artificial intelligence, computational linguistics and computer science, natural language processing (NLP) helps machines read text by simulating the human ability to understand language. Part-of-speech tagging (POS Tagging) is done as a pre-requisite to simplify a lot of different NLP applications like question answering, speech recognition, machine translation, and so on. Here, we attempt a comparison between part-of-speech taggers in Malayalam using decision tree algorithm and bi-directional long short term memory (BLSTM). The experiments presented in this paper use two corpora, one of 29076 sentences and the other of 500 sentences for performance evaluation. The experiments demonstrate the potential of architectural choice of BLSTM-based tagger over conventional decision tree-based tagging in Malayalam.
引用
收藏
页码:22 / 27
页数:6
相关论文
共 12 条
[1]  
Antony P. J., 2010, Proceedings of the 2010 International Conference on Recent Trends in Information, Telecommunication and Computing (ITC 2010), P339, DOI 10.1109/ITC.2010.86
[2]  
Devadath V. V., 2016, THESIS
[3]  
*GOV IND OFF REG G, 2011, CENSUS INDIA
[4]  
Jesuraj K. Robert, 2013, 3 MALAYALAM LANGUAGE
[5]  
Krishnapriya V, 2014, 2014 First International Conference on Computational Systems and Communications (ICCSC), P370, DOI 10.1109/COMPSC.2014.7032680
[6]  
Manju K, 2009, 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), P709, DOI 10.1109/ARTCom.2009.98
[7]  
Mubarak D. Muhammad Noorul, 2015, International Journal of Computer Science & Information Technology, V7, P121, DOI 10.5121/ijcsit.2015.7509
[8]  
Nair Ravi Sankar S, 2012, LANGUAGE INDIA
[9]  
Plank B, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, P412
[10]  
Rajeev R., 2011, COMPUTER ENG INTELLI, V2, P6