A Machine Learning Approach to Anaphora Resolution in Nepali Language

被引:0
作者
Senapati, Apurbalal [1 ]
Poudyal, Arun [1 ]
Adhikary, Prithwiraj [1 ]
Kaushar, Sahana [1 ]
Mahajan, Anmol [2 ]
Saha, Baidya Nath [3 ]
机构
[1] Cent Inst Technol CIT, Dept CSE, Kokrajhar, India
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
[3] Concordia Univ Edmonton CUE, Math & Comp Sci, Edmonton, AB, Canada
来源
2020 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2020) | 2020年
关键词
Anaphora Resolution; Nepali Language; Machine learning; Natural Language Processing;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we attempt a machine learning (ML) approach to Anaphora Resolution (AR) system in Nepali language. It is one of the pioneering approaches in anaphora resolution using machine learning in Nepali language, which is a resource-limited language. For this work, we have developed our own data set in the standard format available in this domain. Data has been tagged with the necessary information like Partsof-speech (POS), Named entity, Chunking information, Gender, Number, Person, etc. We divided the data for training and testing purposes in approximately 5:1 ratio and ML classifiers are used for training and testing. Results show encouraging for further progress.
引用
收藏
页码:436 / 441
页数:6
相关论文
共 42 条
[1]  
Adhikari H., 1993, SAMASAAMAYIK NEPALI
[2]  
[Anonymous], 2000, Anaphora: A Cross-Linguistic Study
[3]  
[Anonymous], 1998, Proceedings ofCOLING-ACL
[4]  
[Anonymous], 1983, Two-level morphology: A general computational model for word-form recognition and production
[5]  
[Anonymous], 2005, HLT '05
[6]  
Bagga Amit, 1998, PROC 1 LANGUAGE RESO, P563
[7]  
Bal B. K., 2009, P 7 WORKSHOP ASIAN L
[8]  
Bharati A., 2001, TRLTRC14
[9]  
Bhattarai GR., 2008, NEPAL LINGUIST, V23, P25
[10]  
Brennan S. E., 1987, 25th Annual Meeting of the Association for Computational Linguistics. Proceedings of the Conference, P155