Towards Word Sense Disambiguation for Latvian

Cited by: 1
Authors
Paikens, Peteris [1]
Rituma, Laura [1]
Pretkalnina, Lauma [1]
Affiliations
[1] Univ Latvia, Inst Math & Comp Sci, Riga, Latvia
Source
Baltic Journal of Modern Computing | 2022 / Vol. 10 / No. 3
Keywords
word sense disambiguation; Latvian; semantics
DOI
10.22364/bjmc.2022.10.3.13
Chinese Library Classification
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
The goal of this paper is to describe the current state of word sense disambiguation (WSD) for Latvian, reviewing the available data and potential problems, and to describe the exploration of WSD methods using BERT contextual embeddings in order to apply them to the Latvian language. Training is performed on a recently developed dataset of sense example sentences. The experiments demonstrate the feasibility of the approach by applying a mixture-of-experts approach to word sense disambiguation to this data, developing the first proof-of-concept WSD system for Latvian that uses state-of-the-art approaches. The WSD solution was evaluated on a selection of 18 highly ambiguous words, demonstrating reasonable performance.
Pages: 402-408
Page count: 7