Word Sense Disambiguation using KeNet

被引:1
作者
Cetiner, Meltem [1 ,2 ]
Yildirim, Ahmet [1 ]
Onay, Bahadir [1 ]
Oksuz, Cuneyt [1 ]
机构
[1] Idea Teknol Cozumleri, Maslak Mah Sanatkarlar Sok 5-8, TR-34398 Sariyer, Turkey
[2] Gebze Tekn Univ, Bilgisayar Muhendisligi Bolumu, Kocaeli, Turkey
来源
29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021) | 2021年
关键词
Word Sense Disambiguation; WordNet; KeNet; Word Embedding Vector; BERT; Ranking Based Precision;
D O I
10.1109/SIU53274.2021.9477816
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The highly studied Natural Language Processing (NLP) problem Word Sense Disambiguation (WSD) is the process of removing the ambiguities of multiple-sense words that have the same morphological structure. The first step of WSD is to list the probable meanings of the word.The next step is identify the meaning which is used in the context within the sentence. A Turkish WordNet called KeNet is used to list the word-senses. The elimination of the ambiguity was done with BERT word embeddings by comparing with the meaning via cosine similarity. The impact of the system is evaluated by both with and without adding it to a search engine of Turkish news. Each news of topmost 10 news returned for each query is manually labeled as related or not related.Results of the labeling, relatedness of the first n documents, and ordered biased precision metrics are evaluated. Positive increment on the results is shown when the WSD modul is added on the system.
引用
收藏
页数:4
相关论文
共 19 条
[1]  
Akin A. A., 2007, Structure, V10, P1
[2]  
[Anonymous], 2002, TORCH MODULAR MACHIN
[3]  
[Anonymous], Apache Solr
[4]  
Bakay O., 2021, P 11 GLOB WORDN C, P166
[5]  
Basile P., 2014, An enhanced lesk word sense disambiguation algorithm through a distributional semantic model, P1591
[6]  
Baziotis C, 2017, P 11 INT WORKSH SEM, P747, DOI DOI 10.18653/V1/S17-2126
[7]  
Decadt B., 2004, Proceedings of the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (Senseval-3), P108
[8]  
Derwojedowa M., 2008, P GLOB WORDNET C SEG, P162
[9]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[10]  
Ide Nancy., 1998, Computational linguistics