Models and Strategies for Russian Word Sense Disambiguation: A Comparative Analysis

被引:0
|
作者
Aleksandrova, Anastasiia [1 ]
Nivre, Joakim [1 ,2 ]
机构
[1] Uppsala Univ, Uppsala, Sweden
[2] RISE Res Inst Sweden, Stockholm, Sweden
来源
TEXT, SPEECH, AND DIALOGUE, TSD 2024, PT I | 2024年 / 15048卷
关键词
word sense disambiguation; BERT; Russian;
D O I
10.1007/978-3-031-70563-2_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word sense disambiguation (WSD) is a core task in computational linguistics that involves interpreting polysemous words in context by identifying senses from a predefined sense inventory. Despite the dominance of BERT and its derivatives in WSD evaluation benchmarks, their effectiveness in encoding and retrieving word senses, especially in languages other than English, remains relatively unexplored. This paper provides a detailed quantitative analysis, comparing various BERT-based models for Russian, and examines two primary WSD strategies: fine-tuning and feature-based nearest-neighbor classification. The best results are obtained with the ruBERT model coupled with the feature-based nearest neighbor strategy. This approach adeptly captures even fine-grained meanings with limited data and diverse sense distributions.
引用
收藏
页码:267 / 278
页数:12
相关论文
共 50 条
  • [31] A word sense disambiguation corpus for Urdu
    Saeed, Ali
    Nawab, Rao Muhammad Adeel
    Stevenson, Mark
    Rayson, Paul
    LANGUAGE RESOURCES AND EVALUATION, 2019, 53 (03) : 397 - 418
  • [32] Word Sense Disambiguation by Semantic Inference
    Wang, Xinda
    Tang, Xuri
    Qu, Weiguang
    Gu, Min
    PROCEEDINGS OF 4TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC ADVANCE IN BEHAVIORAL, ECONOMIC, SOCIOCULTURAL COMPUTING (BESC), 2017,
  • [33] Minimal Semantics and Word Sense Disambiguation
    Gasparri, Luca
    DISPUTATIO-INTERNATIONAL JOURNAL OF PHILOSOPHY, 2014, 6 (39): : 147 - 171
  • [34] Arabic Word Sense Disambiguation - Survey
    Alian, Marwah
    Awajan, Arafat
    Al-Kouz, Akram
    2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 236 - 240
  • [35] An improved algorithm on word sense disambiguation
    Serban, G
    Tatar, D
    INTELLIGENT INFORMATION PROCESSING AND WEB MINING, 2003, : 199 - 208
  • [36] Feature expansion for word sense disambiguation
    Tsao, NL
    Wible, D
    Kuo, CH
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 126 - 131
  • [37] Determining the difficulty of Word Sense Disambiguation
    McInnes, Bridget T.
    Stevenson, Mark
    JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 47 : 83 - 90
  • [38] A word sense disambiguation corpus for Urdu
    Ali Saeed
    Rao Muhammad Adeel Nawab
    Mark Stevenson
    Paul Rayson
    Language Resources and Evaluation, 2019, 53 : 397 - 418
  • [39] Word Sense Disambiguation in Nepali Language
    Dhungana, Udaya Raj
    Shakya, Subarna
    2014 FOURTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION AND COMMUNICATION TECHNOLOGY AND IT'S APPLICATIONS (DICTAP), 2014, : 46 - 50
  • [40] Arabic word sense disambiguation: a review
    Bilel Elayeb
    Artificial Intelligence Review, 2019, 52 : 2475 - 2532