The long road from performing word sense disambiguation to successfully using it in information retrieval: An overview of the unsupervised approach

被引:7
|
作者
Hristea, Florentina [1 ]
Colhon, Mihaela [2 ]
机构
[1] Univ Bucharest, Comp Sci Dept, Bucharest, Romania
[2] Univ Craiova, Comp Sci Dept, Craiova, Romania
关键词
ambiguous query; information retrieval; Naive Bayes model; spectral clustering; unsupervised word sense disambiguation; word sense disambiguation; DISCRIMINATION; CONSTRUCTION; ALGORITHM; KNOWLEDGE;
D O I
10.1111/coin.12303
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The issue of whether or not word sense disambiguation (WSD) can improve information retrieval (IR) results has been intensely debated over the years, with many inconclusive or contradictory results and a majority of skeptical opinions. All three classes of WSD methods (supervised, unsupervised, and knowledge-based) have been considered by the literature with respect to IR. We hereby survey the unsupervised approach which, although relatively rarely used, has provided positive results at a large scale. Unsupervised WSD has already made proof of its utility in IR and it is our belief that it still holds a promise for this field. The two main existing types of unsupervised methods for IR, which are of completely different natures, are presented, within the scientific context in which they were born, and are compared. Regardless of the gap in time between these central approaches, we are of the opinion that the unsupervised solution to the discussed problem remains the most significant for IR applications. By surveying what we consider the most promising existing approach to usage of WSD in IR, and by discussing its possible extensions, we hope to stimulate continuation of this line of research, possibly at an even more successful level.
引用
收藏
页码:1026 / 1062
页数:37
相关论文
共 44 条
  • [1] Arabic Word Sense Disambiguation for Information Retrieval
    Abderrahim, Mohammed Alaeddine
    Abderrahim, Mohammed El-Amine
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (04)
  • [2] Word Sense Disambiguation in Bengali: an Unsupervised Approach
    Pal, Alok Ranjan
    Saha, Diganta
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,
  • [3] Unsupervised Approach to Word Sense Disambiguation in Malayalam
    Sankar, Sruthi K. P.
    Raj, P. C. Reghu
    Jayan, V
    INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING, SCIENCE AND TECHNOLOGY (ICETEST - 2015), 2016, 24 : 1507 - 1513
  • [4] Unsupervised Word Sense Disambiguation Using The WWW
    Klapaftis, Ioannis P.
    Manandhar, Suresh
    STAIRS 2006, 2006, 142 : 174 - 183
  • [5] Word Sense Disambiguation based on IDF applied to Information Retrieval
    Perea-Ortega, Jose M.
    Martinez-Santiago, Fernando
    Garcia-Cumbreras, Miguel A.
    Montejo-Raez, Arturo
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (46): : 99 - 106
  • [6] An Intelligent Information Retrieval System Using Automatic Word Sense Disambiguation
    Ramasubramanian, Prasanna G.
    Agah, Arvin
    Gauch, Susan E.
    JOURNAL OF INTELLIGENT SYSTEMS, 2007, 16 (02) : 135 - 166
  • [7] Unsupervised Korean Word Sense Disambiguation using CoreNet
    Han, Kijong
    Nam, Sangha
    Kim, Jiseong
    Hahm, Younggyun
    Choi, Key-Sun
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1023 - 1026
  • [8] A clustering-based Approach for Unsupervised Word Sense Disambiguation
    Martin-Wanton, Tamara
    Berlanga-Llavori, Rafael
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (49): : 49 - 56
  • [9] Word sense disambiguation using implicit information
    Jain, Goonjan
    Lobiyal, D. K.
    NATURAL LANGUAGE ENGINEERING, 2020, 26 (04) : 413 - 432
  • [10] Performing word sense disambiguation at the border between unsupervised and knowledge-based techniques
    Hristea, Florentina
    Popescu, Marius
    Dumitrescu, Monica
    ARTIFICIAL INTELLIGENCE REVIEW, 2008, 30 (1-4) : 67 - 86