The long road from performing word sense disambiguation to successfully using it in information retrieval: An overview of the unsupervised approach

被引:7
|
作者
Hristea, Florentina [1 ]
Colhon, Mihaela [2 ]
机构
[1] Univ Bucharest, Comp Sci Dept, Bucharest, Romania
[2] Univ Craiova, Comp Sci Dept, Craiova, Romania
关键词
ambiguous query; information retrieval; Naive Bayes model; spectral clustering; unsupervised word sense disambiguation; word sense disambiguation; DISCRIMINATION; CONSTRUCTION; ALGORITHM; KNOWLEDGE;
D O I
10.1111/coin.12303
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The issue of whether or not word sense disambiguation (WSD) can improve information retrieval (IR) results has been intensely debated over the years, with many inconclusive or contradictory results and a majority of skeptical opinions. All three classes of WSD methods (supervised, unsupervised, and knowledge-based) have been considered by the literature with respect to IR. We hereby survey the unsupervised approach which, although relatively rarely used, has provided positive results at a large scale. Unsupervised WSD has already made proof of its utility in IR and it is our belief that it still holds a promise for this field. The two main existing types of unsupervised methods for IR, which are of completely different natures, are presented, within the scientific context in which they were born, and are compared. Regardless of the gap in time between these central approaches, we are of the opinion that the unsupervised solution to the discussed problem remains the most significant for IR applications. By surveying what we consider the most promising existing approach to usage of WSD in IR, and by discussing its possible extensions, we hope to stimulate continuation of this line of research, possibly at an even more successful level.
引用
收藏
页码:1026 / 1062
页数:37
相关论文
共 44 条
  • [21] Word sense discrimination in information retrieval: A spectral clustering-based approach
    Chifu, Adrian-Gabriel
    Hristea, Florentina
    Mothe, Josiane
    Popescu, Marius
    INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (02) : 16 - 31
  • [22] Word Sense Disambiguation of Bangla Sentences Using Statistical Approach
    Nazah, Saiba
    Hoque, Mohammed Moshiul
    Hossain, Md. Rajib
    2017 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT 2017), 2017,
  • [23] Unsupervised similarity-based word sense disambiguation using context vectors and sentential word importance
    Abdalgader, Khaled
    Skabar, Andrew
    ACM Transactions on Speech and Language Processing, 2012, 9 (01):
  • [24] Improving the Accuracy of Document Similarity Approach using Word Sense Disambiguation
    Veena, G.
    Veni, Umesha Sree U. B.
    PROCEEDING OF THE THIRD INTERNATIONAL SYMPOSIUM ON WOMEN IN COMPUTING AND INFORMATICS (WCI-2015), 2015, : 196 - 202
  • [25] An Empirical Framework for Bangla Word Sense Disambiguation Using Statistical Approach
    Biswas, Monisha
    Sharif, Omar
    Hoque, Mohammed Moshiul
    MACHINE LEARNING AND BIG DATA ANALYTICS (PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND BIG DATA ANALYTICS (ICMLBDA) 2021), 2022, 256 : 22 - 33
  • [26] Unsupervised word sense disambiguation and rules extraction using non-aligned bilingual corpus
    Oliveira, F
    Wong, F
    Li, YP
    Zheng, J
    Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 30 - 35
  • [27] WSD-TIC: Word Sense Disambiguation Using Taxonomic Information Content
    Ben Aouicha, Mohamed
    Taieb, Mohamed Ali Hadj
    Ibn Marai, Hania
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2016, PT I, 2016, 9875 : 131 - 142
  • [28] Unsupervised word sense disambiguation for Korean through the acyclic weighted digraph using corpus and dictionary
    Yoon, Y
    Seon, CN
    Lee, S
    Seo, J
    INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (03) : 710 - 722
  • [29] Adapted Lesk Algorithm based Word Sense Disambiguation using the Context Information
    Kumar, Manish
    Mukherjee, Prasenjit
    Hendre, Manik
    Godse, Manish
    Chakraborty, Baisakhi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 254 - 260
  • [30] A Graph-based Approach to Word Sense Disambiguation. An Unsupervised Method Based on Semantic Relatedness
    Arab, Meysam
    Jahromi, Mansoor Zolghadri
    Fakhrahmad, Seyed Mostafa
    2016 24TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2016, : 250 - 255