The long road from performing word sense disambiguation to successfully using it in information retrieval: An overview of the unsupervised approach

Cited by: 7
Authors
Hristea, Florentina [1]
Colhon, Mihaela [2]
Affiliations
[1] Univ Bucharest, Comp Sci Dept, Bucharest, Romania
[2] Univ Craiova, Comp Sci Dept, Craiova, Romania
Keywords
ambiguous query; information retrieval; Naive Bayes model; spectral clustering; unsupervised word sense disambiguation; word sense disambiguation; DISCRIMINATION; CONSTRUCTION; ALGORITHM; KNOWLEDGE;
DOI
10.1111/coin.12303
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The question of whether word sense disambiguation (WSD) can improve information retrieval (IR) results has been intensely debated over the years, with many inconclusive or contradictory results and a majority of skeptical opinions. All three classes of WSD methods (supervised, unsupervised, and knowledge-based) have been considered in the literature with respect to IR. We survey the unsupervised approach which, although relatively rarely used, has provided positive results at a large scale. Unsupervised WSD has already proven its utility in IR, and we believe that it still holds promise for this field. The two main existing types of unsupervised methods for IR, which are of completely different natures, are presented within the scientific context in which they emerged and are then compared. Regardless of the gap in time between these central approaches, we are of the opinion that the unsupervised solution to the discussed problem remains the most significant for IR applications. By surveying what we consider the most promising existing approach to the use of WSD in IR, and by discussing its possible extensions, we hope to stimulate continuation of this line of research, possibly at an even more successful level.
Pages: 1026-1062
Page count: 37