Word sense discrimination in information retrieval: A spectral clustering-based approach

被引:20
|
作者
Chifu, Adrian-Gabriel [1 ]
Hristea, Florentina [2 ]
Mothe, Josiane [3 ]
Popescu, Marius [2 ]
机构
[1] Univ Toulouse 3, Univ Toulouse, CNRS, IRIT UMR5505, F-31062 Toulouse 9, France
[2] Univ Bucharest, Fac Math & Comp Sci, Dept Comp Sci, RO-010014 Bucharest, Romania
[3] Univ Toulouse, Ecole Super Professorat & Educ, CNRS, IRIT UMR5505, F-31062 Toulouse 9, France
关键词
Information retrieval; Word sense disambiguation; Word sense discrimination; Spectral clustering; High precision;
D O I
10.1016/j.ipm.2014.10.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Word sense ambiguity has been identified as a cause of poor precision in information retrieval (IR) systems. Word sense disambiguation and discrimination methods have been defined to help systems choose which documents should be retrieved in relation to an ambiguous query. However, the only approaches that show a genuine benefit for word sense discrimination or disambiguation in IR are generally supervised ones. In this paper we propose a new unsupervised method that uses word sense discrimination in IR. The method we develop is based on spectral clustering and reorders an initially retrieved document list by boosting documents that are semantically similar to the target query. For several TREC ad hoc collections we show that our method is useful in the case of queries which contain ambiguous terms. We are interested in improving the level of precision after 5, 10 and 30 retrieved documents (P@5, P@10, P@30) respectively. We show that precision can be improved by 8% above current state-of-the-art baselines. We also focus on poor performing queries. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:16 / 31
页数:16
相关论文
共 50 条
  • [31] Clustering Algorithms for Query Expansion Based Information Retrieval
    Khennak, Ilyes
    Drias, Habiba
    Kechid, Amine
    Moulai, Hadjer
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT II, 2019, 11684 : 261 - 272
  • [32] Enhancing Clustering Stability in VANET: A Spectral Clustering Based Approach
    Gang Liu
    Nan Qi
    Jiaxin Chen
    Chao Dong
    Zanqi Huang
    中国通信, 2020, 17 (04) : 140 - 151
  • [33] Spectral Clustering-based Pinning Control Strategy for Vehicle Platoons in Connected and Automated Environments
    Wang, Can
    Zhao, Yan
    Li, Lin-Heng
    Qu, Xu
    Ran, Bin
    Zhongguo Gonglu Xuebao/China Journal of Highway and Transport, 2024, 37 (11): : 235 - 248
  • [34] A Gloss Composition and Context Clustering Based Distributed Word Sense Representation Model
    Chen, Tao
    Xu, Ruifeng
    He, Yulan
    Wang, Xuan
    ENTROPY, 2015, 17 (09) : 6007 - 6024
  • [35] A spectral clustering algorithm based on intuitionistic fuzzy information
    Xu, Dawei
    Xu, Zeshui
    Liu, Shousheng
    Zhao, Hua
    KNOWLEDGE-BASED SYSTEMS, 2013, 53 : 20 - 26
  • [36] A spectral clustering based ensemble pruning approach
    Zhang, Huaxiang
    Cao, Linlin
    NEUROCOMPUTING, 2014, 139 : 289 - 297
  • [37] An Adaptive Page Clustering Based weighting Method for Information Retrieval
    Lin, Yi-Xian
    Kao, Hung-Yu
    2013 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2013, : 199 - 204
  • [38] A New Approach to Word Sense Disambiguation Based on Context Similarity
    Nameh, M.
    Fakhrahmad, S. M.
    Jahromi, M. Zolghadri
    WORLD CONGRESS ON ENGINEERING, WCE 2011, VOL I, 2011, : 456 - 459
  • [39] A Biomedical Information Retrieval System based on Clustering for Mobile Devices
    Millan, Manuel
    Munoz, Alejandro
    de la Villa, Manuel
    Mana, Manuel J.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (45): : 255 - 258
  • [40] Clustering of Relevant Documents Based on Findability Effort in Information Retrieval
    Rajagopal, Prabha
    Aghris, Taoufik
    Fettah, Fatima-Ezzahra
    Ravana, Sri Devi
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2022, 12 (01)