Word sense discrimination in information retrieval: A spectral clustering-based approach

被引:20
|
作者
Chifu, Adrian-Gabriel [1 ]
Hristea, Florentina [2 ]
Mothe, Josiane [3 ]
Popescu, Marius [2 ]
机构
[1] Univ Toulouse 3, Univ Toulouse, CNRS, IRIT UMR5505, F-31062 Toulouse 9, France
[2] Univ Bucharest, Fac Math & Comp Sci, Dept Comp Sci, RO-010014 Bucharest, Romania
[3] Univ Toulouse, Ecole Super Professorat & Educ, CNRS, IRIT UMR5505, F-31062 Toulouse 9, France
关键词
Information retrieval; Word sense disambiguation; Word sense discrimination; Spectral clustering; High precision;
D O I
10.1016/j.ipm.2014.10.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Word sense ambiguity has been identified as a cause of poor precision in information retrieval (IR) systems. Word sense disambiguation and discrimination methods have been defined to help systems choose which documents should be retrieved in relation to an ambiguous query. However, the only approaches that show a genuine benefit for word sense discrimination or disambiguation in IR are generally supervised ones. In this paper we propose a new unsupervised method that uses word sense discrimination in IR. The method we develop is based on spectral clustering and reorders an initially retrieved document list by boosting documents that are semantically similar to the target query. For several TREC ad hoc collections we show that our method is useful in the case of queries which contain ambiguous terms. We are interested in improving the level of precision after 5, 10 and 30 retrieved documents (P@5, P@10, P@30) respectively. We show that precision can be improved by 8% above current state-of-the-art baselines. We also focus on poor performing queries. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:16 / 31
页数:16
相关论文
共 50 条
  • [21] A spectral clustering-based framework for detecting community structures in complex networks
    Jiang, Jeffrey Q.
    Dress, Andreas W. M.
    Yang, Genke
    APPLIED MATHEMATICS LETTERS, 2009, 22 (09) : 1479 - 1482
  • [22] A new approach to clustering records in information retrieval systems
    Moghrabi, IAR
    Makholian, RA
    INFORMATION RETRIEVAL, 2000, 3 (02): : 105 - 126
  • [23] A Novel Information Retrieval Approach using Query Expansion and Spectral-based
    Alnofaie, Sara
    Dahab, Mohammed
    Kamal, Mahmoud
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (09) : 364 - 373
  • [24] Semantic Clustering Approach Based Multi-agent System for Information Retrieval on Web
    Alsulami, Bassma S.
    Abulkhair, Maysoon F.
    Essa, Fathy A.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2012, 12 (01): : 41 - 46
  • [25] The Hyperspectral Image Clustering Based on Spatial Information and Spectral Clustering
    Wei, Yiwei
    Niu, Chao
    Wang, Hongxia
    Liu, Daizhi
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 127 - 131
  • [26] A spectral clustering-based optimal deployment method for scientific application in cloud computing
    Fan, Pei
    Wang, Ji
    Chen, Zhenbang
    Zheng, Zibin
    Lyu, Michael R.
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2012, 8 (01) : 31 - 55
  • [27] Spectral Clustering-based Matrix Completion Method for Top-n Recommendation
    Zhou, Qingmei
    Chen, Xin
    Zhang, Jiuya
    ICCDE 2019: PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING AND DATA ENGINEERING, 2019, : 1 - 6
  • [28] Query expansion based on clustering and personalized information retrieval
    Hamid Khalifi
    Walid Cherif
    Abderrahim El Qadi
    Youssef Ghanou
    Progress in Artificial Intelligence, 2019, 8 : 241 - 251
  • [29] Query expansion based on clustering and personalized information retrieval
    Khalifi, Hamid
    Cherif, Walid
    El Qadi, Abderrahim
    Ghanou, Youssef
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2019, 8 (02) : 241 - 251
  • [30] Managing word mismatch problems in information retrieval: A topic-based query expansion approach
    Wei, Chih-Ping
    Hu, Paul Jen-Hwa
    Tai, Chia-Hung
    Huang, Chun-Neng
    Yang, Chin-Sheng
    JOURNAL OF MANAGEMENT INFORMATION SYSTEMS, 2007, 24 (03) : 269 - 295