Pseudo Relevance Feedback Technique and Semantic Similarity for Corpus-based Expansion

Authors
Mohd, Masnizah [1]
Atwan, Jaffar [2]
Shirai, Kiyoaki [1]
Affiliations
[1] Japan Adv Inst Sci & Technol, 1-1 Asahidai, Nomi, Ishikawa 9231292, Japan
[2] Univ Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
Keywords
Query Expansion; Pseudo Relevance Feedback; Semantic; Information Retrieval; Arabic
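DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract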
Adapting a Query Expansion (QE) approach to Arabic documents may produce worse rankings or irrelevant results. We therefore introduce a technique that utilises the Arabic WordNet at both the corpus and query expansion levels. A Point-wise Mutual Information (PMI) corpus-based measure is used to semantically select synonyms from the WordNet. In addition, Automatic Query Expansion (AQE) and Pseudo Relevance Feedback (PRF) methods were explored to improve the performance of the Arabic information retrieval (AIR) system. The experimental results show that using the Arabic WordNet at the corpus and query levels together with AQE, and adapting PMI in the expansion process, successfully reduced the level of ambiguity, because these techniques select the most appropriate synonym. This enhanced knowledge discovery by addressing the relevance aspect. The techniques also improved Mean Average Precision by 49% and recall by 7.3% in comparison to the baseline.
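As a rough illustration of how a corpus-based PMI measure can choose among candidate synonyms, the sketch below scores each candidate by its PMI with the other query terms and keeps the highest-scoring one. It is not the authors' implementation. The function names (`pmi`, `select_synonym`), the document-level co-occurrence window, the summed score, and the toy English corpus are all assumptions made for the example. The abstract does not specify these details.

```python
import math

def pmi(term_a, term_b, documents):
    """PMI of two terms from document-level co-occurrence counts.

    `documents` is a list of token sets. Returns -inf when the two
    terms never appear in the same document (assumed convention).
    """
    n = len(documents)
    count_a = sum(1 for doc in documents if term_a in doc)
    count_b = sum(1 for doc in documents if term_b in doc)
    count_ab = sum(1 for doc in documents if term_a in doc and term_b in doc)
    if count_ab == 0:
        return float("-inf")
    # PMI(a, b) = log2( P(a, b) / (P(a) * P(b)) )
    return math.log2((count_ab / n) / ((count_a / n) * (count_b / n)))

def select_synonym(context_terms, candidates, documents):
    """Pick the candidate whose summed PMI with the context terms is highest."""
    def score(candidate):
        return sum(pmi(candidate, term, documents) for term in context_terms)
    return max(candidates, key=score)

# Hypothetical toy corpus: each document is a set of tokens.
docs = [
    {"bank", "river", "water"},
    {"bank", "money", "loan"},
    {"shore", "river", "water"},
    {"money", "loan", "credit"},
]

# Candidate synonyms (e.g. as returned by a WordNet lookup) for a query about rivers.
best = select_synonym(["river"], ["shore", "credit"], docs)
print(best)
```

Here "shore" co-occurs with "river" in the toy corpus while "credit" never does, so the PMI score favours "shore". A real system would estimate the counts from the full collection and would need a policy for unseen pairs, such as smoothing.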
Pages: 445-450
Number of pages: 6
Related papers
50 in total
  • [1] Improving pseudo relevance feedback based query expansion using genetic fuzzy approach and semantic similarity notion
    Bhatnagar, Pragati
    Pareek, Narendra
    JOURNAL OF INFORMATION SCIENCE, 2014, 40 (04) : 523 - 537
  • [2] Semantic text similarity using corpus-based word similarity and string similarity
    University of Ottawa
    Unknown
    ACM Transactions on Knowledge Discovery from Data, 2008, 2 (02)
  • [3] Hybrid Query Expansion Model Based on Pseudo Relevance Feedback and Semantic Tree for Arabic IR
    Mazari, Ahmed Cherif
    Djeffal, Abdelhamid
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2022, 12 (01)
  • [4] Relevance Feedback Based Query Expansion Model Using Borda Count and Semantic Similarity Approach
    Singh, Jagendra
    Sharan, Aditi
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2015, 2015
  • [5] Applications of corpus-based semantic similarity and word segmentation to database schema matching
    Aminul Islam
    Diana Inkpen
    Iluju Kiringa
    The VLDB Journal, 2008, 17 : 1293 - 1320
  • [6] Applications of corpus-based semantic similarity and word segmentation to database schema matching
    Islam, Aminul
    Inkpen, Diana
    Kiringa, Iluju
    VLDB JOURNAL, 2008, 17 (05) : 1293 - 1320
  • [7] Fuzzy Logic Hybrid Model with Semantic Filtering Approach for Pseudo Relevance Feedback-based Query Expansion
    Singh, Jagendra
    Prasad, Mukesh
    Daraghmi, Yousef Awwad
    Tiwari, Prayag
    Yadav, Pranay
    Bharill, Neha
    Pratama, Mahardhika
    Saxena, Amit
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 1907 - 1913
  • [8] A Corpus-based Semantic Study of Possibly
    Wu, Guoliang
    Feng, Chuncan
    PROCEEDINGS OF 2011 INTERNATIONAL SYMPOSIUM ON COGNITIVE LINGUISTICS AND ENGLISH LEARNING, 2012, : 190 - 197
  • [9] A corpus-based relevance feedback approach to cross-language image retrieval
    Chang, Yih-Chen
    Lin, Wen-Cheng
    Chen, Hsin-Hsi
    ACCESSING MULTILINGUAL INFORMATION REPOSITORIES, 2006, 4022 : 592 - 601
  • [10] Similarity and Difference in Corpus-based Translation Studies
    Sara Laviosa
    外国语(上海外国语大学学报), 2007, (05) : 56 - 63