Pseudo Relevance Feedback Technique and Semantic Similarity for Corpus-based Expansion

被引:0
|
作者
Mohd, Masnizah [1 ]
Atwan, Jaffar [2 ]
Shirai, Kiyoaki [1 ]
机构
[1] Japan Adv Inst Sci & Technol, 1-1 Asahidai, Nomi, Ishikawa 9231292, Japan
[2] Univ Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
关键词
Query Expansion; Pseudo Relevance Feedback; Semantic; Information Retrieval; Arabic;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The adaptation of a Query Expansion (QE) approach for Arabic documents may produce the worst rankings or irrelevant results. Therefore, we have introduced a technique, which is to utilise the Arabic WordNet in the corpus and query expansion level. A Point-wise Mutual Information (PMI) corpus-based measure is used to semantically select synonyms from the WordNet. In addition, Automatic Query Expansion (AQE) and Pseudo Relevance Feedback (PRF) methods were also explored to improve the performance of the Arabic information retrieval (AIR) system. The experimental results of our proposed techniques for AIR shows that the use of Arabic WordNet in the corpus and query level together with AQE, and the adaptation of PMI in the expansion process have successfully reduced the level of ambiguity as these techniques select the most appropriate synonym. It enhanced knowledge discovery by taking care of the relevancy aspect. The techniques also demonstrated an improvement in Mean Average Precision by 49%, with an increase of 7.3% in recall in comparison to the baseline.
引用
收藏
页码:445 / 450
页数:6
相关论文
共 50 条
  • [21] Short Answer Grading Using String Similarity And Corpus-Based Similarity
    Gomaa, Wael H.
    Fahmy, Aly A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2012, 3 (11) : 115 - 121
  • [22] Query expansion using pseudo relevance feedback on wikipedia
    Andisheh Keikha
    Faezeh Ensan
    Ebrahim Bagheri
    Journal of Intelligent Information Systems, 2018, 50 : 455 - 478
  • [23] Query expansion using pseudo relevance feedback on wikipedia
    Keikha, Andisheh
    Ensan, Faezeh
    Bagheri, Ebrahim
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2018, 50 (03) : 455 - 478
  • [24] Pseudo-relevance feedback based query expansion using boosting algorithm
    Imran Rasheed
    Haider Banka
    Hamaid Mahmood Khan
    Artificial Intelligence Review, 2021, 54 : 6101 - 6124
  • [25] Corpus-based semantic role approach in information retrieval
    Moreda, Palorna
    Navarro, Borja
    Palomar, Manuel
    DATA & KNOWLEDGE ENGINEERING, 2007, 61 (03) : 467 - 483
  • [26] Pseudo-relevance feedback based query expansion using boosting algorithm
    Rasheed, Imran
    Banka, Haider
    Khan, Hamaid Mahmood
    ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (08) : 6101 - 6124
  • [27] Corpus-based Semantic Relatedness for the Construction of Polish WordNet
    Broda, Bartosz
    Derwojedowa, Magdalena
    Piasecki, Maciej
    Szpakowicz, Stanislaw
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1800 - 1807
  • [28] Semantic associations in business English: A corpus-based analysis
    Nelson, M
    ENGLISH FOR SPECIFIC PURPOSES, 2006, 25 (02) : 217 - 234
  • [29] CorSub: Corpus-Based Semantic Topic Matching for Subscriptions
    Piller, Tobias Christian
    Kestner, Christian
    Khelil, Abdelmajid
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 2, ICICT 2024, 2024, 1012 : 495 - 506
  • [30] Strudel: A Corpus-Based Semantic Model Based on Properties and Types
    Baroni, Marco
    Murphy, Brian
    Barbu, Eduard
    Poesio, Massimo
    COGNITIVE SCIENCE, 2010, 34 (02) : 222 - 254