Pseudo Relevance Feedback Technique and Semantic Similarity for Corpus-based Expansion

被引:0
|
作者
Mohd, Masnizah [1 ]
Atwan, Jaffar [2 ]
Shirai, Kiyoaki [1 ]
机构
[1] Japan Adv Inst Sci & Technol, 1-1 Asahidai, Nomi, Ishikawa 9231292, Japan
[2] Univ Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
关键词
Query Expansion; Pseudo Relevance Feedback; Semantic; Information Retrieval; Arabic;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The adaptation of a Query Expansion (QE) approach for Arabic documents may produce the worst rankings or irrelevant results. Therefore, we have introduced a technique, which is to utilise the Arabic WordNet in the corpus and query expansion level. A Point-wise Mutual Information (PMI) corpus-based measure is used to semantically select synonyms from the WordNet. In addition, Automatic Query Expansion (AQE) and Pseudo Relevance Feedback (PRF) methods were also explored to improve the performance of the Arabic information retrieval (AIR) system. The experimental results of our proposed techniques for AIR shows that the use of Arabic WordNet in the corpus and query level together with AQE, and the adaptation of PMI in the expansion process have successfully reduced the level of ambiguity as these techniques select the most appropriate synonym. It enhanced knowledge discovery by taking care of the relevancy aspect. The techniques also demonstrated an improvement in Mean Average Precision by 49%, with an increase of 7.3% in recall in comparison to the baseline.
引用
收藏
页码:445 / 450
页数:6
相关论文
共 50 条
  • [41] CORPUS-BASED SYNTACTIC-SEMANTIC GRAPH ANALYSIS: SEMANTIC DOMAINS OF THE CONCEPT FEELING
    Perak, Benedikt
    Kirigin, Tajana Ban
    RASPRAVE, 2020, 46 (02): : 957 - 996
  • [42] Corpus-based query expansion in Online public access catalogs
    Komarjaya, J
    Poo, DCC
    Kan, MY
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2004, 3232 : 221 - 231
  • [43] Corpus-based Set Expansion with Lexical Features and Distributed Representations
    Yu, Puxuan
    Huang, Zhiqi
    Rahimi, Razieh
    Allan, James
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1153 - 1156
  • [44] Cross-lingual pseudo-relevance feedback using a comparable corpus
    Rogati, M
    Yang, YM
    EVLAUATION OF CROSS-LANGUAGE INFORMATION RETRIEVAL SYSTEMS, 2002, 2406 : 151 - 157
  • [45] Short Tamil Sentence Similarity Calculation using Knowledge-Based and Corpus-Based Similarity Measures
    Selvarasa, Anutharsha
    Thirunavukkarasu, Nilasini
    Rajendran, Niveathika
    Yogalingam, Chinthoorie
    Ranathunga, Surangika
    Dias, Gihan
    2017 3RD INTERNATIONAL MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2017, : 443 - 448
  • [46] Improvement on Corpus-Based Word Similarity Using Vector Space Models
    Esin, Yunus Emre
    Alan, Oezguer
    Alpaslan, Ferda Nur
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 279 - 284
  • [47] A Novel Fuzzy Logic Model for Pseudo-Relevance Feedback-Based Query Expansion
    Singh, Jagendra
    Prasad, Mukesh
    Prasad, Om Kumar
    Joo, Er Meng
    Saxena, Amit Kumar
    Lin, Chin-Teng
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2016, 18 (06) : 980 - 989
  • [48] A Corpus-Based Study on Pseudo-ditransitive Verbs in Mandarin Chinese
    Kuo, Pei-Jung
    CHINESE LEXICAL SEMANTICS, CLSW 2016, 2016, 10085 : 241 - 248
  • [49] A Novel Fuzzy Logic Model for Pseudo-Relevance Feedback-Based Query Expansion
    Jagendra Singh
    Mukesh Prasad
    Om Kumar Prasad
    Er Meng Joo
    Amit Kumar Saxena
    Chin-Teng Lin
    International Journal of Fuzzy Systems, 2016, 18 : 980 - 989
  • [50] Enhancing passage retrieval in log files by query expansion based on explicit and pseudo relevance feedback
    Saneifar, Hassan
    Bonniol, Stephane
    Poncelet, Pascal
    Roche, Mathieu
    COMPUTERS IN INDUSTRY, 2014, 65 (06) : 937 - 951