Pseudo Relevance Feedback Technique and Semantic Similarity for Corpus-based Expansion

被引:0
|
作者
Mohd, Masnizah [1 ]
Atwan, Jaffar [2 ]
Shirai, Kiyoaki [1 ]
机构
[1] Japan Adv Inst Sci & Technol, 1-1 Asahidai, Nomi, Ishikawa 9231292, Japan
[2] Univ Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia
关键词
Query Expansion; Pseudo Relevance Feedback; Semantic; Information Retrieval; Arabic;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The adaptation of a Query Expansion (QE) approach for Arabic documents may produce the worst rankings or irrelevant results. Therefore, we have introduced a technique, which is to utilise the Arabic WordNet in the corpus and query expansion level. A Point-wise Mutual Information (PMI) corpus-based measure is used to semantically select synonyms from the WordNet. In addition, Automatic Query Expansion (AQE) and Pseudo Relevance Feedback (PRF) methods were also explored to improve the performance of the Arabic information retrieval (AIR) system. The experimental results of our proposed techniques for AIR shows that the use of Arabic WordNet in the corpus and query level together with AQE, and the adaptation of PMI in the expansion process have successfully reduced the level of ambiguity as these techniques select the most appropriate synonym. It enhanced knowledge discovery by taking care of the relevancy aspect. The techniques also demonstrated an improvement in Mean Average Precision by 49%, with an increase of 7.3% in recall in comparison to the baseline.
引用
收藏
页码:445 / 450
页数:6
相关论文
共 50 条
  • [31] A Corpus-based View of Semantic Prosody in Business English
    Li Zeying
    2012 INTERNATIONAL CONFERENCE ON EDUCATION REFORM AND MANAGEMENT INNOVATION (ERMI 2012), VOL 5, 2013, : 293 - 298
  • [32] Behavioral profiles A corpus-based approach to cognitive semantic analysis
    Gries, Stefan Th.
    Divjak, Dagmar
    NEW DIRECTIONS IN COGNITIVE LINGUISTICS, 2009, 24 : 57 - 75
  • [33] Analysis on Semantic Prosody of 'mianzi' and 'lian': A Corpus-Based Study
    Gan, Yeechin
    CHINESE LEXICAL SEMANTICS (CLSW 2015), 2015, 9332 : 101 - 111
  • [34] Corpus-based approaches to semantic interpretation in natural language processing
    Ng, HT
    Zelle, J
    AI MAGAZINE, 1997, 18 (04) : 45 - 64
  • [35] Integration of semantic networks for corpus-based word sense disambiguation
    Moon, YJ
    Min, KH
    Hwang, YH
    Kim, P
    LOGIC PROGRAMMING, PROCEEDINGS, 2003, 2916 : 492 - 493
  • [36] Relevance Feedback and Deep Neural Network-Based Semantic Method for Query Expansion
    Shukla, Abhishek Kumar
    Das, Sujoy
    Kumar, Pushpendra
    Alam, Afroj
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [37] QUERY EXPANSION BASED ON IMPLICIT FEEDBACK AND PSEUDO FEEDBACK
    Yang Feifei
    Gao Ling
    Han Luxia
    FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING (ICCEE 2011), 2011, : 69 - +
  • [38] Personalized Document Summarization Using Pseudo Relevance Feedback and Semantic Feature
    Park, Sun
    Cha, Byung Rae
    Kwon, JangWoo
    IETE JOURNAL OF RESEARCH, 2012, 58 (02) : 155 - 165
  • [39] Sentence similarity based on semantic nets and corpus statistics
    Li, Yuhua
    McLean, David
    Bandar, Zuhair A.
    O'Shea, James D.
    Crockett, Keeley
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (08) : 1138 - 1150
  • [40] Interdisciplinary Corpus-based Approach for Exploring Multimodal Conversational Feedback
    Boudin, Auriane
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 705 - 710