Pseudo-Relevance Feedback Based on Locally-Built Co-occurrence Graphs

被引:1
作者
Aklouche, Billel [1 ,2 ,4 ]
Bounhas, Ibrahim [1 ,4 ]
Slimani, Yahya [1 ,3 ,4 ]
机构
[1] Carthage Univ, INSAT, LISI Lab Comp Sci Ind Syst, Tunis, Tunisia
[2] La Manouba Univ, Natl Sch Comp Sci ENSI, Manouba, Tunisia
[3] La Manouba Univ, Higher Inst Multimedia Arts Manouba ISAMM, Manouba, Tunisia
[4] JARIR Joint Grp Artificial Reasoning & Informat R, Manouba, Tunisia
来源
ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2019 | 2019年 / 11695卷
关键词
Query expansion; Pseudo-relevance feedback; Term co-occurrence graph; BM25; Context window; Term's discriminative power; TERM PROXIMITY; INFORMATION-RETRIEVAL; DOCUMENT-RETRIEVAL;
D O I
10.1007/978-3-030-28730-6_7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In Information Retrieval (IR), user queries are often too short, making the selection of relevant documents hard. Pseudo-relevance feedback (PRF) is an effective method to automatically expand the query with new terms using a set of pseudo-relevant documents. However, a main issue in PRF is the selection of good expansion terms that allow improving retrieval effectiveness. In this paper, we present a new PRF method based on locally-built term co-occurrence graphs. We use a context window-based approach to construct our term co-occurrence graphs over top pseudo-relevant documents. For expansion terms selection, we propose an adapted version of the BM25 model, which allows to measure term-term similarity in co-occurrence graphs. This measure has the advantage of selecting discriminant expansion terms that are semantically related to the query as a whole. We evaluate our PRF method using four TREC collections, including the standard TREC Robust04 collection and the newest TREC Washington Post collection. Experimental results show that our proposal outperforms competitive state-of-the-art baselines and achieves significant improvements.
引用
收藏
页码:105 / 119
页数:15
相关论文
共 33 条
[1]  
Aklouche B., 2018, P 27 TEXT RETR C TRE
[2]  
Amati G., 2003, P 12 TEXT RETRIEVAL
[3]  
[Anonymous], 2008, Introduction to information retrieval
[4]  
[Anonymous], 2013, P 2013 C THEOR INF R
[5]   Improving Retrieval Performance for Verbose Queries via Axiomatic Analysis of Term Discrimination Heuristic [J].
Ariannezhad, Mozhdeh ;
Montazeralghaem, Ali ;
Zamani, Hamed ;
Shakery, Azadeh .
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, :1201-1204
[6]  
Buckley C., 1994, P 3 TEXT RETRIEVAL C
[7]   A Survey of Automatic Query Expansion in Information Retrieval [J].
Carpineto, Claudio ;
Romano, Giovanni .
ACM COMPUTING SURVEYS, 2012, 44 (01)
[8]   USING PROBABILISTIC MODELS OF DOCUMENT-RETRIEVAL WITHOUT RELEVANCE INFORMATION [J].
CROFT, WB ;
HARPER, DJ .
JOURNAL OF DOCUMENTATION, 1979, 35 (04) :285-295
[9]  
Diaz Fernando, 2016, ACL 1
[10]  
Fang Hui., 2009, Encyclopedia of Database Systems, P3493, DOI DOI 10.1007/978-0-387-39940-9