Integrating Semantic Term Relations into Information Retrieval Systems Based on Language Models

被引:0
作者
ALMasri, Mohannad [1 ]
Tan, KianLam [1 ]
Berrut, Catherine [1 ]
Chevallet, Jean-Pierre [2 ]
Mulhem, Philippe [3 ]
机构
[1] Univ Grenoble 1, Grenoble, France
[2] Univ Grenoble 2, F-38040 Grenoble, France
[3] CNRS, MRIM Grp, LIG Lab, Grenoble, France
来源
INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2014 | 2014年 / 8870卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most information retrieval systems rely on the strict equality of terms between document and query in order to retrieve relevant documents to a given query. The term mismatch problem appears when users and documents' authors use different terms to express the same meaning. Statistical translation models are proposed as an effective way to adapt language models in order to mitigate term mismatch problem by exploiting semantic relations between terms. However, translation probability estimation is shown as a crucial and a hard practice within statistical translation models. Therefore, we present an alternative approach to statistical translation models that formally incorporates semantic relations between indexing terms into language models. Experiments on different CLEF corpora from the medical domain show a statistically significant improvement over the ordinary language models, and mostly better than translation models in retrieval performance. The improvement is related to the rate of general terms and their distribution inside the queries.
引用
收藏
页码:136 / 147
页数:12
相关论文
共 22 条
[1]  
[Anonymous], 1971, The SMART Retrieval System-Experiments in Automatic Document Processing
[2]  
[Anonymous], 1997, READINGS INFORM RETR
[3]  
[Anonymous], 2008, P 31 ANN INT ACM SIG
[4]  
[Anonymous], 2008, Introduction to information retrieval
[5]  
[Anonymous], 1998, SIGIR 98 P 21 ANN IN, DOI DOI 10.1145/290941.291008
[6]  
Aronson AR., 2006, METAMAP MAPPING TEXT
[7]  
Berger A, 1999, SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P222, DOI 10.1145/312624.312681
[8]  
Chevallet J., 2007, CIKM 2007, P495
[9]  
Chevallet JP, 2005, LECT NOTES COMPUT SC, V3411, P263
[10]   Exploiting the Similarity of Non-Matching Terms at Retrieval Time [J].
Fabio Crestani .
Information Retrieval, 2000, 2 (1) :27-47