A Query Expansion Technique Using the EWC Semantic Relatedness Measure

被引:0
作者
Klyuev, Vitaly [1 ]
Haralambous, Yannis [2 ]
机构
[1] Univ Aizu, Aizu Wakamatsu, Fukushima 9658580, Japan
[2] Inst Telecom Telecom Bretagne, Dept Informat, Lab STICC Technopole Brest Iroise, UMR CNRS 3192, F-29238 Brest 3, France
来源
INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS | 2011年 / 35卷 / 04期
关键词
relatedness measure; Wikipedia; WordNet; search engine; query expansion;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper analyses the efficiency of the EWC semantic relatedness measure in an ad-hoc retrieval task. This measure combines the Wikipedia-based Explicit Semantic Analysis (ESA) measure, the WordNet path measure and the mixed collocation index. EWC considers encyclopaedic, ontological, and collocational knowledge about terms. This advantage of EWC is a key factor to find precise terms for automatic query expansion. In the experiments, the open source search engine Terrier is utilised as a tool to index and retrieve data. The proposed technique is tested on the NTCIR data collection. The experiments demonstrated superiority of EWC over ESA.
引用
收藏
页码:401 / 406
页数:6
相关论文
共 20 条
[1]  
Aguera R.P., 2008, RES COMPUTING SCI, P177
[2]   Probabilistic models of information retrieval based on measuring the divergence from randomness [J].
Amati, G ;
Van Rijsbergen, CJ .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2002, 20 (04) :357-389
[3]  
[Anonymous], 1999, NTCIR 1 CLIR DATA CO
[4]  
Ben He, 2009, P 31 EUR C INF RETR
[5]   Stratified analysis of AOL query log [J].
Brenes, David J. ;
Gayo-Avello, Daniel .
INFORMATION SCIENCES, 2009, 179 (12) :1844-1858
[6]  
Chen Aitao, 1999, P 1 NTCIR WORKSH RES
[7]  
Cui H., 2002, P 11 INT C WORLD WID
[8]  
Egozi O., 2011, ACM T INFORM SYSTEMS, V29
[9]  
Haralambous Yannis, 2011, P IJCNLP
[10]  
Hsu MH, 2008, LECT NOTES COMPUT SC, V4993, P213