Query Phrase Expansion Using Wikipedia in Patent Class Search

被引:0
作者
Al-Shboul, Bashar [1 ]
Myaeng, Sung-Hyon [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Taejon 305701, South Korea
来源
INFORMATION RETRIEVAL TECHNOLOGY | 2011年 / 7097卷
关键词
Pseudo-Relevance Feedback; Patent Information Retrieval; Wikipedia Categories; Query Expansion; Phrase-based Query Expansion; INFORMATION-RETRIEVAL; LEXICAL COHESION; TERMS; RELEVANCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Relevance Feedback methods generally suffer from topic drift caused by words ambiguity and synonymous uses of words. As a way to alleviate the inherent problem, we propose a novel query phrase expansion approach utilizing semantic annotations in Wikipedia pages, trying to enrich queries with context disambiguating phrases. Focusing on the patent domain, especially on patent search where patents are classified into a hierarchy of categories, we attempt to understand the roles of phrases and words in query expansion in determining the relevance of documents and examine their contributions to alleviating the query drift problem. Our approach is compared against Relevance Model, a state-of-the-art, to show its superiority in terms of MAP on all levels of the classification hierarchy.
引用
收藏
页码:115 / 126
页数:12
相关论文
共 30 条
[1]  
Al-Shboul B., 2010, P NTCIR 8
[2]   Phase-based information retrieval [J].
Arampatzis, AT ;
Tsoris, T ;
Koster, CHA ;
Van der Weide, TP .
INFORMATION PROCESSING & MANAGEMENT, 1998, 34 (06) :693-707
[3]  
Arguello J., 2008, P ICWSM 2008
[4]  
Azzopardi L., 2010, P SIGIR 2010
[5]   Adapting information retrieval to query contexts [J].
Bai, Jing ;
Nie, Jian-Yun .
INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (06) :1901-1922
[6]  
Banerjee S., 2007, P SIGIR 2007
[7]  
Cao G., 2008, P SIGIR 2008
[8]  
Croft B., 2008, P SIGIR 2008
[9]   Query expansion by mining user logs [J].
Cui, H ;
Wen, JR ;
Nie, JY ;
Ma, WY .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (04) :829-839
[10]  
Ganesh S., 2009, RECENT ADV NATURAL L