Cluster-based query expansion using external collections in medical information retrieval

被引:19
作者
Oh, Heung-Seon [1 ]
Jung, Yuchul [1 ]
机构
[1] Korea Inst Sci & Technol Informat, Daejeon, South Korea
关键词
Query expansion; External collections; Language models; TEXT;
D O I
10.1016/j.jbi.2015.09.017
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Utilizing external collections to improve retrieval performance is challenging research because various test collections are created for different purposes. Improving medical information retrieval has also gained much attention as various types of medical documents have become available to researchers ever since they started storing them in machine processable formats. In this paper, we propose an effective method of utilizing external collections based on the pseudo relevance feedback approach. Our method incorporates the structure of external collections in estimating individual components in the final feedback model. Extensive experiments on three medical collections (TREC CDS, CLEF eHealth, and OHSUMED) were performed, and the results were compared with a representative expansion approach utilizing the external collections to show the superiority of our method. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:70 / 79
页数:10
相关论文
共 47 条
[1]   Probabilistic models of information retrieval based on measuring the divergence from randomness [J].
Amati, G ;
Van Rijsbergen, CJ .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2002, 20 (04) :357-389
[2]  
[Anonymous], 2009, STANFORD DIGITAL LIB
[3]  
[Anonymous], 2001, Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'01, page
[4]  
[Anonymous], 1994, P TREC
[5]  
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[6]  
Babashzadeh A, 2013, SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, P801
[7]  
Blanco R, 2008, LECT NOTES COMPUT SC, V4956, P394
[8]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[9]   Incorporating Query-Specific Feedback into Learning-to-Rank Models [J].
Can, Ethem F. ;
Croft, W. Bruce ;
Manmatha, R. .
SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, :1035-1038
[10]   A Survey of Automatic Query Expansion in Information Retrieval [J].
Carpineto, Claudio ;
Romano, Giovanni .
ACM COMPUTING SURVEYS, 2012, 44 (01)