Relevance feedback query refinement for PDF medical journal articles

被引:0
作者
Christiansen, Ammon [1 ]
Lee, D. J. [1 ]
机构
[1] Brigham Young Univ, 459 CB, Provo, UT 84604 USA
来源
19TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, PROCEEDINGS | 2006年
关键词
D O I
10.1109/CBMS.2006.140
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses relevance feedback as an alternative to keyword-based search engines for sifting through large PDF document collections and extracting the most relevant documents (especially for literature review purposes). Until now, relevance feedback has only been used in content-based image and video retrieval due to the inability to query those media types without keywords. Since PDF journal articles contain many valuable non-keyword features such as structure and formatting information as well as embedded figures, they would benefit from relevance feedback. Stripping a PDF into 'full-text" for indexing purposes disregards these important features. We discuss how they can be used to our advantage and look to integrate the wealth of knowledge from relevance feedback text-based information retrieval. We argue for the benefits of placing the burden of relevance judgement on the user rather than the retrieval system and present alternative document views that quickly allow the user to deem relevance.
引用
收藏
页码:57 / +
页数:2
相关论文
共 5 条
[1]   Xed: a new tool for eXtracting hidden structures from electronic documents [J].
Hadjar, K ;
Rigamonti, M ;
Lalanne, D ;
Ingold, R .
FIRST INTERNATIONAL WORKSHOP ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS, 2004, :212-224
[2]  
MARQUES O, 2002, IMSA, P306
[3]  
RIGAMONTI M, 2005, 8 INT C DOC AN REC I
[4]  
XU Z, 2003, ECIR, P281
[5]  
YANG M, 2004, ISSAC04