Building a reusable test collection for question answering

被引:20
作者
Lin, J [1 ]
Katz, B
机构
[1] Univ Maryland, Coll Informat Studies, Inst Adv Comp Studies, College Pk, MD 20742 USA
[2] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
来源
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY | 2006年 / 57卷 / 07期
关键词
D O I
10.1002/asi.20348
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In contrast to traditional information retrieval systems, which return ranked lists of documents that users must manually browse through, a question answering system attempts to directly answer natural language questions posed by the user. Although such systems possess language-processing capabilities, they still rely on traditional document retrieval techniques to generate an initial candidate set of documents. In this article, the authors argue that document retrieval for question answering represents a task different from retrieving documents in response to more general retrospective information needs. Thus, to guide future system development, specialized question answering test collections must be constructed. They show that the current evaluation resources have major shortcomings; to remedy the situation, they have manually created a small, reusable question answering test collection for research purposes. In this article they describe their methodology for building this test collection and discuss issues they encountered regarding the notion of "answer correctness."
引用
收藏
页码:851 / 861
页数:11
相关论文
共 50 条
[21]  
2-3
[22]  
HIRSCHMAN L, 2001, NAT LANG ENG, V7, P275, DOI DOI 10.1017/S1351324901002807
[23]  
Hull DA, 1996, J AM SOC INFORM SCI, V47, P70, DOI 10.1002/(SICI)1097-4571(199601)47:1<70::AID-ASI7>3.0.CO
[24]  
2-#
[25]  
JONES KS, 2003, P 2 COLOGNETELSNET S
[26]  
KATZ B, 2003, P EACL 2003 WORKSH N
[27]  
Li XM, 2002, POWERCON 2002: INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY, VOLS 1-4, PROCEEDINGS, P556, DOI 10.1109/ICPST.2002.1053604
[28]  
LIGHT M, 2001, NAT LANG ENG, V7, P325
[29]  
LIN J, 2005, P 28 ANN INT ACM SIG, P392
[30]  
MIZZARO S, 1999, INT COMPUTERS, V10, P305