Biomedical question answering: A survey

被引:102
作者
Athenikos, Sofia J. [1 ]
Han, Hyoil [2 ]
机构
[1] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA
[2] LeMoyne Owen Coll, Dept Comp Sci, Memphis, TN 38126 USA
关键词
Biomedical question answering; Answer/reason extraction; Semantic information extraction; CLINICAL QUESTIONS; RANDOMIZED-TRIAL; PATIENT-CARE; INFORMATION; KNOWLEDGE; MEDLINE; LANGUAGE; TAXONOMY; DATABASE; DOCTORS;
D O I
10.1016/j.cmpb.2009.10.003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Objectives: In this survey, we reviewed the current state of the art in biomedical QA (Question Answering), within a broader framework of semantic knowledge-based QA approaches, and projected directions for the future research development in this critical area of intersection between Artificial Intelligence, Information Retrieval, and Biomedical Informatics. Materials and methods: We devised a conceptual framework within which to categorize current QA approaches. In particular, we used "semantic knowledge-based QA" as a category under which to subsume QA techniques and approaches, both corpus-based and knowledge base (KB)-based, that utilize semantic knowledge-informed techniques in the QA process, and we further classified those approaches into three subcategories: (1) semantics-based, (2) inference-based, and (3) logic-based. Based on the framework, we first conducted a survey of open-domain or non-biomedical-domain QA approaches that belong to each of the three subcategories. We then conducted an in-depth review of biomedical QA, by first noting the characteristics of, and resources available for, biomedical QA and then reviewing medical QA approaches and biological QA approaches, in turn. The research articles reviewed in this paper were found and selected through online searches. Results: Our review suggested the following tasks ahead for the future research development in this area: (1) Construction of domain-specific typology and taxonomy of questions (biological QA), (2) Development of more sophisticated techniques for natural language (NL) question analysis and classification, (3) Development of effective methods for answer generation from potentially conflicting evidences, (4) More extensive and integrated utilization of semantic knowledge throughout the QA process, and (5) Incorporation of logic and reasoning mechanisms for answer inference. Conclusion: Corresponding to the growth of biomedical information, there is a growing need for QA systems that can help users better utilize the ever-accumulating information. Continued research toward development of more sophisticated techniques for processing NL text, for utilizing semantic knowledge, and for incorporating logic and reasoning mechanisms, will lead to more useful QA systems. (C) 2009 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:1 / 24
页数:24
相关论文
共 137 条
[1]  
Alfonseca E., 2001, Proceedings of the Tenth Text REtrieval Conference (TREC-2001), P680
[2]   Physicians answer more clinical questions and change clinical decisions more often with synthesized evidence: A randomized trial in primary care [J].
Alper, BS ;
White, DS ;
Ge, B .
ANNALS OF FAMILY MEDICINE, 2005, 3 (06) :507-513
[3]  
Alper BS, 2001, J FAM PRACTICE, V50, P960
[4]  
[Anonymous], 2003, DESCRIPTION LOGIC HD
[5]  
[Anonymous], P ACL WORKSH MULT SU
[6]  
[Anonymous], 1998, WORKSH EV PARS SYST
[7]  
[Anonymous], P 3 INT WORKSH PARS
[8]  
Armstrong E C, 1999, WMJ, V98, P25
[9]  
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[10]  
ATHENIKOS SJ, 2008, P INT WORKS BIOM HLT, P111