Question analysis: How Watson reads a clue

被引:38
作者
Lally, A. [1 ]
Prager, J. M. [1 ]
McCord, M. C. [1 ]
Boguraev, B. K. [1 ]
Patwardhan, S. [1 ]
Fan, J. [1 ]
Fodor, P. [2 ]
Chu-Carroll, J. [1 ]
机构
[1] IBM Corp, Div Res, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA
关键词
D O I
10.1147/JRD.2012.2184637
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The first stage of processing in the IBM Watson (TM) system is to perform a detailed analysis of the question in order to determine what it is asking for and how best to approach answering it. Question analysis uses Watson's parsing and semantic analysis capabilities: a deep Slot Grammar parser, a named entity recognizer, a co-reference resolution component, and a relation extraction component. We apply numerous detection rules and classifiers using features from this analysis to detect critical elements of the question, including: 1) the part of the question that is a reference to the answer (the focus); 2) terms in the question that indicate what type of entity is being asked for (lexical answer types); 3) a classification of the question into one or more of several broad types; and 4) elements of the question that play particular roles that may require special handling, for example, nested subquestions that must be separately answered. We describe how these elements are detected and evaluate the impact of accurate detection on our end-to-end question-answering system accuracy.
引用
收藏
页数:14
相关论文
共 30 条
  • [1] [Anonymous], IBM J RES DEV
  • [2] [Anonymous], IBM J RES DEV
  • [3] Bunescu R., 2010, P 11 INT C INT TEXT
  • [4] Chu-Carroll J., 2003, P TEXT RETREIVAL C
  • [5] Covington MichaelA., 1994, Natural language processing for Prolog programmers
  • [6] Fellbaum C, 1998, LANG SPEECH & COMMUN, P1
  • [7] Building an example application with the Unstructured Information Management Architecture
    Ferrucci, D
    Lally, A
    [J]. IBM SYSTEMS JOURNAL, 2004, 43 (03) : 455 - 475
  • [8] Building Watson: An Overview of the DeepQA Project
    Ferrucci, David
    Brown, Eric
    Chu-Carroll, Jennifer
    Fan, James
    Gondek, David
    Kalyanpur, Aditya A.
    Lally, Adam
    Murdock, J. William
    Nyberg, Eric
    Prager, John
    Schlaefer, Nico
    Welty, Chris
    [J]. AI MAGAZINE, 2010, 31 (03) : 59 - 79
  • [9] Giampiccolo D., 2007, ADV MULTILINGUAL MUL, V5152, P200
  • [10] Gondek David, 2012, IBM J RES DEV, V56