Question analysis: How Watson reads a clue

被引：38

作者：

Lally, A. ^{[1
]}

Prager, J. M. ^{[1
]}

McCord, M. C. ^{[1
]}

Boguraev, B. K. ^{[1
]}

Patwardhan, S. ^{[1
]}

Fan, J. ^{[1
]}

Fodor, P. ^{[2
]}

Chu-Carroll, J. ^{[1
]}

机构：

[1] IBM Corp, Div Res, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA

[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA

来源：

IBM JOURNAL OF RESEARCH AND DEVELOPMENT | 2012年 / 56卷 / 3-4期

关键词：

Natural language processing systems - Classification (of information);

D O I：

10.1147/JRD.2012.2184637

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The first stage of processing in the IBM Watson (TM) system is to perform a detailed analysis of the question in order to determine what it is asking for and how best to approach answering it. Question analysis uses Watson's parsing and semantic analysis capabilities: a deep Slot Grammar parser, a named entity recognizer, a co-reference resolution component, and a relation extraction component. We apply numerous detection rules and classifiers using features from this analysis to detect critical elements of the question, including: 1) the part of the question that is a reference to the answer (the focus); 2) terms in the question that indicate what type of entity is being asked for (lexical answer types); 3) a classification of the question into one or more of several broad types; and 4) elements of the question that play particular roles that may require special handling, for example, nested subquestions that must be separately answered. We describe how these elements are detected and evaluate the impact of accurate detection on our end-to-end question-answering system accuracy.

引用

页数：14

共 30 条

[1]

[Anonymous], IBM J RES DEV

[2]

[Anonymous], IBM J RES DEV

[3]

Bunescu R., 2010, P 11 INT C INT TEXT

[4]

Chu-Carroll J., 2003, P TEXT RETREIVAL C

[5]

Covington MichaelA., 1994, Natural language processing for Prolog programmers

[6]

Fellbaum C, 1998, LANG SPEECH & COMMUN, P1

[7] Building an example application with the Unstructured Information Management Architecture [J].

Ferrucci, D ;

Lally, A .

IBM SYSTEMS JOURNAL, 2004, 43 (03) :455-475

[8] Building Watson: An Overview of the DeepQA Project [J].

Ferrucci, David ;

Brown, Eric ;

Chu-Carroll, Jennifer ;

Fan, James ;

Gondek, David ;

Kalyanpur, Aditya A. ;

Lally, Adam ;

Murdock, J. William ;

Nyberg, Eric ;

Prager, John ;

Schlaefer, Nico ;

Welty, Chris .

AI MAGAZINE, 2010, 31 (03) :59-79

[9]

Giampiccolo D., 2007, ADV MULTILINGUAL MUL, V5152, P200

[10]

Gondek David, 2012, IBM J RES DEV, V56

← 1 2 3 →