A Survey of Paraphrasing and Textual Entailment Methods

被引:169
作者
Androutsopoulos, Ion [1 ]
Malakasiotis, Prodromos [1 ]
机构
[1] Athens Univ Econ & Business, Dept Informat, GR-10434 Athens, Greece
关键词
INFORMATION EXTRACTION; SENTENCE COMPRESSION; CONSTRUCTION; CORPUS;
D O I
10.1613/jair.2985
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Paraphrasing methods recognize, generate, or extract phrases, sentences, or longer natural language expressions that convey almost the same information. Textual entailment methods, on the other hand, recognize, generate, or extract pairs of natural language expressions, such that a human who reads (and trusts) the first element of a pair would most likely infer that the other element is also true. Paraphrasing can be seen as bidirectional textual entailment and methods from the two areas are often similar. Both kinds of methods are useful, at least in principle, in a wide range of natural language processing applications, including question answering, summarization, text generation, and machine translation. We summarize key ideas from the two areas by considering in turn recognition, generation, and extraction methods, also pointing to prominent articles and resources.
引用
收藏
页码:135 / 187
页数:53
相关论文
共 229 条
  • [61] CARNAP R, 1952, PHILOS STUDIES, V3
  • [62] Charniak E, 2000, 6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, pA132
  • [63] Clarke D, 2009, Proceedings of the Workshop on Geometrical Models of Natural Language Semantics, P112
  • [64] Global inference for sentence compression an integer linear programming approach
    Clarke, James
    Lapata, Mirella
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2008, 31 : 399 - 429
  • [65] COHN T, 2008, P 22 INT C COMP LING
  • [66] Constructing Corpora for the Development and Evaluation of Paraphrase Systems
    Cohn, Trevor
    Callison-Burch, Chris
    Lapata, Mirella
    [J]. COMPUTATIONAL LINGUISTICS, 2008, 34 (04) : 597 - 614
  • [67] Sentence Compression as Tree Transduction
    Cohn, Trevor
    Lapata, Mirella
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 34 : 637 - 674
  • [68] Head-driven statistical models for natural language parsing
    Collins, M
    [J]. COMPUTATIONAL LINGUISTICS, 2003, 29 (04) : 589 - 637
  • [69] Cristianini Nello, 2000, An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods, DOI DOI 10.1017/CB09780511801389
  • [70] Culicover PW., 1968, Mech. Transl. Comput. Linguist, V11, P78