Tree Edit Distance as a Baseline Approach for Paraphrase Representation

Cited by: 0
Authors
Vila, Marta [1]
Dras, Mark [2]
Affiliations
[1] Univ Barcelona, Gran Via 585, E-08007 Barcelona, Spain
[2] Macquarie Univ, N Ryde, NSW 2109, Australia
Source
PROCESAMIENTO DEL LENGUAJE NATURAL | 2012 / Issue 48
Keywords
Paraphrasing; tree edit distance; tree alignment
DOI
Not available
Chinese Library Classification number
H0 [Linguistics]
Subject classification codes
030303; 0501; 050102
Abstract
Finding an adequate paraphrase representation formalism is a challenging issue in Natural Language Processing. In this paper, we analyse the performance of Tree Edit Distance as a paraphrase representation baseline. Our experiments using the Edit Distance Textual Entailment Suite show that, because Tree Edit Distance is a purely syntactic approach, paraphrase alternations not based on structural reorganizations do not find an adequate representation. They also show that there is much scope for better modelling of the way trees are aligned.
Pages: 89-95
Number of pages: 7