RELPRON: A Relative Clause Evaluation Data Set for Compositional Distributional Semantics

Cited by: 14

Authors:

Rimell, Laura [1]

Maillard, Jean [1]

Polajnar, Tamara [1]

Clark, Stephen [1]

Affiliations:

[1] Univ Cambridge, Comp Lab, William Gates Bldg, 15 JJ Thomson Ave, Cambridge, England

Source:

COMPUTATIONAL LINGUISTICS | 2016 / Vol. 42 / Issue 04

Funding:

Engineering and Physical Sciences Research Council (UK); European Research Council;

Keywords:

CCG;

DOI:

10.1162/COLI_a_00263

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This article introduces RELPRON, a large data set of subject and object relative clauses, for the evaluation of methods in compositional distributional semantics. RELPRON targets an intermediate level of grammatical complexity between content-word pairs and full sentences. The task involves matching terms, such as "wisdom", with representative properties, such as "quality that experience teaches". A unique feature of RELPRON is that it is built from attested properties, but without the need for them to appear in relative clause format in the source corpus. The article also presents some initial experiments on RELPRON, using a variety of composition methods including simple baselines, arithmetic operators on vectors, and finally, more complex methods in which argument-taking words are represented as tensors. The latter methods are based on the Categorial framework, which is described in detail. The results show that vector addition is difficult to beat, in line with the existing literature, but that an implementation of the Categorial framework based on the Practical Lexical Function model is able to match the performance of vector addition. The article finishes with an in-depth analysis of RELPRON, showing how results vary across subject and object relative clauses and across different head nouns, and how the methods perform on the subtasks necessary for capturing relative clause semantics, as well as providing a qualitative analysis highlighting some of the more common errors. Our hope is that the competitive results presented here, in which the best systems are on average ranking one out of every two properties correctly for a given term, will inspire new approaches to the RELPRON ranking task and other tasks based on linguistically interesting constructions.

Citation

Pages: 661-701

Page count: 41
