Distributed Feature Representations for Dependency Parsing

被引:11
作者
Chen, Wenliang [1 ]
Zhang, Min [1 ]
Zhang, Yue [2 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China
[2] Singapore Univ Technol & Design, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Natural language processing; dependency parsing; feature embeddings; semi-supervised approach; MODELS;
D O I
10.1109/TASLP.2014.2365359
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents an approach to automatically learning distributed representations for features to address the feature sparseness problem for dependency parsing. Borrowing terminologies from word embeddings, we call the feature representation feature embeddings. In our approach, the feature embeddings are inferred from large amounts of auto-parsed data. First, the sentences in raw data are parsed by a baseline system and we obtain dependency trees. Then, we represent each model feature using the surrounding features on the dependency trees. Based on the representation of surrounding context, we proposed two learning methods to infer feature embeddings. Finally, based on feature embeddings, we present a set of new features for graph-based dependency parsing models. The new parsers can not only make full use of well-established hand-designed features but also benefit from the hidden-class representations of features. Experiments on the standard Chinese and English data sets show that the new parser achieves significant performance improvements over a strong baseline.
引用
收藏
页码:451 / 460
页数:10
相关论文
共 52 条
  • [1] [Anonymous], 1996, P 16 C COMPUTATIONAL
  • [2] [Anonymous], 2003, Proceedings of 8th International Workshop on Parsing Technologies
  • [3] [Anonymous], 2005, P 43 ANN M ASS COMP
  • [4] [Anonymous], 2005, Aistats
  • [5] Bengio Y., 2008, Scholarpedia, V3, P3881, DOI [DOI 10.4249/SCHOLARPEDIA.3881, 10.4249/scholarpedia.3881]
  • [6] Learning Deep Architectures for AI
    Bengio, Yoshua
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01): : 1 - 127
  • [7] Bohnet B., 2010, P 23 INT C COMP LING, P89
  • [8] Bohnet J., 2012, P 2012 JOINT C EMP M, P1455
  • [9] Carreras Xavier, 2007, P CONLL SHAR TASK SE, P957
  • [10] Charniak E., 2000, BLLIP 1987 89 WSJ CO