Deep Semantic Role Labeling: What Works and What's Next

被引:188
作者
He, Luheng [1 ]
Lee, Kenton [1 ]
Lewis, Mike [2 ]
Zettlemoyer, Luke [1 ,3 ]
机构
[1] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
[2] Facebook AI Res, Menlo Pk, CA USA
[3] Allen Inst Artificial Intelligence, Seattle, WA USA
来源
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1 | 2017年
基金
美国国家科学基金会;
关键词
D O I
10.18653/v1/P17-1044
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We introduce a new deep learning model for semantic role labeling (SRL) that significantly improves the state of the art, along with detailed analyses to reveal its strengths and limitations. We use a deep highway BiLSTM architecture with constrained decoding, while observing a number of recent best practices for initialization and regularization. Our 8-layer ensemble model achieves 83.2 F1 on the CoNLL 2005 test set and 83.4 F1 on CoNLL 2012, roughly a 10% relative error reduction over the previous state of the art. Extensive empirical analysis of these gains show that (1) deep models excel at recovering long-distance dependencies but can still make surprisingly obvious errors, and (2) that there is still room for syntactic parsers to improve these results.
引用
收藏
页码:473 / 483
页数:11
相关论文
共 30 条
  • [1] [Anonymous], 2014, P 2014 C EMPIRICAL M
  • [2] Bonial Claire, 2010, PROPBANK ANNOTATION
  • [3] Carreras X., 2005, P 9 C COMP NAT LANG, P152
  • [4] Charniak E, 2000, 6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, pA132
  • [5] Choe Do Kook, 2016, P 2016 C EMP METH NA
  • [6] Collobert R, 2011, J MACH LEARN RES, V12, P2493
  • [7] FitzGerald N., 2015, P 2015 C EMP METH NA, P960, DOI DOI 10.18653/V1/D15-1112
  • [8] Gal Y., 2016, ADV NEURAL INFORM PR, V29, P1019, DOI DOI 10.48550/ARXIV.1512.05287
  • [9] Multilingual Joint Parsing of Syntactic and Semantic Dependencies with a Latent Variable Model
    Henderson, James
    Merlo, Paola
    Titov, Ivan
    Musillo, Gabriele
    [J]. COMPUTATIONAL LINGUISTICS, 2013, 39 (04) : 949 - 998
  • [10] Training products of experts by minimizing contrastive divergence
    Hinton, GE
    [J]. NEURAL COMPUTATION, 2002, 14 (08) : 1771 - 1800