Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction

Cited by: 121
Authors
Bryant, Christopher [1]
Felice, Mariano [1]
Briscoe, Ted [1]
Affiliations
[1] University of Cambridge, Computer Laboratory, ALTA Institute, Cambridge, England
Source
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1 | 2017
DOI
10.18653/v1/P17-1074
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Until now, error type performance for Grammatical Error Correction (GEC) systems could only be measured in terms of recall because system output is not annotated. To overcome this problem, we introduce ERRANT, a grammatical ERRor ANnotation Toolkit designed to automatically extract edits from parallel original and corrected sentences and classify them according to a new, dataset-agnostic, rule-based framework. This not only facilitates error type evaluation at different levels of granularity, but can also be used to reduce annotator workload and standardise existing GEC datasets. Human experts rated the automatic edits as "Good" or "Acceptable" in at least 95% of cases, so we applied ERRANT to the system output of the CoNLL-2014 shared task to carry out a detailed error type analysis for the first time.
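The edit extraction and per-type evaluation described in the abstract can be illustrated with a minimal sketch. It is not ERRANT's implementation: it uses Python's standard `difflib` as a stand-in for the toolkit's alignment, and the function names `extract_edits` and `score_by_type`, as well as the coarse labels `M` (missing token), `U` (unnecessary token) and `R` (replacement), are assumptions made for illustration rather than the rule-based error types the paper defines. The point is only to show that once system output is turned into labelled edits, precision can be computed per label alongside recall.

```python
from difflib import SequenceMatcher


def extract_edits(original, corrected):
    """Return token-level edits as (start, end, original_span, corrected_span, label).

    Hypothetical helper: a plain difflib alignment over whitespace tokens,
    not the alignment used by ERRANT.
    """
    src = original.split()
    tgt = corrected.split()
    matcher = SequenceMatcher(a=src, b=tgt, autojunk=False)
    labels = {"insert": "M", "delete": "U", "replace": "R"}  # illustrative labels
    edits = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            continue  # unchanged tokens are not edits
        edits.append((i1, i2, " ".join(src[i1:i2]), " ".join(tgt[j1:j2]), labels[tag]))
    return edits


def score_by_type(hyp_edits, ref_edits):
    """Return {label: (precision, recall)} by exact match of edits.

    Precision needs labelled hypothesis edits, which is why unannotated
    system output only allows recall to be measured per type.
    """
    hyp, ref = set(hyp_edits), set(ref_edits)
    scores = {}
    for label in {e[4] for e in hyp | ref}:
        tp = sum(1 for e in hyp & ref if e[4] == label)
        fp = sum(1 for e in hyp - ref if e[4] == label)
        fn = sum(1 for e in ref - hyp if e[4] == label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        scores[label] = (precision, recall)
    return scores


if __name__ == "__main__":
    original = "She go to school yesterday ."
    reference = extract_edits(original, "She went to school yesterday .")
    hypothesis = extract_edits(original, "She goes to the school yesterday .")
    print(reference)
    print(score_by_type(hypothesis, reference))
```

Exact-match scoring is the simplest possible choice here; a real evaluation would also need to handle multiple references and finer-grained error types.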
Pages: 793-805
Number of pages: 13