Chinese Grammatical Error Correction Based on Convolutional Sequence to Sequence Model

被引:12
作者
Li, Si [1 ]
Zhao, Jianbo [1 ]
Shi, Guirong [2 ]
Tan, Yuanpeng [3 ]
Xu, Huifang [3 ]
Chen, Guang [1 ]
Lan, Haibo [2 ]
Lin, Zhiqing [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
[2] State Grid Jibei Elect Power Co Ltd, Beijing 100053, Peoples R China
[3] China Elect Power Res Inst, Beijing 100192, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
关键词
Chinese grammatical error correction; sequence to sequence; convolutional;
D O I
10.1109/ACCESS.2019.2917631
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Chinese grammatical error correction (CGEC) is practically useful for learners of Chinese as a second language, but it is a rather challenging task due to the complex and flexible nature of Chinese language so that existing methods for English cannot be directly applied. In this paper, we introduce a convolutional sequence to sequence model into the CGEC task for the first time, since many Chinese grammatical errors are concentrated between three and four words and convolutional neural network can better capture the local context. A convolution-based model can obtain the representations of the context by fixed size kernel. By stacking convolution layers, long-term dependences can be obtained. We also propose two optimization methods, shared embedding and policy gradient, to optimize the convolutional sequence to sequence model through sharing parameters and reconstructing loss function. Besides, we collate the existing Chinese grammatical correction corpus in detail. The results show that the models we proposed two different optimization methods both achieve large improvement compared with the natural machine translation model based on a recurrent neural network.
引用
收藏
页码:72905 / 72913
页数:9
相关论文
共 34 条
  • [1] [Anonymous], 2013, P 7 SIGHAN WORKSH CH
  • [2] Bengio Y, 2013, INT CONF ACOUST SPEE, P8624, DOI 10.1109/ICASSP.2013.6639349
  • [3] Chang T.-H., 2013, P 7 SIGHAN WORKSH CH, P97
  • [4] Chen H.-H., 2016, P NLPTEA OS JAP
  • [5] Chen S.-H., 2016, P 3 WORKSH NAT LANG, P131
  • [6] Cheng S.M., 2014, P COLING 2014 25 INT, P279
  • [7] Chollampatt S, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2528
  • [8] Chollampatt Shamil, 2018, Proceedings of the 27th International Conference on Computational Linguistics, P2730
  • [9] Dauphin YN, 2017, PR MACH LEARN RES, V70
  • [10] Fu RJ, 2018, NATURAL LANGUAGE PROCESSING TECHNIQUES FOR EDUCATIONAL APPLICATIONS, P52