3Rs:Data Augmentation Techniques Using Document Contexts For Low-Resource Chinese Named Entity Recognition

被引:0
作者
Ying, Zheyu [1 ,2 ]
Zhang, Jinglei [1 ,2 ]
Xie, Rui [1 ]
Wen, Guochang [1 ,2 ]
Xiao, Feng [1 ,2 ]
Liu, Xueyang [1 ]
Zhang, Shikun [1 ]
机构
[1] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing, Peoples R China
[2] Peking Univ, Sch Software & Microelect, Beijing, Peoples R China
来源
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022年
关键词
Chinese NER; Data Augmentation; Document-Level; Adversarial Attack; Low-resource;
D O I
10.1109/IJCNN55064.2022.9892341
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With recent advances of neural networks and pre-training techniques, Chinese Named Entity Recognition (NER) has achieved great progress in recent years. However, NER systems still have the problem of generalization ability issues due to lack of annotated data, and current NER models mostly consider input sentences individually, which prevent models from further exploiting cross-sentence document context in training. With regard of these problems, this paper present new insights into Chinese NER and propose 3Rs: three data augmentation methods incorporating document-level information for NER through random concatenating, random swapping and random erasing, which are inspired by some multi-sample data augmentation techniques in computer vision fields, aiming to reorganize the composition of training sentences, and generate more training examples with less human efforts. We conduct extensive experiments on two Chinese datasets, and introduce a two-level attacking method to audit robustness performance. Our experiment results show that even the best model can obtain a better accuracy and robustness, especially for smaller training sets, therefore alleviating performance bottlenecks on low-resource conditions.
引用
收藏
页数:8
相关论文
共 30 条
  • [1] Devlin Jacob, 2018, ARXIV181004805
  • [2] DeVries Terrance, 2017, ARXIV170804552
  • [3] Fadaee Marzieh, 2017, ARXIV170500440
  • [4] Feng S. Y., 2021, ARXIV210503075
  • [5] Few-shot classification in Named Entity Recognition Task
    Fritzler, Alexander
    Logacheva, Varvara
    Kretov, Maksim
    [J]. SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 993 - 1000
  • [6] Gao F, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P5539
  • [7] KeepAugment: A Simple Information-Preserving Data Augmentation Approach
    Gong, Chengyue
    Wang, Dilin
    Li, Meng
    Chandra, Vikas
    Liu, Qiang
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1055 - 1064
  • [8] Gururangan Suchin, 2020, ARXIV200410964
  • [9] Hirschman L., 2001, Natural Language Engineering, V7, P275, DOI 10.1017/S1351324901002807
  • [10] Huang Zhiheng, 2015, Bidirectional lstm-crf models for sequence tagging