3Rs:Data Augmentation Techniques Using Document Contexts For Low-Resource Chinese Named Entity Recognition

被引：0

作者：

Ying, Zheyu ^{[1
,2
]}

Zhang, Jinglei ^{[1
,2
]}

Xie, Rui ^{[1
]}

Wen, Guochang ^{[1
,2
]}

Xiao, Feng ^{[1
,2
]}

Liu, Xueyang ^{[1
]}

Zhang, Shikun ^{[1
]}

机构：

[1] Peking Univ, Natl Engn Res Ctr Software Engn, Beijing, Peoples R China

[2] Peking Univ, Sch Software & Microelect, Beijing, Peoples R China

来源：

2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022年

关键词：

Chinese NER; Data Augmentation; Document-Level; Adversarial Attack; Low-resource;

D O I：

10.1109/IJCNN55064.2022.9892341

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With recent advances of neural networks and pre-training techniques, Chinese Named Entity Recognition (NER) has achieved great progress in recent years. However, NER systems still have the problem of generalization ability issues due to lack of annotated data, and current NER models mostly consider input sentences individually, which prevent models from further exploiting cross-sentence document context in training. With regard of these problems, this paper present new insights into Chinese NER and propose 3Rs: three data augmentation methods incorporating document-level information for NER through random concatenating, random swapping and random erasing, which are inspired by some multi-sample data augmentation techniques in computer vision fields, aiming to reorganize the composition of training sentences, and generate more training examples with less human efforts. We conduct extensive experiments on two Chinese datasets, and introduce a two-level attacking method to audit robustness performance. Our experiment results show that even the best model can obtain a better accuracy and robustness, especially for smaller training sets, therefore alleviating performance bottlenecks on low-resource conditions.

引用

页数：8

共 30 条

[1] Devlin Jacob, 2018, ARXIV181004805
[2] DeVries Terrance, 2017, ARXIV170804552
[3] Fadaee Marzieh, 2017, ARXIV170500440
[4] Feng S. Y., 2021, ARXIV210503075
[5] Few-shot classification in Named Entity Recognition Task
Fritzler, Alexander
Logacheva, Varvara
Kretov, Maksim
[J]. SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 993 - 1000
[6] Gao F, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P5539
[7] KeepAugment: A Simple Information-Preserving Data Augmentation Approach
Gong, Chengyue
Wang, Dilin
Li, Meng
Chandra, Vikas
Liu, Qiang
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1055 - 1064
[8] Gururangan Suchin, 2020, ARXIV200410964
[9] Hirschman L., 2001, Natural Language Engineering, V7, P275, DOI 10.1017/S1351324901002807
[10] Huang Zhiheng, 2015, Bidirectional lstm-crf models for sequence tagging

← 1 2 3 →