Pretrained Models with Adversarial Training for Named Entity Recognition in Scientific Text

被引：3

作者：

Ma, Hangchao ^{[1
]}

Zhang, You ^{[1
]}

Wang, Jin ^{[1
]}

机构：

[1] Yunnan Univ, Sch Informat Sci & Engn, Kunming, Yunnan, Peoples R China

来源：

2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022) | 2022年

基金：

中国国家自然科学基金;

关键词：

scientific text entity recognition; pretrained models; adversarial training;

D O I：

10.1109/IALP57159.2022.9961309

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Named entity recognition (NER) is an important fundamental task in natural language processing (NLP). This paper describes a method for named entity recognition based on pretrained models and adversarial training in scientific text. The scientific entity recognition task requires the model to identify 7 different scientific term entities. There are some issues with a given dataset, such as imbalanced label classes, excessively long entity bounds, and inconsistent entity labeling. To address these issues, we proposed to use focal loss instead of existing cross-entropy loss. Further, we used one of the common adversarial training methods, i.e., Fast Gradient Method (FGM) to perform semi-supervised NER. The experimental results show that our adversarial training method considerably enhances the performance of the model, and the method used in this study achieves the highest F-1-score of 88.92 %. Moreover, our results also prove that SciBERT is better suited to the task of named entity recognition in scientific text and that the focal loss successfully solves the problem of data imbalance.

引用

页码：259 / 264

页数：6

共 20 条

[1]

Bekoulis G, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P2830

[2]

Beltagy I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3615

[3]

C. K. LTD, DAT NLPCC2022 SHARED

[4] Improving sentiment analysis via sentence type classification using BiLSTM-CRF and CNN [J].

Chen, Tao ;

Xu, Ruifeng ;

He, Yulan ;

Wang, Xuan .

EXPERT SYSTEMS WITH APPLICATIONS, 2017, 72 :221-230

[5]

Chen X., 2018, MULTINOMIAL ADVERSAR, P1226

[6]

Devlin J., 2018, CORR

[7] Character-level neural network for biomedical named entity recognition [J].

Gridach, Mourad .

JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 70 :85-91

[8]

Gui Tao, 2017, P 2017 C EMP METH NA, P2411

[9]

Goodfellow IJ, 2015, Arxiv, DOI [arXiv:1412.6572, DOI 10.48550/ARXIV.1412.6572]

[10] Focal Loss for Dense Object Detection [J].

Lin, Tsung-Yi ;

Goyal, Priya ;

Girshick, Ross ;

He, Kaiming ;

Dollar, Piotr .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2999-3007

← 1 2 →