UD_BBC: Named entity recognition in social network combined BERT-BiLSTM-CRF with active learning

被引:37
作者
Li, Wei [1 ]
Du, Yajun [1 ]
Li, Xianyong [1 ]
Chen, Xiaoliang [1 ]
Xie, Chunzhi [1 ]
Li, Hui [2 ]
Li, Xiaolei [1 ]
机构
[1] Xihua Univ, Coll Comp Sci & Software Engn, Chengdu 610039, Peoples R China
[2] Lib Xihua Univ, Chengdu 610039, Peoples R China
关键词
Named entity recognition; Deep learning models; Natural language processing; Education public opinion; COMMUNITY STRUCTURE; PERFORMANCE;
D O I
10.1016/j.engappai.2022.105460
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid growth of Internet penetration, more and more people choose the Internet to express their views on topics of interest. In recent years, named entity recognition (NER) is becoming a popular task for the public to obtain structured information from public opinion text. At present, NER models with good results, such as deep learning model, need a lot of labeled data for training. However, this will give rise to a problem: labeling a large amount of data requires a lot of human resources, which is thankless in some areas. Therefore, in this paper, we propose a NER model combining active learning and deep learning methods. Firstly, the active learning method can solve the above problem. The strategy combines uncertainty-based sampling and diversity -based sampling to estimate the information of data. We use highly informative data as the initial training dataset. Secondly, this paper uses a deep learning model combining bidirectional encoder representations from Transformers, bidirectional long-short-term memory and conditional random field (BERT-BiLSTM-CRF). BERT extracts the semantic features of data, and BiLSTM predicts the probability distribution of entity labels. We use the CRF for decoding the probability distribution into corresponding entity labels. Finally, we use the initial training dataset for training BERT-BiLSTM-CRF. This model predicts the entity labels of the unlabeled data. Then, we judge if the machine-labeled data is highly reliable and expand the highly reliable data to the initial training dataset. The updated dataset retrains the NER model, so that the trained model has higher precision than the previous model. The results show that our model performs well without a large number of labeled datasets. The model achieves a precision value of 70.31%, recall rate of 74.93% and F1 score of 72.55% in the named entity recognition task, which proves the effectiveness of our model. Besides, the F1 score of BERT-BiLSTM-CRF with uncertainty-based sampling and diversity-based sampling (UD_BBC) is higher than the BiLSTM-CRF based on maximum normalized log-probability (MNLP_BiLSTM-CRF) by 9.00%, when recognizing overall entity categories. It provides a solution to the problem of named entity recognition in educational public opinion.
引用
收藏
页数:19
相关论文
共 46 条
[1]  
[Anonymous], 2014, Journal of Computational Information Systems
[2]  
[Anonymous], 2013, Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding
[3]   Deep learning-based appearance features extraction for automated carp species identification [J].
Banan, Ashkan ;
Nasiri, Amin ;
Taheri-Garavand, Amin .
AQUACULTURAL ENGINEERING, 2020, 89
[4]   Augmenting Open-Domain Event Detection with Synthetic Data from GPT-2 [J].
Ben Veyseh, Amir Pouran ;
Minh Van Nguyen ;
Min, Bonan ;
Thien Huu Nguyen .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 :644-660
[5]   Using error decay prediction to overcome practical issues of deep active learning for named entity recognition [J].
Chang, Haw-Shiuan ;
Vembu, Shankar ;
Mohan, Sunil ;
Uppaal, Rheeya ;
McCallum, Andrew .
MACHINE LEARNING, 2020, 109 (9-10) :1749-1778
[6]   Forecast of rainfall distribution based on fixed sliding window long short-term memory [J].
Chen, Chengcheng ;
Zhang, Qian ;
Kashani, Mahsa H. ;
Jun, Changhyun ;
Bateni, Sayed M. ;
Band, Shahab S. ;
Dash, Sonam Sandeep ;
Chau, Kwok-Wing .
ENGINEERING APPLICATIONS OF COMPUTATIONAL FLUID MECHANICS, 2022, 16 (01) :248-261
[7]   A Hyperspectral Image Classification Method Using Multifeature Vectors and Optimized KELM [J].
Chen, Huayue ;
Miao, Fang ;
Chen, Yijia ;
Xiong, Yijun ;
Chen, Tao .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 :2781-2795
[8]   Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training [J].
Chen, Yao ;
Zhou, Changjiang ;
Li, Tianxin ;
Wu, Hong ;
Zhao, Xia ;
Ye, Kai ;
Liao, Jun .
JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 96
[9]   A study of active learning methods for named entity recognition in clinical text [J].
Chen, Yukun ;
Lasko, Thomas A. ;
Mei, Qiaozhu ;
Denny, Joshua C. ;
Xu, Hua .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 :11-18
[10]   Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition [J].
Cho, Minsoo ;
Ha, Jihwan ;
Park, Chihyun ;
Park, Sanghyun .
JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 103