Recognition of Animal Drug Pathogenicity Named Entity Based on Att-Aux-BERT-BiLSTM-CRF

被引:0
作者
Yang L. [1 ]
Zhang T. [1 ]
Zheng L. [1 ,2 ]
Tian L. [1 ]
机构
[1] College of Information and Electrical Engineering, China Agricultural University, Beijing
[2] Beijing Laboratory of Food Quality and Safety, Beijing
来源
Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery | 2022年 / 53卷 / 03期
关键词
Attention mechanism; BERT; Deep learning; Named entity recognition; Veterinary drug pathogenicity;
D O I
10.6041/j.issn.1000-1298.2022.03.031
中图分类号
学科分类号
摘要
In order to solve the problems that traditional methods of veterinary drug named entity recognition rely on artificial design features, which is time-consuming and labor-consuming, and the amount of veterinary drug pathogenic corpus data is less in the process of building veterinary drug pathogenic knowledge graph, a method based on Att-Aux-BERT-BiLSTM-CRF of veterinary drug text named entity recognition model was proposed, which combined BERT-BiLSTM-CRF models by introducing attention mechanism and auxiliary classification layer.The text was vectorized by the BERT preprocessing model, and then connected to bi-directional long-short term memory network.The auxiliary classification mechanism was introduced, the output of the BERT layer was used as the auxiliary classification layer, and the output of the BiLSTM layer was used as the main classification layer. The attention mechanism was proposed to combine auxiliary classification layer with main classification layer to improve the overall performance.Finally, it was sent to conditional random field to construct an end-to-end deep learning model framework suitable for veterinary drug name entity recognition.In the experiment, totally 10 643 sentences and 485 711 characters of veterinary drug text were selected to identify four kinds of entities: drug, adverse effect, intake mode, aimal. The results showed that the model can effectively identify the entities in the veterinary drug pathogenic text, and the F1 value of recognition was 96.7%. © 2022, Chinese Society of Agricultural Machinery. All right reserved.
引用
收藏
页码:294 / 300
页数:6
相关论文
共 28 条
[1]  
LUST E B, BARTHOLD C, MALESKER M A, Et al., Human health hazards of veterinary medications: information for emergency departments, The Journal of Emergency Medicine, 40, 2, pp. 198-207, (2011)
[2]  
CAPLETON A, COURAGE C, RUMSBY P, Et al., Prioritising veterinary medicines according to their potential indirect human exposure and toxicity profile, Toxicology Letters, 163, 3, pp. 213-223, (2006)
[3]  
ZHENG Limin, QI Shanshan, TIAN Lijun, Et al., Entity relation extraction of news texts for food safety events, Transactions of the Chinese Society for Agricultural Machinery, 51, 7, pp. 244-253, (2020)
[4]  
YU Hongkui, ZHANG Huaping, LIU Qun, Et al., Chinese named entity identification using cascaded hidden Markov model, Journal of Communication, 27, 2, pp. 87-94, (2006)
[5]  
LI Xiang, WEI Xiaohong, JIA Lu, Et al., Recognition of crops diseases and pesticides named entities in Chinese based on conditional random fields, Transactions of the Chinese Society for Agricultural Machinery, 48, pp. 178-185, (2017)
[6]  
HABIBI M, WEBER L, NEVES M, Et al., Deep learning with word embeddings improves biomedical named entity recognition, Bioinformatics, 33, 14, pp. 37-48, (2017)
[7]  
SUN Juanjuan, YU Hong, FENG Yanhong, Et al., Recognition of nominated fishery domainentity based on deep learning architectures, Journal of Dalian Fisheries University, 33, 2, pp. 265-269, (2018)
[8]  
GUO Xuchao, TANG Zhan, DIAO Lei, Et al., Recognition of Chinese agricultural diseases and pests named entity with joint radical-embedding and self-attention mechanism, Transactions of the Chinese Society for Agricultural Machinery, 51, pp. 335-343, (2020)
[9]  
YANG Jinfeng, GUAN Yi, HE Bin, Et al., Corpus construction for named entities and entity relations on Chinese electronic medicalrecords, Journal of Software, 27, 11, pp. 2725-2746, (2016)
[10]  
LI Lishuang, GUO Yuankai, Biomedical named entity recognition with CNN-BLSTM-CRF, Journal of Chinese Information Processing, 32, 1, pp. 116-122, (2018)