Kiwifruit Planting Entity Recognition Based on Character and Word Information Fusion

被引:0
作者
Li S. [1 ]
Zhang M. [1 ]
Liu B. [1 ]
机构
[1] College of Information Engineering, Northwest A&F University, Shaanxi, Yangling
来源
Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery | 2022年 / 53卷 / 12期
关键词
kiwifruit planting; named entity recognition; pre-trained language model; self-attention mechanisim; semantic enhancement; word fusion;
D O I
10.6041/j.issn.1000-1298.2022.12.032
中图分类号
学科分类号
摘要
Aiming at the problem of high complexity of real words and low recognition accuracy in the named entity recognition task of kiwifmit planting field, a entity recognition method of kiwifruit planting integrating character and word information was proposed. Based on BiGRU - CRF model, word level and character level information were fused. At the word level, by introducing word set information and using multiple self-attention mechanisms ( MHA) to adjust the weights of different words in the word set. At the same time, attention mechanism was used to ignore the unreliable word sets and focus on the important word sets to improve the entity recognition effect. At the character level, the unsupervised bidirectional encoder representations form transformers (BERT) pre-training model was introduced to enhance the semantic representation of words. Experiments were conducted on a homemade corpus in the kiwifruit cultivation domain containing 12 477 annotated samples and seven categories of entities, and the results showed that the Fl value of the model was improved by 1.58 percentage points compared with the SoftLexicon model. In addition, the experimental comparison of the model ResumeNER with Lattice - LSTM, WC - LSTM and other models in the open data set ResumeNER was carried out, and the best recognition effect was achieved. The Fl value reached 96. 17%, indicating that the method proposed had certain generalization ability. © 2022 Chinese Society of Agricultural Machinery. All rights reserved.
引用
收藏
页码:323 / 331
页数:8
相关论文
共 24 条
[1]  
JIANG Zhengwang, ZHONG Caihong, On the importance of popularizing kiwifruit and improving fruit quality, China Fruits, 1, pp. 1-8, (2020)
[2]  
GIZEM A, DIDEM M, SENIZ D, Et al., An evaluation of recent neural sequence tagging models in Turkish named entity recognition [J]
[3]  
Xiang LI, WEI Xiaohong, JIA Lu, Et al., Recognition of crops, diseases and pesticides named entities in Chinese based on conditional random fields[J], Transactions of the Chinese Society for Agricultural Machinery, 48, pp. 178-185, (2017)
[4]  
ZHANG Jian, WU Qing, YANG Xinyi, Et al., Chinese agricultural named entity recognition based on conditional random fields [J], Computers and Modernization, 1, pp. 123-126, (2018)
[5]  
ZHANG Dong, CHEN Wenliang, Chinese named entity recognition based on context-dependent word vector, Computer Science, 48, 3, pp. 233-238, (2021)
[6]  
SONG Linpeng, LIU Shihong, WANG Gui, Text entity extraction of agricultural technical requirements based on word vector + BiLSTM +CRF[J], Jiangsu Agricultural Sciences, 49, 5, pp. 186-193, (2021)
[7]  
XIE Congjiao, GAO Jing, CHEN Junjie, Named entity recognition for the field of crop pests and diseases [J], Journal of Inner Mongolia Agricultural University(Natural Science Edition), 43, 1, pp. 86-90, (2022)
[8]  
MENG Y, LI X, SUN X, Et al., Is word segmentation necessary for deep learning of Chinese representations [J], (2019)
[9]  
SHEN Liyan, JIANG Haiyan, HU Bin, Et al., A study on joint entity recognition and relation extraction for rice diseases pests weeds and drugs [J], Journal of Nanjing Agricultural University, 43, 6, pp. 1151-1161, (2020)
[10]  
GUO Xuchao, TANG Zhan, DIAO Lei, Et al., Named entity recognition of pests and diseases based on radical embedding and attention mechanism [J], Transactions of the Chinese Society for Agricultural Machinery, 51, pp. 335-343, (2020)