Chinese named entity recognition for apple diseases and pests based on character augmentation

被引:21
作者
Zhang, Jiayu [1 ]
Guo, Mei [1 ]
Geng, Yaojun [1 ]
Li, Mei [1 ]
Zhang, Yongliang [1 ]
Geng, Nan [1 ,2 ,3 ]
机构
[1] Northwest A&F Univ, Coll Informat Engn, Yangling 712100, Shaanxi, Peoples R China
[2] Northwest A&F Univ, Key Lab Agr Internet Things, Minist Agr & Rural Affairs, Yangling 712100, Shaanxi, Peoples R China
[3] Northwest A&F Univ, Shaanxi Key Lab Agr Informat Percept & Intelligen, Yangling 712100, Shaanxi, Peoples R China
关键词
Chinese named entity recognition; Apple diseases and pests; Dictionaries and similar; Neural network; Self-made corpus;
D O I
10.1016/j.compag.2021.106464
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Aiming at the problems of Chinese named entity recognition in the field of apple diseases and pests, including various entities categories, entities with aliases or abbreviations, and the difficulty of identifying rare entities, we propose a novel Chinese named entity recognition model APD-CA based on character augmentation. Specifically, we incorporate dictionaries and similar words into the character-based BiLSTM-CRF model to augment character representation. To verify the validity of the model, experiments were performed on ApdCNER, a manually derived Chinese apple disease and pest corpus containing 21 entity categories. The experimental results show that the precision, recall, and F1-score of the APD-CA model based on ApdCNER are 92.29%, 91.99%, and 92.14%, respectively, which are improved compared with those for the baseline model and four other state-ofthe-art models. The improvement verifies that the proposed model in this paper has performance advantages in named entity recognition in the field of apple diseases and pests. Other experimental results also prove that this model has efficiency advantages and certain generalization advantages.
引用
收藏
页数:11
相关论文
共 32 条
[1]  
Aguilar G., 2017, P WNUT, P148
[2]  
Andrew M., 2000, P 17 INT C MACH LEAR, P591
[3]  
[Anonymous], 2015, ACS SYM SER
[4]   Ontology-based data acquisition model development for agricultural open data platforms and implementation of OWL2MVC tool [J].
Aydin, Sahin ;
Aydin, Mehmet N. .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 175
[5]  
Chen A., 2006, P 5 SIGHAN WORKSH CH, P173
[6]  
Cho Kyunghyun, 2014, P 2014 C EMP METH NA, P1724
[7]   Towards reliable named entity recognition in the biomedical domain [J].
Giorgi, John M. ;
Bader, Gary D. .
BIOINFORMATICS, 2020, 36 (01) :280-286
[8]  
Gui T, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P4982
[9]   Chinese agricultural diseases and pests named entity recognition with multi-scale local context features and self-attention mechanism [J].
Guo, Xuchao ;
Zhou, Han ;
Su, Jie ;
Hao, Xia ;
Tang, Zhan ;
Diao, Lei ;
Li, Lin .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 179
[10]   Deep learning with word embeddings improves biomedical named entity recognition [J].
Habibi, Maryam ;
Weber, Leon ;
Neves, Mariana ;
Wiegandt, David Luis ;
Leser, Ulf .
BIOINFORMATICS, 2017, 33 (14) :I37-I48