Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach

Cited by: 8
Authors
Zhang, Lilin [1]
Nie, Xiaolin [1]
Zhang, Mingmei [1]
Gu, Mingyang [1]
Geissen, Violette [2]
Ritsema, Coen J. [2]
Niu, Dangdang [1]
Zhang, Hongming [1]
Affiliations
[1] Northwest Agr & Forestry Univ, Coll Informat Engn, Yangling, Shaanxi, Peoples R China
[2] Wageningen Univ, Soil Phys & Land Management Grp, Wageningen, Netherlands
Source
FRONTIERS IN PLANT SCIENCE | 2022 / Vol. 13
Keywords
intelligent farming for diseases recognition; Chinese named entity recognition; kiwifruit diseases and pests; data mining; lexicon; Criss-cross attention; deep learning; machine learning; ALGORITHM; ALGEBRA;
DOI
10.3389/fpls.2022.1053449
Chinese Library Classification number
Q94 [Botany];
Discipline classification code
071001 ;
Abstract
Named Entity Recognition (NER) is a crucial step in mining information from massive agricultural texts, and it is required for building many knowledge-based agricultural support systems, such as agricultural technology question answering systems. Because Chinese agricultural text has strong domain characteristics, Chinese NER (CNER) for kiwifruit diseases and pests faces two problems: common word segmentation tools are insensitive to kiwifruit-related texts, and the feature extraction capability of the sequence encoding layer is challenged. To alleviate these problems and to mine information from kiwifruit-related texts effectively in support of such systems, this study constructed a novel Chinese agricultural NER (CANER) model, KIWINER, using statistics-based new word detection and two novel modules proposed in this paper: AttSoftlexicon (Criss-cross attention-based Softlexicon) and PCAT (Parallel connection criss-cross attention). Specifically, new words were detected to improve the adaptability of word segmentation tools to kiwifruit-related texts, and the detected words were used to construct a kiwifruit lexicon. AttSoftlexicon integrates word information into the model and makes full use of it with the help of the Criss-cross attention network (CCNet). PCAT improves the feature extraction ability of the sequence encoding layer through CCNet and a parallel connection structure. KIWINER was evaluated on four datasets, namely KIWID (self-annotated), Boson, ClueNER, and People's Daily, on which it achieved best F1-scores of 88.94%, 85.13%, 80.52%, and 92.82%, respectively. Experimental results from multiple perspectives showed that the methods proposed in this paper can effectively improve the recognition of named entities for kiwifruit diseases and pests, especially for diseases and pests with strong domain characteristics.
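The abstract says that AttSoftlexicon integrates lexicon word information into a character-level model. As a rough illustration of the lexicon-matching step that Softlexicon-style methods generally rely on, the sketch below collects, for each character, the lexicon words in which it appears at the Beginning, Middle, or End, or as a Single-character word. This is not the authors' implementation: the function name `soft_lexicon_sets`, the `max_len` parameter, and the toy lexicon are all hypothetical, and the sketch omits the frequency weighting and Criss-cross attention that the paper builds on top.

```python
from typing import Dict, List, Set


def soft_lexicon_sets(
    sentence: str, lexicon: Set[str], max_len: int = 6
) -> List[Dict[str, Set[str]]]:
    """For each character, collect matched lexicon words by position tag.

    Tags: B (word begins here), M (character is inside the word),
    E (word ends here), S (single-character word).
    Hypothetical helper for illustration only.
    """
    sets = [{"B": set(), "M": set(), "E": set(), "S": set()} for _ in sentence]
    n = len(sentence)
    for start in range(n):
        # Try every substring of up to max_len characters starting at `start`.
        for end in range(start + 1, min(n, start + max_len) + 1):
            word = sentence[start:end]
            if word not in lexicon:
                continue
            if len(word) == 1:
                sets[start]["S"].add(word)
            else:
                sets[start]["B"].add(word)
                sets[end - 1]["E"].add(word)
                for mid in range(start + 1, end - 1):
                    sets[mid]["M"].add(word)
    return sets


# Toy example (assumed lexicon): "kiwifruit", "canker", "canker disease".
toy_lexicon = {"猕猴桃", "溃疡", "溃疡病"}
toy_sets = soft_lexicon_sets("猕猴桃溃疡病", toy_lexicon)
for char, tags in zip("猕猴桃溃疡病", toy_sets):
    print(char, {tag: sorted(words) for tag, words in tags.items() if words})
```

A domain lexicon built from detected new words would take the place of `toy_lexicon` here, which is why segmentation quality on kiwifruit-related text matters for the downstream model.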
Pages: 16