Research on Named Entity Recognition Methods in Chinese Forest Disease Texts

被引:2
作者
Wang, Qi [1 ]
Su, Xiyou [1 ]
机构
[1] Beijing Forestry Univ, Sch Informat Sci & Technol, Beijing 100083, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 08期
基金
中国国家自然科学基金;
关键词
disease; named entity recognition; multi-feature; transformer; bi-gated recurrent unit; CRF;
D O I
10.3390/app12083885
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Named entity recognition of forest diseases plays a key role in knowledge extraction in the field of forestry. The aim of this paper is to propose a named entity recognition method based on multi-feature embedding, a transformer encoder, a bi-gated recurrent unit (BiGRU), and conditional random fields (CRF). According to the characteristics of the forest disease corpus, several features are introduced here to improve the method's accuracy. In this paper, we analyze the characteristics of forest disease texts; carry out pre-processing, labeling, and extraction of multiple features; and construct forest disease texts. In the input representation layer, the method integrates multi-features, such as characters, radicals, word boundaries, and parts of speech. Then, implicit features (e.g., sentence context features) are captured through the transformer's encoding layer. The obtained features are transmitted to the BiGRU layer for further deep feature extraction. Finally, the CRF model is used to learn constraints and output the optimal annotation of disease names, damage sites, and drug entities in the forest disease texts. The experimental results on the self-built data set of forest disease texts show that the precision of the proposed method for entity recognition reached more than 93%, indicating that it can effectively solve the task of named entity recognition in forest disease texts.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] IMPROVING CHINESE NAMED ENTITY RECOGNITION WITH LEXICAL INFORMATION
    Fu, Guo-Hong
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 3487 - 3491
  • [32] Multi-Feature Fusion Transformer for Chinese Named Entity Recognition
    Han, Xiaokai
    Yue, Qi
    Chu, Jing
    Han, Zhan
    Shi, Yifan
    Wang, Chengfeng
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4227 - 4232
  • [33] Named Entity Recognition in Chinese Electronic Medical Records Based on CRF
    Liu, Kaixin
    Hu, Qingcheng
    Liu, Jianwei
    Xing, Chunxiao
    2017 14TH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE (WISA 2017), 2017, : 105 - 110
  • [34] Enhanced character embedding for Chinese named entity recognition
    Jia, Bingjing
    Wu, Zhongli
    Wu, Bin
    Liu, Yutong
    Zhou, Pengpeng
    MEASUREMENT & CONTROL, 2020, 53 (9-10) : 1669 - 1681
  • [35] Data Augmentation for Chinese Clinical Named Entity Recognition
    Wang P.-H.
    Li M.-Z.
    Li S.
    Li, Si (lisi@bupt.edu.cn), 1600, Beijing University of Posts and Telecommunications (43): : 84 - 90
  • [36] Exploiting Multiple Embeddings for Chinese Named Entity Recognition
    Xu, Canwen
    Wang, Feiyang
    Han, Jialong
    Li, Chenliang
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2269 - 2272
  • [37] Chinese named entity recognition based on Transformer encoder
    Guo X.-R.
    Luo P.
    Wang W.-L.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2021, 51 (03): : 989 - 995
  • [38] Deep adaptation of CNN in Chinese named entity recognition
    Lv, Yana
    Qin, Xutong
    Du, Xiuli
    Qiu, Shaoming
    ENGINEERING REPORTS, 2023, 5 (06)
  • [39] Chinese Named Entity Recognition with Inducted Context Patterns
    Pang, Wenbo
    Fan, Xiaozhong
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 608 - 611
  • [40] Integrated Chinese Segmentation, Parsing and Named Entity Recognition
    LI Dongchen
    ZHANG Xiantao
    WU Xihong
    ChineseJournalofElectronics, 2018, 27 (04) : 756 - 760