DABC: A Named Entity Recognition Method Incorporating Attention Mechanisms

被引:1
作者
Leng, Fangling [1 ]
Li, Fan [1 ]
Bao, Yubin [1 ]
Zhang, Tiancheng [1 ]
Yu, Ge [1 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110169, Peoples R China
基金
中国国家自然科学基金;
关键词
DeBERTa; multi-attention mechanism; BiLSTM-CRF; named entity recognition;
D O I
10.3390/math12131992
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Regarding the existing models for feature extraction of complex similar entities, there are problems in the utilization of relative position information and the ability of key feature extraction. The distinctiveness of Chinese named entity recognition compared to English lies in the absence of space delimiters, significant polysemy and homonymy of characters, diverse and common names, and a greater reliance on complex contextual and linguistic structures. An entity recognition method based on DeBERTa-Attention-BiLSTM-CRF (DABC) is proposed. Firstly, the feature extraction capability of the DeBERTa model is utilized to extract the data features; then, the attention mechanism is introduced to further enhance the extracted features; finally, BiLSTM is utilized to further capture the long-distance dependencies in the text and obtain the predicted sequences through the CRF layer, and then the entities in the text are identified. The proposed model is applied to the dataset for validation. The experiments show that the precision (P) of the proposed DABC model on the dataset reaches 88.167%, the recall (R) reaches 83.121%, and the F1 value reaches 85.024%. Compared with other models, the F1 value improves by 3 similar to 5%, and the superiority of the model is verified. In the future, it can be extended and applied to recognize complex entities in more fields.
引用
收藏
页数:15
相关论文
共 40 条
  • [1] Application of BiLSTM-CRF model with different embeddings for product name extraction in unstructured Turkish text
    Arslan, Serdar
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (15) : 8371 - 8382
  • [2] An Efficient Optimized DenseNet Model for Aspect-Based Multi-Label Classification
    Ayub, Nasir
    Tayyaba
    Hussain, Saddam
    Ullah, Syed Sajid
    Iqbal, Jawaid
    [J]. ALGORITHMS, 2023, 16 (12)
  • [3] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [4] ATBBC: Named entity recognition in emergency domains based on joint BERT-BILSTM-CRF adversarial training
    Cai, Buqing
    Tian, Shengwei
    Yu, Long
    Long, Jun
    Zhou, Tiejun
    Wang, Bo
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (02) : 4063 - 4076
  • [5] Nested Named Entity Recognition Based on Span Boundary Perception
    Cai, Yu-Xiang
    Luo, Da
    Gan, Yang-Lei
    Hou, Rui
    Liu, Xue-Yi
    Liu, Qiao
    Shi, Xiao-Jun
    [J]. Ruan Jian Xue Bao/Journal of Software, 2024, 35 (11): : 5149 - 5162
  • [6] Online biomedical named entities recognition by data and knowledge-driven model
    Cao, Lulu
    Wu, Chaochen
    Luo, Guan
    Guo, Chao
    Zheng, Anni
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 150
  • [7] Chen X., 2024, Acad. J. Comput. Inf. Sci, V7, P78
  • [8] Named entity recognition in aerospace based on multi-feature fusion transformer
    Chu, Jing
    Liu, Yumeng
    Yue, Qi
    Zheng, Zixuan
    Han, Xiaokai
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [9] Language inference-based learning for Low-Resource Chinese clinical named entity recognition using language model
    Cui, Zhaojian
    Yu, Kai
    Yuan, Zhenming
    Dong, Xiaofeng
    Luo, Weibin
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 149
  • [10] Optimizing fake news detection for Arabic context: A multitask learning approach with transformers and an enhanced Nutcracker Optimization Algorithm
    Dahou, Abdelghani
    Ewees, Ahmed A.
    Hashim, Fatma A.
    Al-qaness, Mohammed A. A.
    Orabi, Dina Ahmed
    Soliman, Eman M.
    Tag-eldin, Elsayed M.
    Aseeri, Ahmad O.
    Abd Elaziz, Mohamed
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 280