A multi-head adjacent attention-based pyramid layered model for nested named entity recognition

被引:4
作者
Cui, Shengmin [1 ]
Joe, Inwhee [1 ]
机构
[1] Hanyang Univ, Dept Comp Sci, 222 Wangsimni Ro, Seoul 04763, South Korea
关键词
Nested named entity recognition; Named entity recognition; Attention; Pyramid; Natural language processing; EXTRACTION;
D O I
10.1007/s00521-022-07747-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition (NER) is one of the widely studied natural language processing tasks in recent years. Conventional solutions treat the NER as a sequence labeling problem, but these approaches cannot handle nested NER. This is due to the fact that nested NER refers to the case where one entity contains another entity and it is not feasible to tag each token with a single tag. The pyramid model stacks L flat NER layers for prediction, which subtly enumerates all spans with length less than or equal to L. However, the original model introduces a block consisting of a convolutional layer and a bidirectional long short-term memory (Bi-LSTM) layer as the decoder, which does not consider the dependency between adjacent inputs and the Bi-LSTM cannot perform parallel computation on sequential inputs. For the purpose of improving performance and reducing the forward computation, we propose a Multi-Head Adjacent Attention-based Pyramid Layered model. In addition, when constructing a pyramid structure for span representation, the information of the intermediate words has more proportion than words on the two sides. To address this imbalance in the span representation, we fuse the output of the attention layer with the features of head and tail words when doing classification. We conducted experiments on nested NER datasets such as GENIA, SciERC, and ADE to validate the effectiveness of our proposed model.
引用
收藏
页码:2561 / 2574
页数:14
相关论文
共 50 条
  • [21] An Attention-Based ID-CNNs-CRF Model for Named Entity Recognition on Clinical Electronic Medical Records
    Gao, Ming
    Xiao, Qifeng
    Wu, Shaochun
    Deng, Kun
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 231 - 242
  • [22] Deep Learning Based Mobilenet and Multi-Head Attention Model for Facial Expression Recognition
    Nouisser, Aicha
    Zouari, Ramzi
    Kherallah, Monji
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 485 - 491
  • [23] Temporal Residual Network Based Multi-Head Attention Model for Arabic Handwriting Recognition
    Zouari, Ramzi
    Othmen, Dalila
    Boubaker, Houcine
    Kherallah, Monji
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 469 - 476
  • [24] Span-Prototype Graph Based on Graph Attention Network for Nested Named Entity Recognition
    Mu, Jichong
    Ouyang, Jihong
    Yao, Yachen
    Ren, Zongxiao
    ELECTRONICS, 2023, 12 (23)
  • [25] BidH: A Bidirectional Hierarchical Model for Nested Named Entity Recognition
    Xu, Wanyang
    Li, Wengen
    Guan, Jihong
    Zhou, Shuigeng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4600 - 4604
  • [26] Multi-Task Multi-Attention Transformer for Generative Named Entity Recognition
    Mo, Ying
    Liu, Jiahao
    Tang, Hongyin
    Wang, Qifan
    Xu, Zenglin
    Wang, Jingang
    Quan, Xiaojun
    Wu, Wei
    Li, Zhoujun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4171 - 4183
  • [27] Nested Named Entity Recognition Based on Span Boundary Perception
    Cai, Yu-Xiang
    Luo, Da
    Gan, Yang-Lei
    Hou, Rui
    Liu, Xue-Yi
    Liu, Qiao
    Shi, Xiao-Jun
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (11): : 5149 - 5162
  • [28] Document-level attention-based BiLSTM-CRF incorporating disease dictionary for disease named entity recognition
    Xu, Kai
    Yang, Zhenguo
    Kang, Peipei
    Wang, Qi
    Liu, Wenyin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 108 : 122 - 132
  • [29] HTMapper: Bidirectional Head-Tail Mapping for Nested Named Entity Recognition
    Zhao, Jin
    Li, Zhixu
    Xiao, Yanghua
    Liang, Jiaqing
    Liu, Jingping
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 3433 - 3443
  • [30] Named Entity Recognition From Biomedical Texts Using a Fusion Attention-Based BiLSTM-CRF
    Wei, Hao
    Gao, Mingyuan
    Zhou, Ai
    Chen, Fei
    Qu, Wen
    Wang, Chunli
    Lu, Mingyu
    IEEE ACCESS, 2019, 7 : 73627 - 73636