A multi-head adjacent attention-based pyramid layered model for nested named entity recognition

Cited by: 4
Authors
Cui, Shengmin [1 ]
Joe, Inwhee [1 ]
Affiliations
[1] Hanyang Univ, Dept Comp Sci, 222 Wangsimni Ro, Seoul 04763, South Korea
Source
NEURAL COMPUTING & APPLICATIONS | 2023 / Vol. 35 / No. 3
Keywords
Nested named entity recognition; Named entity recognition; Attention; Pyramid; Natural language processing; EXTRACTION;
DOI
10.1007/s00521-022-07747-8
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Named entity recognition (NER) is one of the most widely studied natural language processing tasks in recent years. Conventional approaches treat NER as a sequence labeling problem, but they cannot handle nested NER, where one entity contains another and tagging each token with a single label is infeasible. The pyramid model stacks L flat NER layers for prediction, which implicitly enumerates all spans of length at most L. However, the original model uses a decoder block consisting of a convolutional layer and a bidirectional long short-term memory (Bi-LSTM) layer, which neither models the dependency between adjacent inputs nor allows parallel computation over sequential inputs. To improve performance and reduce forward computation, we propose a Multi-Head Adjacent Attention-based Pyramid Layered model. In addition, when the pyramid structure builds a span representation, the intermediate words contribute more than the words at the two ends. To address this imbalance, we fuse the output of the attention layer with the features of the head and tail words during classification. Experiments on the nested NER datasets GENIA, SciERC, and ADE validate the effectiveness of the proposed model.
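The abstract's core idea, stacking L flat layers so that layer l holds representations of all spans of length l+1, can be illustrated with a minimal sketch. This is not the authors' implementation; `pyramid_spans` is a hypothetical helper that only enumerates the span indices each layer would cover, assuming each layer merges pairs of adjacent spans from the layer below.

```python
def pyramid_spans(tokens, L):
    """Enumerate all spans of length <= L, pyramid-style.

    Layer 0 holds the length-1 spans (one per token); every
    subsequent layer l merges each pair of adjacent spans from
    layer l-1 into a span that is one token longer.
    Spans are (start, end) with an exclusive end index.
    """
    layers = [[(i, i + 1) for i in range(len(tokens))]]  # layer 0
    for _ in range(1, L):
        prev = layers[-1]
        # adjacent spans (i, j) and (i+1, j+1) merge into (i, j+1)
        layers.append([(a[0], b[1]) for a, b in zip(prev, prev[1:])])
    return layers
```

For a 4-token sentence with L = 3, the three layers cover the four length-1, three length-2, and two length-3 spans, so a classifier attached to each layer can label every candidate span of length at most L without explicit enumeration.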
Pages: 2561-2574
Page count: 14
Related papers
50 records
  • [21] Attention-Based Bi-LSTM for Chinese Named Entity Recognition
    Zhang, Kai
    Ren, Weiping
    Zhang, Yangsen
    CHINESE LEXICAL SEMANTICS, CLSW 2018, 2018, 11173 : 643 - 652
  • [22] Nested Named Entity Recognition via an Independent-Layered Pretrained Model
    Jia, Liruizhi
    Liu, Shengquan
    Wei, Fuyuan
    Kong, Bo
    Wang, Guangyao
    IEEE ACCESS, 2021, 9 : 109693 - 109703
  • [23] A Nested Named Entity Recognition Model Based on Multi-agent Communication Mechanism
    Li, Canguang
    Wang, Guohua
    Cao, Jin
    Cai, Yi
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15823 - 15824
  • [24] An attention-based multi-task model for named entity recognition and intent analysis of Chinese online medical questions
    Wu, Chaochen
    Luo, Guan
    Guo, Chao
    Ren, Yin
    Zheng, Anni
    Yang, Cheng
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 108 (108)
  • [25] Attention-Based End-to-End Named Entity Recognition from Speech
    Porjazovski, Dejan
    Leinonen, Juho
    Kurimo, Mikko
    TEXT, SPEECH, AND DIALOGUE, TSD 2021, 2021, 12848 : 469 - 480
  • [26] Multi-head attention-based model for reconstructing continuous missing time series data
    Wu, Huafeng
    Zhang, Yuxuan
    Liang, Linian
    Mei, Xiaojun
    Han, Dezhi
    Han, Bing
    Weng, Tien-Hsiung
    Li, Kuan-Ching
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (18): : 20684 - 20711
  • [27] Multi-head Attention-Based Masked Sequence Model for Mapping Functional Brain Networks
    He, Mengshen
    Hou, Xiangyu
    Wang, Zhenwei
    Kang, Zili
    Zhang, Xin
    Qiang, Ning
    Ge, Bao
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT I, 2022, 13431 : 295 - 304
  • [29] Recursive label attention network for nested named entity recognition
    Kim, Hongjin
    Kim, Harksoo
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [30] Multi-head attention-based masked sequence model for mapping functional brain networks
    He, Mengshen
    Hou, Xiangyu
    Ge, Enjie
    Wang, Zhenwei
    Kang, Zili
    Qiang, Ning
    Zhang, Xin
    Ge, Bao
    FRONTIERS IN NEUROSCIENCE, 2023, 17