Multi-level context features extraction for named entity recognition

被引:5
作者
Chang, Jun [1 ]
Han, Xiaohong [1 ]
机构
[1] Taiyuan Univ Technol, 79 West St Yingze, Taiyuan 030024, Shanxi, Peoples R China
关键词
Bi-LSTM; Sentence-level feature; Document-level feature; Layer-by-layer Residual;
D O I
10.1016/j.csl.2022.101412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bidirectional long short-term memory (Bi-LSTM), as one of the effective networks for sequence labeling tasks, is widely used in named entity recognition (NER). However, the sequential nature of Bi-LSTM and the inability to recognize multiple sentences at the same time make it impossible to obtain overall information. In this paper, to make up for the shortcomings of Bi-LSTM in extracting global information, we propose a hierarchical context model embedded with sentence level and document-level feature extraction. In sentence-level feature extraction, we use the self attention mechanism to extract sentence-level representations considering the different contribution of each word to the sentence. For document-level feature extraction, 3D convolutional neural network (CNN), which not only can extract features within sentences, but also pays attention to the sequential relationship between sentences, is used to extract document-level representations. Furthermore, we investigate a layer-by-layer residual (LBL Residual) structure to optimize each Bi-LSTM block of our model, which can solve the degradation problem that appears as the number of model layers increases. Experiments show that our model achieves results competitive with the state-of-the-art records on the CONLL-2003 and Ontonotes5.0 English datasets respectively.
引用
收藏
页数:17
相关论文
共 50 条
[11]   A named entity recognition model based on ensemble learning [J].
Zhu, Xinghui ;
Zou, Zhuoyang ;
Qiao, Bo ;
Fang, Kui ;
Chen, Yiming .
JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2021, 21 (02) :475-486
[12]   Radial Basis Function Attention for Named Entity Recognition [J].
Chen, Jiusheng ;
Xu, Xingkai ;
Zhang, Xiaoyu .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)
[13]   Chinese Medical Named Entity Recognition Based on Multi-word Segmentation and Multi-layer BILSTM [J].
Li, Dawei ;
Li, Jianqiang ;
Zhu, Zhichao ;
Mahmood, Tariq .
2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, :1414-1419
[14]   Chinese named entity recognition based on Transformer encoder [J].
Guo X.-R. ;
Luo P. ;
Wang W.-L. .
Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2021, 51 (03) :989-995
[15]   Research on Named Entity Recognition Methods for Urban Underground Space Disasters Based on Text Information Extraction [J].
Li, Zhaowen ;
Zhang, Xuedong .
GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, :547-552
[16]   A Text Classification Model via Multi-Level Semantic Features [J].
Mao, Keji ;
Xu, Jinyu ;
Yao, Xingda ;
Qiu, Jiefan ;
Chi, Kaikai ;
Dai, Guanglin .
SYMMETRY-BASEL, 2022, 14 (09)
[17]   Chinese Named Entity Recognition for Dairy Cow Diseases by Fusion of Multi-Semantic Features Using Self-Attention-Based Deep Learning [J].
Lou, Yongjun ;
Gao, Meng ;
Zhang, Shuo ;
Yang, Hongjun ;
Wang, Sicong ;
He, Yongqiang ;
Yang, Jing ;
Yang, Wenxia ;
Du, Haitao ;
Shen, Weizheng .
ANIMALS, 2025, 15 (06)
[18]   A Multi-Task BERT-BiLSTM-AM-CRF Strategy for Chinese Named Entity Recognition [J].
Xiaoyong Tang ;
Yong Huang ;
Meng Xia ;
Chengfeng Long .
Neural Processing Letters, 2023, 55 :1209-1229
[19]   A Multi-Task BERT-BiLSTM-AM-CRF Strategy for Chinese Named Entity Recognition [J].
Tang, Xiaoyong ;
Huang, Yong ;
Xia, Meng ;
Long, Chengfeng .
NEURAL PROCESSING LETTERS, 2023, 55 (02) :1209-1229
[20]   A Multi-domain Named Entity Recognition Method Based on Part-of-Speech Attention Mechanism [J].
Zhang, Shun ;
Sheng, Ying ;
Gao, Jiangfan ;
Chen, Jianhui ;
Huang, Jiajin ;
Lin, Shaofu .
COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2019, 2019, 1042 :631-644