Lexicon enhanced Chinese named entity recognition with pointer network

被引:0
|
作者
Qian Guo
Yi Guo
机构
[1] East China University of Science and Technology,Department of Computer Science and Engineering
[2] National Engineering Laboratory for Big Data Distribution and Exchange Technologies,Business Intelligence and Visualization Research Center
[3] Shanghai Engineering Research Center of Big Data & Internet Audience,undefined
来源
Neural Computing and Applications | 2022年 / 34卷
关键词
Chinese named entity recognition; Lexicon enhancement; Pointer network;
D O I
暂无
中图分类号
学科分类号
摘要
In recent time, lexicon-based LSTM and pre-training language models are combined to explore the Chinese Named Entity Recognition (NER) and achieve the current state-of-the-art (SOTA) performance on several Chinese benchmark datasets. However, existing lexicon-based models only conform lexicon features through shallow and randomly initialized coding layers and do not integrate them into the bottom layer of the pre-training language model to mine the deep lexicon knowledge. To address the above issue, we propose a novel BERT-based Enhanced Lexicon Adapter (BLA) model that fuses external lexicon feature into the pre-training language model BERT in-depth. Specifically, the external lexicon knowledge is integrated into the deep Transformer layers of BERT by the lexicon adapter mechanism. With the comparison of existing methods, our model achieves the genuine deep fusion of the lexicon knowledge and BERT representation, effectively obtaining entity boundaries and word information.Besides, given the value of high-level global semantic features in alleviating word ambiguity and segmenting precisely the entity boundary in Chinese NER, transforming the sequence labeling task into sequence generation task provides the new cogitation for extracting global semantic features. Therefore, we explore the strategies of local lexicon information’s fusion and global semantic features extraction for entity category labeling. Specifically, we utilize the sequence-to-sequence (Seq2Seq) framework with pointer network as the prominent model architecture, in which the pointing function implements a custom attention mechanism and models different interactions between the source text and the semantic embedding by the generated probability ppoint\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p_{point}$$\end{document}. Furthermore, the decoder with the pointer mechanism generates the target sequence autoregressively. Experiments on several different benchmark Chinese datasets indicate that the proposed model achieves remarkable improvement compared with the current lexicon-based methods, and the results significantly outperform the current SOTA models.
引用
收藏
页码:14535 / 14555
页数:20
相关论文
共 50 条
  • [1] Lexicon enhanced Chinese named entity recognition with pointer network
    Guo, Qian
    Guo, Yi
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17): : 14535 - 14555
  • [2] Chinese Named Entity Recognition Augmented with Lexicon Memory
    Zhou, Yi
    Zheng, Xiao-Qing
    Huang, Xuan-Jing
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (05) : 1021 - 1035
  • [3] Chinese Named Entity Recognition Augmented with Lexicon Memory
    Yi Zhou
    Xiao-Qing Zheng
    Xuan-Jing Huang
    Journal of Computer Science and Technology, 2023, 38 : 1021 - 1035
  • [4] Named Entity Recognition in Classical Chinese by Lexicon Enhancement
    Yu, Jianye
    Feng, Xiangyilan
    Li, Jie
    Liu, Jialin
    2023 IEEE INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2023, : 463 - 468
  • [5] Hierarchical Lexicon Embedding Architecture for Chinese Named Entity Recognition
    Hu, Jiahao
    Ouyang, Yuanxin
    Li, Chen
    Wang, Chuanrui
    Rong, Wenge
    Xiong, Zhang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 345 - 356
  • [6] Named entity recognition for Chinese based on global pointer and adversarial training
    Li, Hongjun
    Cheng, Mingzhe
    Yang, Zelin
    Yang, Liqun
    Chua, Yansong
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [7] Named entity recognition for Chinese based on global pointer and adversarial training
    Hongjun Li
    Mingzhe Cheng
    Zelin Yang
    Liqun Yang
    Yansong Chua
    Scientific Reports, 13
  • [8] Enhanced Chinese Domain Named Entity Recognition: An Approach with Lexicon Boundary and Frequency Weight Features
    Guo, Yan
    Feng, Shixiang
    Liu, Fujiang
    Lin, Weihua
    Liu, Hongchen
    Wang, Xianbin
    Su, Junshun
    Gao, Qiankai
    APPLIED SCIENCES-BASEL, 2024, 14 (01):
  • [9] Enhanced Chinese named entity recognition with multi-granularity BERT adapter and efficient global pointer
    Zhang, Lei
    Xia, Pengfei
    Ma, Xiaoxuan
    Yang, Chengwei
    Ding, Xin
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 4473 - 4491
  • [10] Enhanced Chinese named entity recognition with multi-granularity BERT adapter and efficient global pointer
    Lei Zhang
    Pengfei Xia
    Xiaoxuan Ma
    Chengwei Yang
    Xin Ding
    Complex & Intelligent Systems, 2024, 10 : 4473 - 4491