Local or global? A novel transformer for Chinese named entity recognition based on multi-view and sliding attention

Cited by: 4
Authors
Wang, Yuke [1]
Lu, Ling [1]
Yang, Wu [1]
Chen, Yinong [2]
Affiliations
[1] Chongqing Univ Technol, Sch Comp Sci & Engn, 69 Hongguang Ave, Chongqing, Peoples R China
[2] Arizona State Univ, Sch Comp & Augmented Intelligence, Tempe, AZ USA
Keywords
Chinese NER; Human semantic understanding; Multimodal information fusion; Sliding attention; CONTEXT; CONSCIOUSNESS; INFORMATION;
DOI
10.1007/s13042-023-02023-0
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The Transformer is widely used in natural language processing (NLP) tasks because of its parallelism and its ability to model long texts. However, its performance on Chinese named entity recognition (NER) is unsatisfactory. Although distance, direction, and both global and local views of the sequence are all important for NER, the traditional Transformer structure focuses only on distance and partial global information through its fully connected self-attention mechanism. In this paper, we propose a multi-view and sliding attention (MVSA) model to enhance the Transformer's ability to model Chinese character-word features in the NER task. MVSA combines directional information to extract character-word features from multiple views, proposes a weighted ternary fusion method for feature fusion, and uses a sliding attention mechanism to enhance the local representation ability of the model. Experiments on five Chinese NER datasets show that MVSA outperforms CNN-based, LSTM-based, and traditional Transformer-based models.
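The abstract contrasts fully connected self-attention with a sliding attention mechanism that strengthens local representation. The following is a minimal sketch of generic sliding-window self-attention, not the MVSA implementation: the function names, the symmetric window, and the `window` parameter are all assumptions made for illustration, since the abstract does not specify how the paper's sliding attention is defined.

```python
import math


def sliding_window_mask(seq_len, window):
    """Boolean mask: position i may attend to j only when |i - j| <= window.

    Assumption: a symmetric window; the paper may define it differently.
    """
    return [[abs(i - j) <= window for j in range(seq_len)]
            for i in range(seq_len)]


def masked_softmax(scores, mask_row):
    """Softmax over the allowed positions; masked positions get weight 0."""
    peak = max(s for s, allowed in zip(scores, mask_row) if allowed)
    exps = [math.exp(s - peak) if allowed else 0.0
            for s, allowed in zip(scores, mask_row)]
    total = sum(exps)
    return [e / total for e in exps]


def sliding_attention(queries, keys, values, window):
    """Scaled dot-product attention restricted to a local window.

    queries, keys, values: lists of equal length, each a list of floats.
    Returns (outputs, weights), where weights[i][j] is the attention
    that position i pays to position j.
    """
    seq_len, dim = len(queries), len(queries[0])
    mask = sliding_window_mask(seq_len, window)
    outputs, weights = [], []
    for i in range(seq_len):
        scores = [
            sum(q * k for q, k in zip(queries[i], keys[j])) / math.sqrt(dim)
            for j in range(seq_len)
        ]
        row = masked_softmax(scores, mask[i])
        out = [
            sum(row[j] * values[j][d] for j in range(seq_len))
            for d in range(len(values[0]))
        ]
        outputs.append(out)
        weights.append(row)
    return outputs, weights


# Toy example: five "characters" with 2-dimensional embeddings.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5], [0.0, 0.0]]
outputs, weights = sliding_attention(tokens, tokens, tokens, window=1)
```

With `window=1`, each position attends only to itself and its immediate neighbours, so `weights[0][2]` is exactly zero. Fully connected self-attention corresponds to setting `window` to at least `seq_len - 1`.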
Pages: 2199-2208
Page count: 10