MIFM: Multi-Granularity Information Fusion Model for Chinese Named Entity Recognition

被引:9
|
作者
Zhang, Naixin [1 ,2 ,3 ]
Xu, Guangluan [1 ,2 ,3 ]
Zhang, Zequen [1 ,3 ]
Li, Feng [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Network Informat Syst Technol NTST, Beijing 100190, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
关键词
Named entity recognition; Chinese NER; reverse stacked LSTM; multi-granularity embedding;
D O I
10.1109/ACCESS.2019.2958959
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Chinese Named Entity Recognition (Chinese NER) is an important task in Chinese natural language processing field. It is difficult to identify the boundary of entities because Chinese texts lack natural delimiters to separate words. For this task, two major methods can be distinguished by the model inputs, i.e., word-based model and character-based model. However, the word-based model relies on the result of the Chinese Word Segmentation (CWS), and the character-based model cannot utilize enough word-level information. In this paper, we propose a multi-granularity information fusion model (MIFM) for the Chinese NER task. We introduce a novel multi-granularity embedding layer that utilizes the attention mechanism and an information gate to fuse the character and word level features. The results of this embedding method are dynamic and data-specific because they are calculated based on different contexts. Moreover, we apply the reverse stacked LSTM layer to gain deep semantic information for a sequence. Experiments on two benchmark datasets, MSRA and ResumeNER, show that our approach can effectively improve the performance of Chinese NER.
引用
收藏
页码:181648 / 181655
页数:8
相关论文
共 50 条
  • [1] Multimodal heterogeneous graph entity-level fusion for named entity recognition with multi-granularity visual guidance
    Gong, Yunchao
    Lv, Xueqiang
    Yuan, Zhu
    Wang, ZhaoJun
    Hu, Feng
    You, Xindong
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (16) : 23767 - 23793
  • [2] Enhanced Chinese named entity recognition with multi-granularity BERT adapter and efficient global pointer
    Zhang, Lei
    Xia, Pengfei
    Ma, Xiaoxuan
    Yang, Chengwei
    Ding, Xin
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 4473 - 4491
  • [3] Enhanced Chinese named entity recognition with multi-granularity BERT adapter and efficient global pointer
    Lei Zhang
    Pengfei Xia
    Xiaoxuan Ma
    Chengwei Yang
    Xin Ding
    Complex & Intelligent Systems, 2024, 10 : 4473 - 4491
  • [4] Psychomedical named entity recognition method based on multi-level feature extraction and multi-granularity embedding fusion
    Zixuan Liu
    Guofang Zhang
    Yanguang Shen
    Scientific Reports, 15 (1)
  • [5] Multi-granularity cross-modal representation learning for named entity recognition on social media
    Liu, Peipei
    Wang, Gaosheng
    Li, Hong
    Liu, Jie
    Ren, Yimo
    Zhu, Hongsong
    Sun, Limin
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (01)
  • [6] Chinese named entity recognition based on multi-criteria fusion
    Cai Q.
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2020, 50 (05): : 929 - 934
  • [7] A Multi-Granularity Word Fusion Method for Chinese NER
    Liu, Tong
    Gao, Jian
    Ni, Weijian
    Zeng, Qingtian
    APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [8] Fusion of multiple features for Chinese Named Entity Recognition based on CRF model
    Zhang, Yuejie
    Xu, Zhiting
    Zhang, Tao
    INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 95 - +
  • [9] A hybrid model for Chinese named entity recognition
    Sun, Xiao
    Huang, Degen
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 232 - 237
  • [10] IMPROVING CHINESE NAMED ENTITY RECOGNITION WITH LEXICAL INFORMATION
    Fu, Guo-Hong
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 3487 - 3491