Domain Named Entity Recognition Combining GAN and BiLSTM-Attention-CRF

被引:0
|
作者
Zhang H. [1 ,2 ]
Guo Y. [1 ]
Li T. [1 ]
机构
[1] Department of Cryptogram Engineering, Strategic Support Force Information Engineering University, Zhengzhou
[2] Software College, Zhengzhou University, Zhengzhou
来源
Jisuanji Yanjiu yu Fazhan/Computer Research and Development | 2019年 / 56卷 / 09期
基金
中国国家自然科学基金;
关键词
BiLSTM-Attention-CRF model; Crowd annotations; Domain named entity recognition; Entity annotations consistent; Generative adversarial network (GAN);
D O I
10.7544/issn1000-1239.2019.20180733
中图分类号
学科分类号
摘要
Domain named entity recognition usually faces the lack of domain annotation data and the inconsistency of entity annotation in the same document due to the diversity of entity names in the domain. To issue the above problems, our work draws on the combination of the generative adversarial network (GAN) which can generate data and the BiLSTM-Attention-CRF model. Firstly, BiLSTM-Attention is used as the generator model of GAN, and CNN is used as the discriminant model. The two models use the crowd annotations and the expert annotations to train respectively, and integrate the positive annotation data consistent with the expert annotation data distribution from the crowd annotations to solve the problem of lack of annotation data in the domain; then we also introduce a new method to obtain the new feature representation of each word in the document through introducing a document-level global feature in the BiLSTM-Attention-CRF model in order to solve the problem of inconsistency of the entity in the same document caused by the diversification of the entity name. Finally, taking the crowd annotations in the information security field as a sample, a comprehensive horizontal evaluation experiment is carried out by learning the common features and applying them to the training BiLSTM-Attention-CRF model for the identification of named entities in the information security field. The results show that compared with the existing models and methods, the model we proposed has made great progress on various indicators, reflecting its superiority. © 2019, Science Press. All right reserved.
引用
收藏
页码:1851 / 1858
页数:7
相关论文
共 22 条
  • [1] Ratinov L., Roth D., Design challenges and misconceptions in named entity recognition, Proc of the 13th Conf on Computational Natural Language Learning, pp. 147-155, (2009)
  • [2] Finkel J.R., Grenager T., Manning C., Incorporating non-local information into information extraction systems by Gibbs sampling, Proc of the 43rd Annual Meeting of the Association for Computational Linguistics, pp. 363-370, (2005)
  • [3] Hal D., Frustratingly easy domain adaptation, Proc of the 45th Annual Meeting of the Association for Computa-Tional Linguistics, pp. 256-263, (2007)
  • [4] Snow R., O'Connor B., Jurafsky D., Et al., Cheap and fast-but is it good? evaluating non-expert annotations for natural language tasks, Proc of the Conf on Empirical Methods in Natural Language Processing, pp. 254-263, (2008)
  • [5] Rodrigues F., Pereira F., Ribeiro B., Sequence labeling with multiple annotators, Machine Learning, 95, 2, pp. 165-181, (2014)
  • [6] Zhao H., Chunyu K., Unsupervised segmentation helps supervised learning of character tagging for word segmentation and named entity recognition, Proc of the 6th SIGHAN Workshop on Chinese Language Processing, pp. 106-111, (2008)
  • [7] Huang Z., Xu W., Yu K., Bidirectional LSTM-CRF models for sequence tagging
  • [8] Ramamoorthy S., Murugan S., An attentive sequence model for adverse drug event extraction from biomedical text
  • [9] Goodfellow I.J., Pouget-Abadie J., Mirza M., Et al., Generative adversarial nets, Proc of Neural Information Processing System, pp. 2672-2680, (2014)
  • [10] Wang W., Li Z., Advance in generative adversarial network, Journal on Communications, 39, 2, pp. 135-148, (2018)