A network security entity recognition method based on feature template and CNN-BiLSTM-CRF

被引:38
作者
Qin, Ya [1 ,2 ]
Shen, Guo-wei [1 ,2 ]
Zhao, Wen-bo [1 ,2 ]
Chen, Yan-ping [1 ,2 ]
Yu, Miao [3 ]
Jin, Xin [4 ]
机构
[1] Guizhou Univ, Coll Comp Sci & Technol, Guiyang 550025, Guizhou, Peoples R China
[2] Guizhou Prov Key Lab Publ Big Data, Guiyang 550025, Guizhou, Peoples R China
[3] Chinese Acad Sci, Inst Informat Engn, Beijing 100093, Peoples R China
[4] Coordinat Ctr China, Natl Comp Network Emergency Response Tech Team, Beijing 100029, Peoples R China
基金
中国国家自然科学基金;
关键词
Network security entity; Security knowledge graph (SKG); Entity recognition; Feature template; Neural network;
D O I
10.1631/FITEE.1800520
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
By network security threat intelligence analysis based on a security knowledge graph (SKG), multi-source threat intelligence data can be analyzed in a fine-grained manner. This has received extensive attention. It is difficult for traditional named entity recognition methods to identify mixed security entities in Chinese and English in the field of network security, and there are difficulties in accurately identifying network security entities because of insufficient features extracted. In this paper, we propose a novel FT-CNN-BiLSTM-CRF security entity recognition method based on a neural network CNN-BiLSTM-CRF model combined with a feature template (FT). The feature template is used to extract local context features, and a neural network model is used to automatically extract character features and text global features. Experimental results showed that our method can achieve an F-score of 86% on a large-scale network security dataset and outperforms other methods.
引用
收藏
页码:872 / 884
页数:13
相关论文
共 34 条
  • [1] Bergstra J, 2012, J MACH LEARN RES, V13, P281
  • [2] Chiu J.P. Nichols., 2015, Named entity recognition with bidirectional lstm-cnns
  • [3] Collobert R., 2008, Proceedings of the 25th international conference on Machine learning, V25, P160, DOI DOI 10.1145/1390156.1390177
  • [4] Collobert R, 2011, J MACH LEARN RES, V12, P2493
  • [5] Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition
    Dong, Chuanhai
    Zhang, Jiajun
    Zong, Chengqing
    Hattori, Masanori
    Di, Hui
    [J]. NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 239 - 250
  • [6] dos Santos Ccero., 2015, Proceedings of NEWS 2015 The Fifth Named Entities Workshop, P25
  • [7] [冯艳红 Feng Yanhong], 2018, [计算机科学, Computer Science], V45, P261
  • [8] FinkelJR Manning CD, 2009, P HUMAN LANGUAGE TEC, P326
  • [9] Learning to forget: Continual prediction with LSTM
    Gers, FA
    Schmidhuber, J
    Cummins, F
    [J]. NEURAL COMPUTATION, 2000, 12 (10) : 2451 - 2471
  • [10] Goller C, 1996, IEEE IJCNN, P347, DOI 10.1109/ICNN.1996.548916