A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity

被引:16
|
作者
Dasgupta, Soham [2 ]
Piplai, Aritran [1 ]
Kotal, Anantaa [1 ]
Joshi, Anupam [1 ]
机构
[1] Univ Maryland Baltimore Cty, Dept Comp Sci & Elect Engn, Baltimore, MD 21228 USA
[2] Mallya Aditi Int Sch, Bengaluru, Karnataka, India
关键词
Named Entity Recognition; Deep Learning; Cybersecurity; Artificial Intelligence;
D O I
10.1109/BigData50022.2020.9378482
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named Entity Recognition (NER) is important in the cybersecurity domain. It helps researchers extract cyber threat information from unstructured text sources. The extracted cyber-entities or key expressions can be used to model a cyber-attack described in an open-source text. A large number of generalpurpose NER algorithms have been published that work well in text analysis. These algorithms do not perform well when applied to the cybersecurity domain. In the field of cybersecurity, the open-source text available varies greatly in complexity and underlying structure of the sentences. General-purpose NER algorithms can misrepresent domain-specific words, such as "malicious" and "javascript". In this paper, we compare the recent deep learning-based NER algorithms on a cybersecurity dataset. We created a cybersecurity dataset collected from various sources, including "Microsoft Security Bulletin" and "Adobe Security Updates". Some of these approaches proposed in literature were not used for Cybersecurity. Others are innovations proposed by us. This comparative study helps us identify the NER algorithms that are robust and can work well in sentences taken from a large number of cybersecurity sources. We tabulate their performance on the test set and identify the best NER algorithm for a cybersecurity corpus. We also discuss the different embedding strategies that aid in the process of NER for the chosen deep learning algorithms.
引用
收藏
页码:2596 / 2604
页数:9
相关论文
共 50 条
  • [21] Deep Learning Architectures for Named Entity Recognition: A Survey
    Thomas, Anu
    Sangeetha, S.
    ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 215 - 225
  • [22] Deep Learning Approach for Arabic Named Entity Recognition
    Gridach, Mourad
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 439 - 451
  • [23] A Comparative Study of Named Entity Recognition for Telugu
    Gorla, SaiKiranmai
    Murthy, N. L. Bhanu
    Malapati, Aruna
    PROCEEDINGS OF THE 9TH ANNUAL MEETING OF THE FORUM FOR INFORMATION RETRIEVAL EVALUATION (FIRE 2017), 2017, : 21 - 24
  • [24] Survey on Chinese named entity recognition with deep learning
    Kang Y.
    Sun L.
    Zhu R.
    Li M.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (11): : 44 - 53
  • [25] A comparative study for biomedical named entity recognition
    Xu Wang
    Chen Yang
    Renchu Guan
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 373 - 382
  • [26] A comparative study for biomedical named entity recognition
    Wang, Xu
    Yang, Chen
    Guan, Renchu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (03) : 373 - 382
  • [27] A Self-Attention-Based Approach for Named Entity Recognition in Cybersecurity
    Li, Tao
    Guo, Yuanbo
    Ju, Ankang
    2019 15TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2019), 2019, : 147 - 150
  • [28] A Comparative Study of Dictionary-based and Machine Learning-based Named Entity Recognition in Pashto
    Momand, Rafiullah
    Waseeb, Shakirullah
    Rai, Ahmad Masood Latif
    2020 4TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2020, 2020, : 96 - 101
  • [29] Joint contrastive learning and belief rule base for named entity recognition in cybersecurity
    Chenxi Hu
    Tao Wu
    Chunsheng Liu
    Chao Chang
    Cybersecurity, 7
  • [30] Joint contrastive learning and belief rule base for named entity recognition in cybersecurity
    Hu, Chenxi
    Wu, Tao
    Liu, Chunsheng
    Chang, Chao
    CYBERSECURITY, 2024, 7 (01)