Automated Extraction of Software Names from Vulnerability Reports using LSTM and Expert System

被引:0
|
作者
Khokhlov, Igor [1 ]
Okutan, Ahmet [2 ]
Bryla, Ryan [2 ]
Simmons, Steven [2 ]
Mirakhorli, Mehdi [2 ]
机构
[1] Sacred Heart Univ, Fairfield, CT 06825 USA
[2] Rochester Inst Technol, Rochester, MN USA
来源
2022 IEEE 29TH ANNUAL SOFTWARE TECHNOLOGY CONFERENCE (STC 2022) | 2022年
关键词
Common Product Enumeration; Common Vulnerability; and Exposures; Natural Language Processing; Software Product Name Extraction; Software Vulnerability;
D O I
10.1109/STC55697.2022.00024
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software vulnerabilities are closely monitored by the security community to timely address the security and privacy issues in software systems. Before a vulnerability is published by vulnerability management systems, it needs to be characterized to highlight its unique attributes, including affected software products and versions, to help security professionals prioritize their patches. Associating product names and versions with disclosed vulnerabilities may require a labor-intensive process that may delay their publication and fix, and thereby give attackers more time to exploit them. This work proposes a machine learning method to extract software product names and versions from unstructured CVE descriptions automatically. It uses Word2Vec and Char2Vec models to create context-aware features from CVE descriptions and uses these features to train a Named Entity Recognition (NER) model using bidirectional Long short-term memory (LSTM) networks. Based on the attributes of the product names and versions in previously published CVE descriptions, we created a set of Expert System (ES) rules to refine the predictions of the NER model and improve the performance of the developed method. Experiment results on real-life CVE examples indicate that using the trained NER model and the set of ES rules, software names and versions in unstructured CVE descriptions could be identified with FMeasure values above 0.95.
引用
收藏
页码:125 / 134
页数:10
相关论文
共 43 条
  • [1] Automatic event identification and extraction from daily drilling reports using an expert system and artificial intelligence
    Cinelli, Lucas P.
    de Oliveira, Jose F. L.
    de Pinho, Vinicius M.
    Passos, Wesley L.
    Padilla, Rafael
    Braz, Patrick F.
    Galves, Breno
    Dalvi, Domenica P.
    Lewenfus, Gabriela
    Ferreira, Jonathas O.
    Ji, Anthony Y. Y.
    de Oliveira, Felipe L.
    Goncalves, Clemente J. C.
    Netto, Sergio L.
    da Silva, Eduardo A. B.
    de Campos, Marcello L. R.
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2021, 205
  • [2] Automated extraction of Biomarker information from pathology reports
    Jeongeun Lee
    Hyun-Je Song
    Eunsil Yoon
    Seong-Bae Park
    Sung-Hye Park
    Jeong-Wook Seo
    Peom Park
    Jinwook Choi
    BMC Medical Informatics and Decision Making, 18
  • [3] Automated extraction of Biomarker information from pathology reports
    Lee, Jeongeun
    Song, Hyun-Je
    Yoon, Eunsil
    Park, Seong-Bae
    Park, Sung-Hye
    Seo, Jeong-Wook
    Park, Peom
    Choi, Jinwook
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
  • [4] Patching Power System Software Vulnerability Using CNNVD
    Wu, Qin-qin
    Wei, Li-hao
    Liang, Zhi-qiang
    Yu, Zhi-wen
    Chen, Min
    Chen, Zhi-hua
    Tan, Jin-jun
    2018 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATIONS AND MECHATRONICS ENGINEERING (CCME 2018), 2018, 332 : 356 - 360
  • [5] Automated image label extraction from radiology reports - A review
    Pereira, Sofia C.
    Mendonca, Ana Maria
    Campilho, Aurelio
    Sousa, Pedro
    Lopes, Carla Teixeira
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 149
  • [6] Automated Extraction of Key Entities from Thorax CT Reports Using NER with Prompt Engineering
    Akcali, Z.
    Kocak, M.
    Farzaliyeva, A.
    Guven, F.
    Ramazanoglu, M. N.
    Hasdemir, E.
    Yagdiran, B.
    Oguz, A.
    Agildere, M.
    Altundag, O.
    JOURNAL OF THORACIC ONCOLOGY, 2024, 19 (10) : S228 - S229
  • [7] Deep Learning for Automated Extraction of Primary Sites From Cancer Pathology Reports
    Qiu, John X.
    Yoon, Hong-Jun
    Fearn, Paul A.
    Tourassi, Georgia D.
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2018, 22 (01) : 244 - 251
  • [8] TTPHunter: Automated Extraction of Actionable Intelligence as TTPs from Narrative Threat Reports
    Rani, Nanda
    Saha, Bikash
    Maurya, Vikas
    Shukla, Sandeep Kumar
    PROCEEDINGS OF 2023 AUSTRALIAN COMPUTER SCIENCE WEEK, ACSW 2023, 2023, : 126 - 134
  • [9] Using Local Grammar for Entity Extraction from Clinical Reports
    Ghoulam, Aicha
    Barigou, Fatiha
    Belalem, Ghalem
    Meziane, Farid
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2015, 3 (03): : 16 - 24
  • [10] Semantic Neural Network Ensemble for Automated Dependency Relation Extraction from Bridge Inspection Reports
    Liu, Kaijian
    El-Gohary, Nora
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2021, 35 (04)