Vulnerability Identification and Classification Via Text Mining Bug Databases

被引:0
|
作者
Wijayasekara, Dumidu [1 ]
Manic, Milos [1 ]
McQueen, Miles [2 ]
机构
[1] Univ Idaho, Idaho Falls, ID 83402 USA
[2] Idaho Natl Lab, Idaho Falls, ID USA
来源
IECON 2014 - 40TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY | 2014年
关键词
hidden impact bugs; bug database mining; vulnerability discovery; text mining; classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As critical and sensitive systems increasingly rely on complex software systems, identifying software vulnerabilities is becoming increasingly important. It has been suggested in previous work that some bugs are only identified as vulnerabilities long after the bug has been made public. These bugs are known as Hidden Impact Bugs (HIBs). This paper presents a hidden impact bug identification methodology by means of text mining bug databases. The presented methodology utilizes the textual description of the bug report for extracting textual information. The text mining process extracts syntactical information of the bug reports and compresses the information for easier manipulation. The compressed information is then utilized to generate a feature vector that is presented to a classifier. The proposed methodology was tested on Linux vulnerabilities that were discovered in the time period from 2006 to 2011. Three different classifiers were tested and 28% to 88% of the hidden impact bugs were identified correctly by using the textual information from the bug descriptions alone. Further analysis of the Bayesian detection rate showed the applicability of the presented method according to the requirements of a development team.
引用
收藏
页码:3612 / 3618
页数:7
相关论文
共 50 条
  • [1] Identification of Security related Bug Reports via Text Mining using Supervised and Unsupervised Classification
    Goseva-Popstojanova, Katerina
    Tyo, Jacob
    2018 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2018), 2018, : 344 - 355
  • [2] Combining text mining and data mining for bug report classification
    Zhou, Yu
    Tong, Yanxiang
    Gu, Ruihang
    Gall, Harald
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2016, 28 (03) : 150 - 176
  • [3] Combining Text Mining and Data Mining for Bug Report Classification
    Zhou, Yu
    Tong, Yanxiang
    Gu, Ruihang
    Gall, Harald
    2014 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2014, : 311 - 320
  • [4] Improving classification in protein structure databases using text mining
    Antonis Koussounadis
    Oliver C Redfern
    David T Jones
    BMC Bioinformatics, 10
  • [5] Improving classification in protein structure databases using text mining
    Koussounadis, Antonis
    Redfern, Oliver C.
    Jones, David T.
    BMC BIOINFORMATICS, 2009, 10
  • [6] Mining Bug Databases for Unidentified Software Vulnerabilities
    Wijayasekara, Dumidu
    Manic, Milos
    Wright, Jason L.
    McQueen, Miles
    2012 5TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTIONS (HSI 2012), 2012, : 89 - 96
  • [7] Data mining method from text databases
    Kawano, M
    Watada, J
    Kawaura, T
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 3, PROCEEDINGS, 2005, 3683 : 1122 - 1128
  • [8] Efficient mining of association rules in text databases
    Holt, JD
    Chung, SM
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION KNOWLEDGE MANAGEMENT, CIKM'99, 1999, : 234 - 242
  • [9] Review of Text Mining Techniques for Software Bug Localization
    Tamanna
    Sangwan, Om Prakash
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 208 - 211
  • [10] Rule mining and classification in imperfect databases
    Hewawasam, KKRGK
    Premaratne, K
    Subasingha, SP
    Shyu, ML
    2005 7TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), VOLS 1 AND 2, 2005, : 661 - 668