Machine learning techniques for software vulnerability prediction: a comparative study

被引:0
作者
Gul Jabeen
Sabit Rahim
Wasif Afzal
Dawar Khan
Aftab Ahmed Khan
Zahid Hussain
Tehmina Bibi
机构
[1] Tsinghua University,Department of Computer Science
[2] Karakoram International University,Shenzhen Institute of Advanced Technology
[3] Mälardalen University,Department of Information Technology
[4] Chinese Academy of Sciences,Institute of Geology
[5] The University of Haripur,undefined
[6] University of Azad Jammu and Kashmir,undefined
来源
Applied Intelligence | 2022年 / 52卷
关键词
Software vulnerability; Machine learning; Prediction models;
D O I
暂无
中图分类号
学科分类号
摘要
Software vulnerabilities represent a major cause of security problems. Various vulnerability discovery models (VDMs) attempt to model the rate at which the vulnerabilities are discovered in a software. Although several VDMs have been proposed, not all of them are universally applicable. Also most of them seldom give accurate predictive results for every type of vulnerability dataset. The use of machine learning (ML) techniques has generally found success in a wide range of predictive tasks. Thus, in this paper, we conducted an empirical study on applying some well-known machine learning (ML) techniques as well as statistical techniques to predict the software vulnerabilities on a variety of datasets. The following ML techniques have been evaluated: cascade-forward back propagation neural network, feed-forward back propagation neural network, adaptive-neuro fuzzy inference system, multi-layer perceptron, support vector machine, bagging, M5Rrule, M5P and reduced error pruning tree. The following statistical techniques have been evaluated: Alhazmi-Malaiya model, linear regression and logistic regression model. The applicability of the techniques is examined using two separate approaches: goodness-of-fit to see how well the model tracks the data, and prediction capability using different criteria. It is observed that ML techniques show remarkable improvement in predicting the software vulnerabilities than the statistical vulnerability prediction models.
引用
收藏
页码:17614 / 17635
页数:21
相关论文
共 123 条
[1]  
Bhatt N(2021)Exploitability prediction of software vulnerabilities Qual Reliab Eng Int 37 648-663
[2]  
Anand A(2019)A performance evaluation of deep-learnt features for software vulnerability detection Concurrency and Computation: Practice and Experience 31 e5103-1848
[3]  
Yadavalli VenkataSS(2020)Software vulnerability detection using deep neural networks: A survey Proc IEEE 108 1825-407
[4]  
Ban X(2019)Software Vulnerability Discovery via Learning Multi-domain Knowledge Bases IEEE Trans. Dependable Secur. Comput. PP 1-690
[5]  
Liu S(2013)Vulnerability Scrying Method for Software Vulnerability Discovery Prediction Without a Vulnerability Database IEEE Trans Reliab 62 395-44292
[6]  
Chen C(2017)Periodicity in software vulnerability discovery, patching and exploitation Int J Inf Secur 16 673-707
[7]  
Chua C(2019)E-WBM : An Effort-Based Vulnerability Discovery Model IEEE Access 7 44276-707
[8]  
Lin G(2019)Vulnerability prediction capability: A comparison between vulnerability discovery models and neural network models Computers & Security 87 101596-91
[9]  
Wen S(2015)Web Application Vulnerability Prediction Using Hybrid Program Analysis and Machine Learning IEEE Transactions on Dependable and Secure Computing 12 688-318
[10]  
Han Q-L(2015)Web application vulnerability prediction using hybrid program analysis and machine learning IEEE Transactions on Dependable and Secure Computing 12 688-664