Time series forecast modeling of vulnerabilities in the android operating system using ARIMA and deep learning methods

被引:13
作者
Gencer, Kerem [1 ]
Basciftci, Fatih [2 ]
机构
[1] Karamanoglu Mehmetbey Univ, Vocat Sch Tech Sci, Dept Comp Programming, Karaman, Turkey
[2] Selcuk Univ, Fac Technol, Dept Comp Engn, Konya, Turkey
关键词
LSTM; Time series; NVD; Android vulnerabilities; Software security; Vulnerability discovery model; SECURITY VULNERABILITIES;
D O I
10.1016/j.suscom.2021.100515
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Security vulnerability prediction models allow estimation of the number of potential vulnerabilities and evaluation of the risks caused by these vulnerabilities. In particular, for modeling the vulnerabilities that may occur in software versions over time, it is appropriate to take the necessary countermeasures. These models are crucial in areas such as determining the number of resources required to cope with security vulnerabilities. These reported vulnerabilities, we anticipate the actions of OS companies to make strategic and operational decisions such as secure deployment. The operating system includes backup provisioning, disaster recovery. Although many vulnerability predictions models have been constructed, most of these are for operating systems and internet browsers, and non-exist for the Android mobile operating system, which has the highest number of users. In contrast to other studies, the present study investigated Android vulnerabilities that directly depend on time. Time series, multilayer perceptron (MLP), convolutional neural network (CNN), long short term memory (LSTM), Convolutional LSTM (ConvLSTM) and CNN-LSTM based models were generated, and the best model, providing the lowest error rates for the prediction of future security vulnerabilities, was selected. Data for the creation of the models were obtained by filtering security vulnerabilities published in the National Vulnerability Database (NVD) using the keyword Android. It was observed that the LSTM model has an error rate of 26.830 and the ARIMA model has an error rate of 18.449. Finally, it has been determined that LSTM based algorithms reach error rates that can compete with classical time series models despite limited data.
引用
收藏
页数:11
相关论文
共 50 条
[1]  
Adhikari Ratnadip, 2013, International Journal of Surface Engineering and Interdisciplinary Materials Science, V1, P75, DOI 10.4018/jaec.2013070107
[2]   An Empirical Comparison of Machine Learning Models for Time Series Forecasting [J].
Ahmed, Nesreen K. ;
Atiya, Amir F. ;
El Gayar, Neamat ;
El-Shishiny, Hisham .
ECONOMETRIC REVIEWS, 2010, 29 (5-6) :594-621
[3]   Measuring, analyzing and predicting security vulnerabilities in software systems [J].
Alhazmi, O. H. ;
Malaiya, Y. K. ;
Ray, I. .
COMPUTERS & SECURITY, 2007, 26 (03) :219-228
[4]   Quantitative vulnerability assessment of systems software [J].
Alhazmi, OH ;
Malaiya, YK .
ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 2005 PROCEEDINGS, 2005, :615-620
[5]   Application of vulnerability discovery models to major operating systems [J].
Alhazmi, Omar H. ;
Malaiya, Yashwant K. .
IEEE TRANSACTIONS ON RELIABILITY, 2008, 57 (01) :14-22
[6]  
Alhazmi OH, 2006, P REL MAINT S, P86
[7]  
Anand A., 2016, Journal of the Indian Society for Probability and Statistics, V17, P1, DOI [DOI 10.1007/s41096-016-0006-4, 10.1007/s41096-016-0006-4, DOI 10.1007/S41096-016-0006-4]
[8]  
Anderson R.J., 2002, Open Source Software, P20
[9]  
[Anonymous], 2001, TEMEL EKONOMETRI
[10]  
[Anonymous], 2006, P 10 IASTED INT C SO