An empirical assessment of ensemble methods and traditional machine for web-based attack detection in 5.0

被引:22
作者
Chakir, Oumaima [1 ]
Rehaimi, Abdeslam [1 ]
Sadqi, Yassine [1 ]
Alaoui, El Arbi Abdellaoui [2 ]
Krichen, Moez [3 ,4 ]
Gaba, Gurjot Singh [5 ]
Gurtov, Andrei [5 ]
机构
[1] USMS Univ, FPBM, Lab LIMATI, Beni Mellal, Morocco
[2] Univ Moulay Ismail, Dept Sci, IMAGE Lab, ENS,IEVIA Team, Meknes, Morocco
[3] Al Baha Univ, Fac CSIT, Al Bahah, Saudi Arabia
[4] Univ Sfax, ReDCAD Lab, Sfax, Tunisia
[5] Linkoping Univ, Sch Comp & Informat Sci, Linkoping, Sweden
关键词
Cybersecurity; Ensemble methods; Industry; 5; 0; Machine learning; Web-based attack detection; INTRUSION DETECTION SYSTEMS; PROTOCOL;
D O I
10.1016/j.jksuci.2023.02.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cybersecurity attacks that target software have become profitable and popular targets for cybercriminals who consciously take advantage of web-based vulnerabilities and execute attacks that might jeopardize essential industry 5.0 features. Several machine learning-based techniques have been developed in the literature to identify these types of assaults. In contrast to single classifiers, ensemble methods have not been evaluated empirically. To the best of our knowledge, this work is the first empirical evaluation of both homogeneous and heterogeneous ensemble approaches compared to single classifiers for web -based attack detection in industry 5.0, utilizing two of the most realistic public web-based attack data -sets. The authors divided the experiment into three main phases: In the first phase, they evaluated the performance of five well-established supervised machine learning (ML) classifiers. In the second phase, they constructed a heterogeneous ensemble of the three best-performing ML algorithms using max vot-ing and stacking methods. In the third phase, they used four well-known homogeneous ensembles to evaluate the performance of the bagging and boosting method. The results based on the ECML/PKDD 2007 and CSIC HTTP 2010 datasets revealed that bagging, particularly Random Forest, outperformed sin-gle classifiers in terms of accuracy, precision, F-value, FPR, and area of the ROC curve with values of 99.597%, 98.274%, 99.129%, 0.523%, 100 and 99.867%, 99.867%, 99.867%, 0.267%, 100, respectively. In con-trast, single classifiers performed better than boosting and stacking. However, in terms of FPR, the boost-ing exceeded single classifiers. Max voting is appropriate when accuracy, precision, and FPR are the primary concerns, whereas single classifiers can be employed when recall, FNR, training, and prediction times are critical elements. In terms of training time, ensemble approaches are more likely to be affected by data volume than single classifiers. The paper's findings will help security researchers and practition-ers identify the most efficient learning techniques for securing web applications. (c) 2023 The Author(s). Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:103 / 119
页数:17
相关论文
共 70 条
  • [21] Improving malware detection using big data and ensemble learning
    Gupta, Deepak
    Rani, Rinkle
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2020, 86
  • [22] Securing Industrial Internet of Things Against Botnet Attacks Using Hybrid Deep Learning Approach
    Hasan, Tooba
    Malik, Jahanzaib
    Bibi, Iram
    Khan, Wali Ullah
    Al-Wesabi, Fahd N.
    Dev, Kapal
    Huang, Gaojian
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 2952 - 2963
  • [23] Performance evaluation of Convolutional Neural Network for web security
    Jemal, Ines
    Haddar, Mohamed Amine
    Cheikhrouhou, Omar
    Mahfoudhi, Adel
    [J]. COMPUTER COMMUNICATIONS, 2021, 175 : 58 - 67
  • [24] Adversarial machine learning for network intrusion detection: A comparative study
    Jmila, Houda
    Ibn Khedher, Mohamed
    [J]. COMPUTER NETWORKS, 2022, 214
  • [25] Performance Analysis of Intrusion Detection Systems Using a Feature Selection Method on the UNSW-NB15 Dataset
    Kasongo, Sydney M.
    Sun, Yanxia
    [J]. JOURNAL OF BIG DATA, 2020, 7 (01)
  • [26] Defending Malicious Script Attacks Using Machine Learning Classifiers
    Khan, Nayeem
    Abdullah, Johari
    Khan, Adnan Shahid
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2017,
  • [27] Intrusion Detection in Automatic Dependent Surveillance-Broadcast (ADS-B) with Machine Learning
    Khan, Suleman
    Thorn, Joakim
    Wahlgren, Alex
    Gurtov, Andrei
    [J]. 2021 IEEE/AIAA 40TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2021,
  • [28] Intelligent intrusion detection system in smart grid using computational intelligence and machine learning
    Khan, Suleman
    Kifayat, Kashif
    Kashif Bashir, Ali
    Gurtov, Andrei
    Hassan, Mehdi
    [J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2021, 32 (06)
  • [29] Kozik R, 2014, LECT NOTES COMPUT SC, V8838, P680, DOI 10.1007/978-3-662-45237-0_61
  • [30] Machine learning algorithms for wireless sensor networks: A survey
    Kumar, D. Praveen
    Amgoth, Tarachand
    Annavarapu, Chandra Sekhara Rao
    [J]. INFORMATION FUSION, 2019, 49 : 1 - 25