An empirical assessment of ensemble methods and traditional machine for web-based attack detection in 5.0

被引:22
作者
Chakir, Oumaima [1 ]
Rehaimi, Abdeslam [1 ]
Sadqi, Yassine [1 ]
Alaoui, El Arbi Abdellaoui [2 ]
Krichen, Moez [3 ,4 ]
Gaba, Gurjot Singh [5 ]
Gurtov, Andrei [5 ]
机构
[1] USMS Univ, FPBM, Lab LIMATI, Beni Mellal, Morocco
[2] Univ Moulay Ismail, Dept Sci, IMAGE Lab, ENS,IEVIA Team, Meknes, Morocco
[3] Al Baha Univ, Fac CSIT, Al Bahah, Saudi Arabia
[4] Univ Sfax, ReDCAD Lab, Sfax, Tunisia
[5] Linkoping Univ, Sch Comp & Informat Sci, Linkoping, Sweden
关键词
Cybersecurity; Ensemble methods; Industry; 5; 0; Machine learning; Web-based attack detection; INTRUSION DETECTION SYSTEMS; PROTOCOL;
D O I
10.1016/j.jksuci.2023.02.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cybersecurity attacks that target software have become profitable and popular targets for cybercriminals who consciously take advantage of web-based vulnerabilities and execute attacks that might jeopardize essential industry 5.0 features. Several machine learning-based techniques have been developed in the literature to identify these types of assaults. In contrast to single classifiers, ensemble methods have not been evaluated empirically. To the best of our knowledge, this work is the first empirical evaluation of both homogeneous and heterogeneous ensemble approaches compared to single classifiers for web -based attack detection in industry 5.0, utilizing two of the most realistic public web-based attack data -sets. The authors divided the experiment into three main phases: In the first phase, they evaluated the performance of five well-established supervised machine learning (ML) classifiers. In the second phase, they constructed a heterogeneous ensemble of the three best-performing ML algorithms using max vot-ing and stacking methods. In the third phase, they used four well-known homogeneous ensembles to evaluate the performance of the bagging and boosting method. The results based on the ECML/PKDD 2007 and CSIC HTTP 2010 datasets revealed that bagging, particularly Random Forest, outperformed sin-gle classifiers in terms of accuracy, precision, F-value, FPR, and area of the ROC curve with values of 99.597%, 98.274%, 99.129%, 0.523%, 100 and 99.867%, 99.867%, 99.867%, 0.267%, 100, respectively. In con-trast, single classifiers performed better than boosting and stacking. However, in terms of FPR, the boost-ing exceeded single classifiers. Max voting is appropriate when accuracy, precision, and FPR are the primary concerns, whereas single classifiers can be employed when recall, FNR, training, and prediction times are critical elements. In terms of training time, ensemble approaches are more likely to be affected by data volume than single classifiers. The paper's findings will help security researchers and practition-ers identify the most efficient learning techniques for securing web applications. (c) 2023 The Author(s). Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:103 / 119
页数:17
相关论文
共 70 条
  • [51] Sadqi Yassine, 2021, Innovations in Smart Cities Applications. Proceedings of the 5th International Conference on Smart City Applications. Lecture Notes in Networks and Systems (LNNS 183), P1087, DOI 10.1007/978-3-030-66840-2_83
  • [52] A systematic review and taxonomy of web applications threats
    Sadqi, Yassine
    Maleh, Yassine
    [J]. INFORMATION SECURITY JOURNAL, 2022, 31 (01): : 1 - 27
  • [53] Ensemble learning: A survey
    Sagi, Omer
    Rokach, Lior
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (04)
  • [54] Schapire RE, 1999, IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, P1401
  • [55] Schmitt Isabell., 2012, WOOT, P34
  • [56] Dew-Cloud-Based Hierarchical Federated Learning for Intrusion Detection in IoMT
    Singh, Parminder
    Gaba, Gurjot Singh
    Kaur, Avinash
    Hedabou, Mustapha
    Gurtov, Andrei
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (02) : 722 - 731
  • [57] Smitha Rajagopal, 2019, Soft Computing and Signal Processing. Proceedings of ICSCSP 2018. Advances in Intelligent Systems and Computing (AISC 900), P119, DOI 10.1007/978-981-13-3600-3_12
  • [58] Outside the Closed World: On Using Machine Learning For Network Intrusion Detection
    Sommer, Robin
    Paxson, Vern
    [J]. 2010 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, 2010, : 305 - 316
  • [59] Ensemble learning for intrusion detection systems: A systematic mapping study and cross-benchmark evaluation
    Tama, Bayu Adhi
    Lim, Sunghoon
    [J]. COMPUTER SCIENCE REVIEW, 2021, 39
  • [60] An Enhanced Anomaly Detection in Web Traffic Using a Stack of Classifier Ensemble
    Tama, Bayu Adhi
    Nkenyereye, Lewis
    Islam, S. M. Riazul
    Kwak, Kyung-Sup
    [J]. IEEE ACCESS, 2020, 8 : 24120 - 24134