An empirical assessment of ensemble methods and traditional machine for web-based attack detection in 5.0

被引：22

作者：

Chakir, Oumaima ^{[1
]}

Rehaimi, Abdeslam ^{[1
]}

Sadqi, Yassine ^{[1
]}

Alaoui, El Arbi Abdellaoui ^{[2
]}

Krichen, Moez ^{[3
,4
]}

Gaba, Gurjot Singh ^{[5
]}

Gurtov, Andrei ^{[5
]}

机构：

[1] USMS Univ, FPBM, Lab LIMATI, Beni Mellal, Morocco

[2] Univ Moulay Ismail, Dept Sci, IMAGE Lab, ENS,IEVIA Team, Meknes, Morocco

[3] Al Baha Univ, Fac CSIT, Al Bahah, Saudi Arabia

[4] Univ Sfax, ReDCAD Lab, Sfax, Tunisia

[5] Linkoping Univ, Sch Comp & Informat Sci, Linkoping, Sweden

来源：

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES | 2023年 / 35卷 / 03期

关键词：

Cybersecurity; Ensemble methods; Industry; 5; 0; Machine learning; Web-based attack detection; INTRUSION DETECTION SYSTEMS; PROTOCOL;

D O I：

10.1016/j.jksuci.2023.02.009

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Cybersecurity attacks that target software have become profitable and popular targets for cybercriminals who consciously take advantage of web-based vulnerabilities and execute attacks that might jeopardize essential industry 5.0 features. Several machine learning-based techniques have been developed in the literature to identify these types of assaults. In contrast to single classifiers, ensemble methods have not been evaluated empirically. To the best of our knowledge, this work is the first empirical evaluation of both homogeneous and heterogeneous ensemble approaches compared to single classifiers for web -based attack detection in industry 5.0, utilizing two of the most realistic public web-based attack data -sets. The authors divided the experiment into three main phases: In the first phase, they evaluated the performance of five well-established supervised machine learning (ML) classifiers. In the second phase, they constructed a heterogeneous ensemble of the three best-performing ML algorithms using max vot-ing and stacking methods. In the third phase, they used four well-known homogeneous ensembles to evaluate the performance of the bagging and boosting method. The results based on the ECML/PKDD 2007 and CSIC HTTP 2010 datasets revealed that bagging, particularly Random Forest, outperformed sin-gle classifiers in terms of accuracy, precision, F-value, FPR, and area of the ROC curve with values of 99.597%, 98.274%, 99.129%, 0.523%, 100 and 99.867%, 99.867%, 99.867%, 0.267%, 100, respectively. In con-trast, single classifiers performed better than boosting and stacking. However, in terms of FPR, the boost-ing exceeded single classifiers. Max voting is appropriate when accuracy, precision, and FPR are the primary concerns, whereas single classifiers can be employed when recall, FNR, training, and prediction times are critical elements. In terms of training time, ensemble approaches are more likely to be affected by data volume than single classifiers. The paper's findings will help security researchers and practition-ers identify the most efficient learning techniques for securing web applications. (c) 2023 The Author(s). Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

引用

页码：103 / 119

页数：17

共 70 条

[21] Improving malware detection using big data and ensemble learning
Gupta, Deepak
Rani, Rinkle
[J]. COMPUTERS & ELECTRICAL ENGINEERING, 2020, 86
[22] Securing Industrial Internet of Things Against Botnet Attacks Using Hybrid Deep Learning Approach
Hasan, Tooba
Malik, Jahanzaib
Bibi, Iram
Khan, Wali Ullah
Al-Wesabi, Fahd N.
Dev, Kapal
Huang, Gaojian
[J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 2952 - 2963
[23] Performance evaluation of Convolutional Neural Network for web security
Jemal, Ines
Haddar, Mohamed Amine
Cheikhrouhou, Omar
Mahfoudhi, Adel
[J]. COMPUTER COMMUNICATIONS, 2021, 175 : 58 - 67
[24] Adversarial machine learning for network intrusion detection: A comparative study
Jmila, Houda
Ibn Khedher, Mohamed
[J]. COMPUTER NETWORKS, 2022, 214
[25] Performance Analysis of Intrusion Detection Systems Using a Feature Selection Method on the UNSW-NB15 Dataset
Kasongo, Sydney M.
Sun, Yanxia
[J]. JOURNAL OF BIG DATA, 2020, 7 (01)
[26] Defending Malicious Script Attacks Using Machine Learning Classifiers
Khan, Nayeem
Abdullah, Johari
Khan, Adnan Shahid
[J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2017,
[27] Intrusion Detection in Automatic Dependent Surveillance-Broadcast (ADS-B) with Machine Learning
Khan, Suleman
Thorn, Joakim
Wahlgren, Alex
Gurtov, Andrei
[J]. 2021 IEEE/AIAA 40TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2021,
[28] Intelligent intrusion detection system in smart grid using computational intelligence and machine learning
Khan, Suleman
Kifayat, Kashif
Kashif Bashir, Ali
Gurtov, Andrei
Hassan, Mehdi
[J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2021, 32 (06)
[29] Kozik R, 2014, LECT NOTES COMPUT SC, V8838, P680, DOI 10.1007/978-3-662-45237-0_61
[30] Machine learning algorithms for wireless sensor networks: A survey
Kumar, D. Praveen
Amgoth, Tarachand
Annavarapu, Chandra Sekhara Rao
[J]. INFORMATION FUSION, 2019, 49 : 1 - 25

← 1 2 3 4 5 6 7 →