Mitigating cyber threats through integration of feature selection and stacking ensemble learning: the LGBM and random forest intrusion detection perspective

被引:15
作者
Mishra, Amit Kumar [1 ]
Paliwal, Shweta [1 ]
机构
[1] DIT Univ, Sch Comp, Dehra Dun, Uttarakhand, India
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2023年 / 26卷 / 04期
关键词
Network security; Machine learning; Ensemble learning; Feature selection; Internet of things; DETECTION SYSTEM; FRAMEWORK; MODEL;
D O I
10.1007/s10586-022-03735-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The network traffic has observed astounding expansion and is set to explode in the next few years. Security attacks are becoming more and more synchronized as attackers are involved in using new orchestrated techniques that are capable of initiating attacks such as zero-day vector and slow loris. These attacks are surpassing the current network analytic solutions employed in the infrastructure of the network. Machine learning (ML) based approaches are successfully quelling modern-day attacks by analyzing the patterns in the encrypted network traffic. Detection strategies based on labelled datasets that are a combination of synthesized attacks and modern normal attacks became the need of the hour. In this study, three benchmark datasets; UNSWNB15, NSL- KDD, and BoT-Internet of things are a combination of modern-day orchestrated security attacks. The datasets are processed and feature selection is performed using information gain and correlation coefficient (Pearson). Once the features are identified they are subjected to the following classifiers; stacking of light gradient boosting machine (LGBM) and random forest, stochastic gradient descent, Gaussian Naive Bayes (GNB), support vector machine (SVM), bagging + reduced error pruning, K nearest neighbour and AdaBoost. Thus it has been observed that stacking of LGBM and random forest has given the highest predictions for all three datasets.
引用
收藏
页码:2339 / 2350
页数:12
相关论文
共 39 条
[31]   A new design of intrusion detection in IoT sector using optimal feature selection and high ranking-based ensemble learning model [J].
Gopalakrishnan, B. ;
Purusothaman, P. .
PEER-TO-PEER NETWORKING AND APPLICATIONS, 2022, 15 (05) :2199-2226
[32]   Intrusion Detection Technique in Wireless Sensor Network using Grid Search Random Forest with Boruta Feature Selection Algorithm [J].
Subbiah, Sridevi ;
Anbananthen, Kalaiarasi Sonai Muthu ;
Thangaraj, Saranya ;
Kannan, Subarmaniam ;
Chelliah, Deisy .
JOURNAL OF COMMUNICATIONS AND NETWORKS, 2022, 24 (02) :264-273
[33]   A hybrid feature weighted attention based deep learning approach for an intrusion detection system using the random forest algorithm [J].
Hashmi, Arshad ;
Barukab, Omar M. ;
Osman, Ahmad Hamza .
PLOS ONE, 2024, 19 (05)
[34]   An improved binary manta ray foraging optimization algorithm based feature selection and random forest classifier for network intrusion detection [J].
Hassan, Ibrahim Hayatu ;
Abdullahi, Mohammed ;
Aliyu, Mansur Masama ;
Yusuf, Sahabi Ali ;
Abdulrahim, Abdulrazaq .
INTELLIGENT SYSTEMS WITH APPLICATIONS, 2022, 16
[35]   Classification framework for faulty-software using enhanced exploratory whale optimizer-based feature selection scheme and random forest ensemble learning [J].
Mafarja, Majdi ;
Thaher, Thaer ;
Al-Betar, Mohammed Azmi ;
Too, Jingwei ;
Awadallah, Mohammed A. ;
Abu Doush, Iyad ;
Turabieh, Hamza .
APPLIED INTELLIGENCE, 2023, 53 (15) :18715-18757
[36]   Anomaly-Based Network Intrusion Detection System through Feature Selection and Hybrid Machine Learning Technique [J].
Pattawaro, Apichit ;
Polprasert, Chantri .
2018 16TH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING (ICT&KE), 2018, :64-69
[37]   Improves Intrusion Detection Performance In Wireless Sensor Networks Through Machine Learning, Enhanced By An Accelerated Deep Learning Model With Advanced Feature Selection [J].
Saleh, Hadeel M. ;
Marouane, Hend ;
Fakhfakh, Ahmed .
Iraqi Journal for Computer Science and Mathematics, 2024, 5 (03) :790-814
[38]   WS-AWRE: Intrusion Detection Using Optimized Whale Sine Feature Selection and Artificial Neural Network (ANN) Weighted Random Forest Classifier [J].
Aldabash, Omar Abdulkhaleq ;
Akay, Mehmet Fatih .
APPLIED SCIENCES-BASEL, 2024, 14 (05)
[39]   Classification framework for faulty-software using enhanced exploratory whale optimizer-based feature selection scheme and random forest ensemble learning [J].
Majdi Mafarja ;
Thaer Thaher ;
Mohammed Azmi Al-Betar ;
Jingwei Too ;
Mohammed A. Awadallah ;
Iyad Abu Doush ;
Hamza Turabieh .
Applied Intelligence, 2023, 53 :18715-18757