Building a Cloud-IDS by Hybrid Bio-Inspired Feature Selection Algorithms Along With Random Forest Model

被引:18
作者
Bakro, Mhamad [1 ]
Kumar, Rakesh Ranjan [1 ]
Husain, Mohammad [2 ]
Ashraf, Zubair [3 ]
Ali, Arshad [2 ]
Yaqoob, Syed Irfan [4 ]
Ahmed, Mohammad Nadeem [5 ]
Parveen, Nikhat [6 ]
机构
[1] CV Raman Global Univ, Dept Comp Sci & Engn, Bhubaneswar 752054, Odisha, India
[2] Islamic Univ Madinah, Fac Comp & Informat Syst, Dept Comp Sci, Madinah 42351, Saudi Arabia
[3] GLA Univ, Dept Comp Engn & Applicat, Mathura 281406, Uttar Pradesh, India
[4] Dr Vishwanath Karad MIT World Peace Univ, Dept Comp Sci & Applicat, Pune 411038, India
[5] King Khalid Univ, Dept Comp Sci, Abha 61421, Saudi Arabia
[6] Koneru Lakshmaiah Educ Fdn, Dept Comp Sci & Engn, Guntur 522302, Andhra Pradesh, India
关键词
Hybrid metaheuristic approach; GOA-GA-based feature selection; UNSW-NB15; CIC-DDoS2019; CIC Bell DNS EXF 2021; UNSW-NB15 DATA SET; INTRUSION DETECTION; ANOMALY DETECTION; NETWORK; ENSEMBLE;
D O I
10.1109/ACCESS.2024.3353055
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The adoption of cloud computing has become increasingly widespread across various domains. However, the inherent security vulnerabilities of cloud computing pose significant risks to its overall safety. Consequently, intrusion detection systems (IDS) play a pivotal role in identifying malicious activities within a cloud system. The considerable volume of network traffic data may contain redundant and irrelevant features that can impact the classification performance of the classifier. In addition, the complexity and time consumption increase while processing such a substantial volume of data in the cloud intrusion detection process. To enhance the performance of the IDS, this study proposes a hybrid feature selection approach, combining two bio-inspired algorithms, namely the grasshopper optimization algorithm (GOA) and the genetic algorithm (GA). The combination of these two algorithms ensures a more efficient search for optimal solutions. A random forest (RF) classifier is trained using those optimal features. Moreover, the proposal addresses the challenge of imbalanced data by employing a hybrid approach: over-sampling the minority classes using an adaptive synthetic (ADASYN) algorithm, while implementing random under-sampling (RUS) for the majority class as needed. This integrated strategy significantly influences each category, enhancing the true positive rate (TPR) while minimizing the false positive rate (FPR), thus improving the overall system performance. The proposed approach was evaluated using three datasets: UNSW-NB15, CIC-DDoS2019, and CIC Bell DNS EXF 2021. The recorded accuracies for these datasets were 98%, 99%, and 92%, respectively. The hybrid feature selection-based IDS demonstrated superior performance in multi-class classification, along with exemplary results for individual classes within the datasets. The proposed strategy exhibited a marked superiority with the random forest classifier, especially when compared to other classifiers including SVM, LR, FLN, LSTM, AlexNet, DNN, DBN, DT, and XGBoost. Moreover, this performance remained consistent and commendable even when benchmarked against contemporary state-of-the-art methodologies across multiple evaluation metrics.
引用
收藏
页码:8846 / 8874
页数:29
相关论文
共 99 条
[1]   Error-Robust Distributed Denial of Service Attack Detection Based on an Average Common Feature Extraction Technique [J].
Abreu Maranhao, Joao Paulo ;
Carvalho Lustosa da Costa, Joao Paulo ;
Pignaton de Freitas, Edison ;
Javidi, Elnaz ;
Timoteo de Sousa Junior, Rafael .
SENSORS, 2020, 20 (20) :1-21
[2]   A novel feature selection method for data mining tasks using hybrid Sine Cosine Algorithm and Genetic Algorithm [J].
Abualigah, Laith ;
Dulaimi, Akram Jamal .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03) :2161-2176
[3]   Gene selection with Game Shapley Harris hawks optimizer for cancer classification [J].
Afreen, Sana ;
Bhurjee, Ajay Kumar ;
Aziz, Rabia Musheer .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2023, 242
[4]   Network intrusion detection system: A systematic study of machine learning and deep learning approaches [J].
Ahmad, Zeeshan ;
Shahid Khan, Adnan ;
Wai Shiang, Cheah ;
Abdullah, Johari ;
Ahmad, Farhan .
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2021, 32 (01)
[5]   Prioritization based Taxonomy of Cloud-based Outsource Software Development Challenges: Fuzzy AHP analysis [J].
Akbar, Muhammad Azeem ;
Shameem, Mohammad ;
Mahmood, Sajjad ;
Alsanad, Ahmed ;
Gumaei, Abdu .
APPLIED SOFT COMPUTING, 2020, 95
[6]   A new DDoS attacks intrusion detection model based on deep learning for cybersecurity [J].
Akgun, Devrim ;
Hizal, Selman ;
Cavusoglu, Unal .
COMPUTERS & SECURITY, 2022, 118
[7]   An Efficient NIDPS with Improved Salp Swarm Feature Optimization Method [J].
Alabrah, Amerah .
APPLIED SCIENCES-BASEL, 2023, 13 (12)
[8]   Resilient Back Propagation Neural Network Security Model For Containerized Cloud Computing [J].
Almiani, Muder ;
Abughazleh, Alia ;
Jararweh, Yaser ;
Razaque, Abdul .
SIMULATION MODELLING PRACTICE AND THEORY, 2022, 118
[9]   Intrusion detection in Edge-of-Things computing [J].
Almogren, Ahmad S. .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 137 :259-265
[10]   A Feature Selection Model for Network Intrusion Detection System Based on PSO, GWO, FFA and GA Algorithms [J].
Almomani, Omar .
SYMMETRY-BASEL, 2020, 12 (06) :1-20