Ensemble Machine Learning Techniques for Accurate and Efficient Detection of Botnet Attacks in Connected Computers

被引:15
作者
Afrifa, Stephen [1 ,2 ]
Varadarajan, Vijayakumar [3 ,4 ,5 ]
Appiahene, Peter [2 ]
Zhang, Tao [1 ]
Domfeh, Emmanuel Adjei [2 ]
机构
[1] Tianjin Univ, Dept Informat & Commun Engn, Tianjin 300072, Peoples R China
[2] Univ Energy & Nat Resources, Dept Comp Sci & Informat, Sunyani 00233, Ghana
[3] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
[4] Ajeenkya D Y Patil Univ, Int Div, Pune 412105, India
[5] Swiss Sch Business Management, Sch Informat Technol, CH-1213 Geneva, Switzerland
来源
ENG | 2023年 / 4卷 / 01期
关键词
machine learning; botnet; malware; network traffic; ensemble model; IoT;
D O I
10.3390/eng4010039
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The transmission of information, ideas, and thoughts requires communication, which is a crucial component of human contact. The utilization of Internet of Things (IoT) devices is a result of the advent of enormous volumes of messages delivered over the internet. The IoT botnet assault, which attempts to perform genuine, lucrative, and effective cybercrimes, is one of the most critical IoT dangers. To identify and prevent botnet assaults on connected computers, this study uses both quantitative and qualitative approaches. This study employs three basic machine learning (ML) techniques-random forest (RF), decision tree (DT), and generalized linear model (GLM)-and a stacking ensemble model to detect botnets in computer network traffic. The results reveled that random forest attained the best performance with a coefficient of determination (R2) of 0.9977, followed by decision tree with an R2 of 0.9882, while GLM was the worst among the basic machine learning models with an R2 of 0.9522. Almost all ML models achieved satisfactory performance, with an R2 above 0.93. Overall, the stacking ensemble model obtained the best performance, with a root mean square error (RMSE) of 0.0084 m, a mean absolute error (MAE) of 0.0641 m, and an R2 of 0.9997. Regarding the stacking ensemble model as compared with the single machine learning models, the R2 of the stacking ensemble machine learning increased by 0.2% compared to the RF, 1.15% compared to the DT, and 3.75% compared to the GLM, while RMSE decreased by approximately 0.15% compared to the GLM, DT, and RF single machine learning techniques. Furthermore, this paper suggests best practices for preventing botnet attacks. Businesses should make major investments to combat botnets. This work contributes to knowledge by presenting a novel method for detecting botnet assaults using an artificial-intelligence-powered solution with real-time behavioral analysis. This study can assist companies, organizations, and government bodies in making informed decisions for a safer network that will increase productivity.
引用
收藏
页码:650 / 664
页数:15
相关论文
共 47 条
  • [1] Afrifa S., 2022, Int. J. Innov. Technol. Interdisc. Sci., V5, P1069
  • [2] Mathematical and Machine Learning Models for Groundwater Level Changes: A Systematic Review and Bibliographic Analysis
    Afrifa, Stephen
    Zhang, Tao
    Appiahene, Peter
    Varadarajan, Vijayakumar
    [J]. FUTURE INTERNET, 2022, 14 (09):
  • [3] Botnet Detection in IoT Devices Using Random Forest Classifier with Independent Component Analysis
    Akash, Nazmus Sakib
    Rouf, Shakir
    Jahan, Sigma
    Chowdhury, Amlan
    Chakrabarty, Amitabha
    Uddin, Jia
    [J]. JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2022, 21 (02): : 201 - 232
  • [4] Detection of Malware by Deep Learning as CNN-LSTM Machine Learning Techniques in Real Time
    Akhtar, Muhammad Shoaib
    Feng, Tao
    [J]. SYMMETRY-BASEL, 2022, 14 (11):
  • [5] A DDoS Detection and Prevention System for IoT Devices and Its Application to Smart Home Environment
    Al-Begain, Khalid
    Khan, Murad
    Alothman, Basil
    Joumaa, Chibli
    Alrashed, Ebrahim
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [6] Improved Detection of Malicious Domain Names Using Gradient Boosted Machines and Feature Engineering
    Alhogail, Areej
    Al-Turaiki, Isra
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2022, 51 (02): : 313 - 331
  • [7] Modeling of Botnet Detection Using Barnacles Mating Optimizer with Machine Learning Model for Internet of Things Environment
    Alrayes, Fatma S.
    Maray, Mohammed
    Gaddah, Abdulbaset
    Yafoz, Ayman
    Alsini, Raed
    Alghushairy, Omar
    Mohsen, Heba
    Motwakel, Abdelwahed
    [J]. ELECTRONICS, 2022, 11 (20)
  • [8] Appiahene P., 2019, P INT C ARTIFICIAL I, P6
  • [9] Predicting Bank Operational Efficiency Using Machine Learning Algorithm: Comparative Study of Decision Tree, Random Forest, and Neural Networks
    Appiahene, Peter
    Missah, Yaw Marfo
    Najim, Ussiph
    [J]. ADVANCES IN FUZZY SYSTEMS, 2020, 2020
  • [10] Evaluation of information technology impact on bank's performance: The Ghanaian experience
    Appiahene, Peter
    Missah, Yaw Marfo
    Najim, Ussiph
    [J]. INTERNATIONAL JOURNAL OF ENGINEERING BUSINESS MANAGEMENT, 2019, 11