An Exploratory Analysis of Feature Selection for Malware Detection with Simple Machine Learning Algorithms

被引:1
|
作者
Rahman, Md Ashikur [1 ]
Islam, Syful [2 ]
Nugroho, Yusuf Sulistyo [3 ]
Al Irsyadi, Fatah Yasin [3 ]
Hossain, Md Javed [1 ]
机构
[1] Noakhali Sci & Technol Univ, Dhaka, Bangladesh
[2] Bangabandhu Sheikh Mujibur Rahman Sci & Technol U, Dhaka, Bangladesh
[3] Univ Muhammadiyah Surakarta, Surakarta, Indonesia
关键词
Malware Detection; Machine Learning; Feature Selection; Information Gain; Cybersecurity;
D O I
10.24138/jcomss-2023-0091
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Computers have become increasingly vulnerable to malicious attacks with an increase in popularity and the proliferation of open system architectures. There are numerous malware detection technologies available to protect the computer operating system from such attacks. This type of malware detector targets programs based on patterns detected in the properties of computer applications. As the amount of analytical data increases, the computer defense system is adversely affected. The performance of the detection mechanism has been hindered due to the presence of numerous irrelevant characteristics. The goal of this study is to provide a feature selection approach that will help malware detection systems be more accurate by detecting pertinent and significant traits. Furthermore, by selecting the most important features, it is possible to maintain an acceptable level of accuracy in the detection of malware while significantly lowering the computational cost. The proposed method displays the most important features (MIFs) obtained from each machine learning method, including data cleaning and feature selection. Furthermore, the method applies six machine learning classification techniques to the selected feature set. Several classifiers were evaluated based on several characteristics for malware detection, including Support Vector Machines (SVM), Logistic Regression (LR), K-nearest neighbor (K-NN), Decision Tree (DT), Naive Bayes (NB), and Random Forest (RF). Our suggested model was tested on two malware datasets to determine its effectiveness. In terms of accuracy, precision, F1 scores, and recall, the experimental findings show that RF and DT classifiers beat other techniques.
引用
收藏
页码:207 / 219
页数:13
相关论文
共 50 条
  • [1] FEATURE SELECTION AND MACHINE LEARNING CLASSIFICATION FOR MALWARE DETECTION
    Khammas, Ban Mohammed
    Monemi, Alireza
    Bassi, Joseph Stephen
    Ismail, Ismahani
    Nor, Sulaiman Mohd
    Marsono, Muhammad Nadzir
    JURNAL TEKNOLOGI, 2015, 77 (01):
  • [2] An Effective Malware Detection Method Using Hybrid Feature Selection and Machine Learning Algorithms
    Namita Dabas
    Prachi Ahlawat
    Prabha Sharma
    Arabian Journal for Science and Engineering, 2023, 48 : 9749 - 9767
  • [3] An Effective Malware Detection Method Using Hybrid Feature Selection and Machine Learning Algorithms
    Dabas, Namita
    Ahlawat, Prachi
    Sharma, Prabha
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 9749 - 9767
  • [4] Malware Analysis and Detection Using Machine Learning Algorithms
    Akhtar, Muhammad Shoaib
    Feng, Tao
    SYMMETRY-BASEL, 2022, 14 (11):
  • [5] Android malware detection applying feature selection techniques and machine learning
    Mohammad Reza Keyvanpour
    Mehrnoush Barani Shirzad
    Farideh Heydarian
    Multimedia Tools and Applications, 2023, 82 : 9517 - 9531
  • [6] Android malware detection applying feature selection techniques and machine learning
    Keyvanpour, Mohammad Reza
    Shirzad, Mehrnoush Barani
    Heydarian, Farideh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) : 9517 - 9531
  • [7] A Novel Malware Analysis for Malware Detection and Classification using Machine Learning Algorithms
    Sethi, Kamalakanta
    Chaudhary, Shankar Kumar
    Tripathy, Bata Krishan
    Bera, Padmalochan
    SIN'17: PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON SECURITY OF INFORMATION AND NETWORKS, 2017, : 107 - 113
  • [8] A Study on the Effect of Feature Selection on Malware Analysis using Machine Learning
    Babaagba, Kehinde Oluwatoyin
    Adesanya, Samuel Olumide
    PROCEEDINGS OF 2019 8TH INTERNATIONAL CONFERENCE ON EDUCATIONAL AND INFORMATION TECHNOLOGY (ICEIT 2019), 2019, : 51 - 55
  • [9] Evaluation of Machine Learning Algorithms for Malware Detection
    Akhtar, Muhammad Shoaib
    Feng, Tao
    SENSORS, 2023, 23 (02)
  • [10] Malware Detection and Classification with Machine Learning Algorithms
    Kumar, R. Vinoth
    Islam, Md Mojahidul
    Apon, Abir Hossain
    Prantha, C. S.
    SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 5, SMARTCOM 2024, 2024, 949 : 143 - 158