Android malware classification using optimum feature selection and ensemble machine learning

被引:0
|
作者
Islam R. [2 ]
Sayed M.I. [1 ]
Saha S. [1 ]
Hossain M.J. [2 ]
Masud M.A. [2 ]
机构
[1] Computer Science, Western University, London, ON
[2] Computer Science and Information Technology, Patuakhali Science and Technology University, Dumki, Patuakhali
来源
Internet of Things and Cyber-Physical Systems | 2023年 / 3卷
关键词
Android; Category classification; Dynamic analysis; Ensemble; Malware; Supervised ML;
D O I
10.1016/j.iotcps.2023.03.001
中图分类号
学科分类号
摘要
The majority of smartphones on the market run on the Android operating system. Security has been a core concern with this platform since it allows users to install apps from unknown sources. With thousands of apps being produced and launched daily, malware detection using Machine Learning (ML) has attracted significant attention compared to traditional detection techniques. Despite academic and commercial efforts, developing an efficient and reliable method for classifying malware remains challenging. As a result, several datasets for malware analysis have been generated and made available during the past ten years. These datasets may contain static features, such as API calls, intents, and permissions, or dynamic features, like logcat errors, shared memory, and system calls. Dynamic analysis is more resilient when it comes to code obfuscation. Though binary classification and multi-classification have been carried out in recent studies, the latter provides valuable insight into the nature of malware. Because each malware variant operates differently, identifying its category might help prevent it. Using the well-known ensemble ML approach called weighted voting, this study performed dynamic feature analysis for multi-classification. Random Forest, K-nearest Neighbors, Multi-Level Perceptrons, Decision Trees, Support Vector Machines, and Logistic Regression are all studied in this ensemble model. We used a recent dataset named CCCS-CIC-AndMal-2020, which contains an extensive collection of Android applications and malware samples. A well-researched data preparation phase followed by weighted voting based on R2 scores of the ML classifiers presents an accuracy of 95.0% even after excluding 60.2% features, outperforming all recent studies. © 2023 The Authors
引用
收藏
页码:100 / 111
页数:11
相关论文
共 50 条
  • [1] High Performance Classification of Android Malware Using Ensemble Machine Learning
    Ouk, Pagnchakneat C.
    Pak, Wooguil
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 381 - 398
  • [2] Android Malware Detection Using Ensemble Feature Learning
    Rout, Siddhartha Suman
    Vashishtha, Lalit Kumar
    Chatterjee, Kakali
    Rout, Jitendra Kumar
    INFORMATION SYSTEMS AND MANAGEMENT SCIENCE, ISMS 2021, 2023, 521 : 531 - 539
  • [3] Ensemble Machine Learning Approach for Android Malware Classification Using Hybrid Features
    Pektas, Abdurrahman
    Acarman, Tankut
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2017, 2018, 578 : 191 - 200
  • [4] FEATURE SELECTION AND MACHINE LEARNING CLASSIFICATION FOR MALWARE DETECTION
    Khammas, Ban Mohammed
    Monemi, Alireza
    Bassi, Joseph Stephen
    Ismail, Ismahani
    Nor, Sulaiman Mohd
    Marsono, Muhammad Nadzir
    JURNAL TEKNOLOGI, 2015, 77 (01):
  • [5] Android Malware Detection Using Machine Learning with Feature Selection Based on the Genetic Algorithm
    Lee, Jaehyeong
    Jang, Hyuk
    Ha, Sungmin
    Yoon, Yourim
    MATHEMATICS, 2021, 9 (21)
  • [6] DETECTION OF ANDROID MALWARE USING DEEP LEARNING ENSEMBLE WITH CHEETAH-OPTIMIZED FEATURE SELECTION
    Almotairi, Sultan
    Khan, Mohd Abdul Rahim
    Alharbi, Olayan
    Alzaid, Zaid
    Hausawi, Yasser M.
    Almutairi, Jaber
    ADVANCES AND APPLICATIONS IN DISCRETE MATHEMATICS, 2024, 41 (05): : 357 - 392
  • [7] Ensemble Feature Selection for Android SMS Malware Detection
    Ibrahim, Syed F.
    Hossain, Md Sakir
    Islam, Md Moontasirul
    Mostofa, Md Golam
    ADVANCES IN CYBERSECURITY, CYBERCRIMES, AND SMART EMERGING TECHNOLOGIES, 2023, 4 : 15 - 26
  • [8] BFEDroid: A Feature Selection Technique to Detect Malware in Android Apps Using Machine Learning
    Chimeleze, Collins
    Jamil, Norziana
    Ismail, Roslan
    Lam, Kwok-Yan
    Teh, Je Sen
    Samual, Joshua
    Okeke, Chidiebere Akachukwu
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [9] Feature Selection and Ensemble of Classifiers for Android Malware Detection
    Coronado-De-Alba, Lilian D.
    Rodriguez-Mota, Abraham
    Escamilla-Ambrosio, Ponciano J.
    2016 8TH IEEE LATIN-AMERICAN CONFERENCE ON COMMUNICATIONS (LATINCOM), 2016,
  • [10] Android malware detection applying feature selection techniques and machine learning
    Mohammad Reza Keyvanpour
    Mehrnoush Barani Shirzad
    Farideh Heydarian
    Multimedia Tools and Applications, 2023, 82 : 9517 - 9531