Android Malware Detection: An Empirical Investigation into Machine Learning Classifiers

被引：2

作者：

Raval, Aaditya ^{[1
]}

Anwar, Mohd ^{[1
]}

机构：

[1] North Carolina Agr & Tech State Univ, Human Ctr AI Lab, Comp Sci Dept, Greensboro, NC 27411 USA

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE, IRI 2024 | 2024年

关键词：

Android malware; malware detection; machine learning; ANOVA feature selection; malware analysis; mobile security; Android security; static analysis;

D O I：

10.1109/IRI62200.2024.00039

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Smartphones have become an all-in-one device due to their constant connectivity and ability to provide a wide range of functions. However, they can expose users to various forms of malware, posing significant threats to their privacy, security, and financial well-being. In this context, the detection and classification of Android malware have emerged as critical research areas in cybersecurity. Our study develops six distinct models/classifiers for detecting and classifying Android malware, leveraging the following machine-learning algorithms: MLP, logistic regression, random forest, SVM, XGBoost, and AdaBoost. Through a systematic evaluation process, we assess the efficacy of each model, highlighting their respective strengths and weaknesses. These findings not only contribute to the existing body of knowledge but also pave the way for future research and innovation in the field of Android security. Furthermore, we investigate the impact of data preprocessing and feature selection strategies on model performance and generalization capabilities. Our experimental results reveal that Random Forest (RF) and Extreme Gradient Boosting (XGBoost) classifiers outperformed others in classifying Android malware, showcasing performance of around 93% and 92%, respectively across accuracy, precision, recall, and f1-score. With an AUC of 0.93 for RF and 0.92 for XGBoost, these models can clearly distinguish between malware and benign samples with minimum misclassifications. Our findings shed light on the effectiveness of machine learning algorithms in combating Android malware and offer valuable insights into the most suitable models. Ultimately, this research study advances the understanding of Android malware detection and classification, providing a foundation for developing robust security solutions in the mobile computing landscape.

引用

页码：144 / 149

页数：6

共 25 条

[1]

42matters AG, Amazon Appstore Statistics and trends 2024

[2]

Allix K, 2016, 13TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2016), P468, DOI [10.1109/MSR.2016.056, 10.1145/2901739.2903508]

[3]

Amazon, Mobile Apps

[4]

[Anonymous], About Us

[5]

APKMirror, About us

[6]

APKPure, US

[7] Drebin: Effective and Explainable Detection of Android Malware in Your Pocket [J].

Arp, Daniel ;

Spreitzenbarth, Michael ;

Huebner, Malte ;

Gascon, Hugo ;

Rieck, Konrad .

21ST ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2014), 2014,

[8]

Canadian Centre for Cyber Security, 2020, About us

[9]

Enck W., 2010, P 9 USENIX C OP SYST, P1, DOI DOI 10.1145/2494522

[10]

F-Droid, Packages

← 1 2 3 →