Machine-Learning-Based Android Malware Family Classification Using Built-In and Custom Permissions

被引:8
作者
Kim, Minki [1 ]
Kim, Daehan [1 ]
Hwang, Changha [2 ]
Cho, Seongje [3 ]
Han, Sangchul [4 ]
Park, Minkyu [4 ]
机构
[1] Dankook Univ, Dept Data & Knowledge Serv Engn, Yongin 16890, South Korea
[2] Dankook Univ, Dept Stat, Yongin 16890, South Korea
[3] Dankook Univ, Dept Software Sci, Yongin 16890, South Korea
[4] Konkuk Univ, Dept Comp Engn, Chungju 27478, South Korea
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 21期
基金
新加坡国家研究基金会;
关键词
Android malware; malware family classification; machine learning; built-in permission; custom permission; balanced accuracy; Matthews correlation coefficient; DETECTION SYSTEM; FEATURES;
D O I
10.3390/app112110244
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Malware family classification is grouping malware samples that have the same or similar characteristics into the same family. It plays a crucial role in understanding notable malicious patterns and recovering from malware infections. Although many machine learning approaches have been devised for this problem, there are still several open questions including, "Which features, classifiers, and evaluation metrics are better for malware familial classification "? In this paper, we propose a machine learning approach to Android malware family classification using built-in and custom permissions. Each Android app must declare proper permissions to access restricted resources or to perform restricted actions. Permission declaration is an efficient and obfuscation-resilient feature for malware analysis. We developed a malware family classification technique using permissions and conducted extensive experiments with several classifiers on a well-known dataset, DREBIN. We then evaluated the classifiers in terms of four metrics: macrolevel F1-score, accuracy, balanced accuracy (BAC), and the Matthews correlation coefficient (MCC). BAC and the MCC are known to be appropriate for evaluating imbalanced data classification. Our experimental results showed that: (i) custom permissions had a positive impact on classification performance; (ii) even when the same classifier and the same feature information were used, there was a difference up to 3.67% between accuracy and BAC; (iii) LightGBM and AdaBoost performed better than other classifiers we considered.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Android Malware Family Classification Based on Sensitive Opcode Sequence
    Jiang, Jianguo
    Li, Song
    Yu, Min
    Li, Gang
    Liu, Chao
    Chen, Kai
    Liu, Hui
    Huang, Weiqing
    [J]. 2019 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2019, : 63 - 69
  • [42] Detection of Android Malware Using Machine Learning and Siamese Shot Learning Technique for Security
    Almarshad, Fahdah A.
    Zakariah, Mohammed
    Gashgari, Ghada Abdalaziz
    Aldakheel, Eman Abdullah
    Alzahrani, Abdullah I. A.
    [J]. IEEE ACCESS, 2023, 11 : 127697 - 127714
  • [43] Machine-Learning-Based Malware Detection for Virtual Machine by Analyzing Opcode Sequence
    Wang, Xiao
    Zhang, Jianbiao
    Zhang, Ai
    [J]. ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2018, 2018, 10989 : 717 - 726
  • [44] Malware Family Classification using Active Learning by Learning
    Chen, Chin-Wei
    Su, Ching-Hung
    Lee, Kun-Wei
    Bair, Ping-Hao
    [J]. 2020 22ND INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): DIGITAL SECURITY GLOBAL AGENDA FOR SAFE SOCIETY!, 2020, : 590 - 595
  • [45] Androhealthcheck: A malware detection system for android using machine learning
    Agrawal P.
    Trivedi B.
    [J]. Lecture Notes on Data Engineering and Communications Technologies, 2021, 66 : 35 - 41
  • [46] AndyWar: an intelligent android malware detection using machine learning
    Roy, Sandipan
    Bhanja, Samit
    Das, Abhishek
    [J]. INNOVATIONS IN SYSTEMS AND SOFTWARE ENGINEERING, 2025, 21 (01) : 303 - 311
  • [47] Android Malware Detection Using Parallel Machine Learning Classifiers
    Yerima, Suleiman Y.
    Sezer, Sakir
    Muttik, Igor
    [J]. 2014 EIGHTH INTERNATIONAL CONFERENCE ON NEXT GENERATION MOBILE APPS, SERVICES AND TECHNOLOGIES (NGMAST), 2014, : 37 - 42
  • [48] Backdoor Attack on Machine Learning Based Android Malware Detectors
    Li, Chaoran
    Chen, Xiao
    Wang, Derui
    Wen, Sheng
    Ahmed, Muhammad Ejaz
    Camtepe, Seyit
    Xiang, Yang
    [J]. IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2022, 19 (05) : 3357 - 3370
  • [49] Effectiveness of machine learning based android malware detectors against adversarial attacks
    Jyothish, A.
    Mathew, Ashik
    Vinod, P.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 2549 - 2569
  • [50] Malware Classification System Based on Machine Learning
    Qu Wei
    Shi Xiao
    Li Dongbao
    [J]. PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 647 - 652