Machine-Learning-Based Android Malware Family Classification Using Built-In and Custom Permissions

被引:8
|
作者
Kim, Minki [1 ]
Kim, Daehan [1 ]
Hwang, Changha [2 ]
Cho, Seongje [3 ]
Han, Sangchul [4 ]
Park, Minkyu [4 ]
机构
[1] Dankook Univ, Dept Data & Knowledge Serv Engn, Yongin 16890, South Korea
[2] Dankook Univ, Dept Stat, Yongin 16890, South Korea
[3] Dankook Univ, Dept Software Sci, Yongin 16890, South Korea
[4] Konkuk Univ, Dept Comp Engn, Chungju 27478, South Korea
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 21期
基金
新加坡国家研究基金会;
关键词
Android malware; malware family classification; machine learning; built-in permission; custom permission; balanced accuracy; Matthews correlation coefficient; DETECTION SYSTEM; FEATURES;
D O I
10.3390/app112110244
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Malware family classification is grouping malware samples that have the same or similar characteristics into the same family. It plays a crucial role in understanding notable malicious patterns and recovering from malware infections. Although many machine learning approaches have been devised for this problem, there are still several open questions including, "Which features, classifiers, and evaluation metrics are better for malware familial classification "? In this paper, we propose a machine learning approach to Android malware family classification using built-in and custom permissions. Each Android app must declare proper permissions to access restricted resources or to perform restricted actions. Permission declaration is an efficient and obfuscation-resilient feature for malware analysis. We developed a malware family classification technique using permissions and conducted extensive experiments with several classifiers on a well-known dataset, DREBIN. We then evaluated the classifiers in terms of four metrics: macrolevel F1-score, accuracy, balanced accuracy (BAC), and the Matthews correlation coefficient (MCC). BAC and the MCC are known to be appropriate for evaluating imbalanced data classification. Our experimental results showed that: (i) custom permissions had a positive impact on classification performance; (ii) even when the same classifier and the same feature information were used, there was a difference up to 3.67% between accuracy and BAC; (iii) LightGBM and AdaBoost performed better than other classifiers we considered.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Permissions-based Android malware detection using machine learning
    Alomar, Atheer
    AlJarullah, Asma
    Abu-Ghazalah, Sarah
    Neural Computing and Applications, 2025, 37 (06) : 5255 - 5270
  • [2] Dynamic Permissions based Android Malware Detection using Machine Learning Techniques
    Mahindru, Arvind
    Singh, Paramvir
    PROCEEDINGS OF THE 10TH INNOVATIONS IN SOFTWARE ENGINEERING CONFERENCE, 2017, : 202 - 210
  • [3] Analysis and Classification of Android Malware using Machine Learning Algorithms
    Tarar, Neha
    Sharma, Shweta
    Krishna, C. Rama
    PROCEEDINGS OF THE 2018 3RD INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2018), 2018, : 738 - 743
  • [4] On the Impact of Sample Duplication in Machine-Learning-Based Android Malware Detection
    Zhao, Yanjie
    Li, Li
    Wang, Haoyu
    Cai, Haipeng
    Bissyande, Tegawende F.
    Klein, Jacques
    Grundy, John
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2021, 30 (03)
  • [5] Machine-Learning based analysis and classification of Android malware signatures
    Martin, Ignacio
    Alberto Hernandez, Jose
    de los Santos, Sergio
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 295 - 305
  • [6] Machine learning aided Android malware classification
    Milosevic, Nikola
    Dehghantanha, Ali
    Choo, Kitn-Kwang Raymond
    COMPUTERS & ELECTRICAL ENGINEERING, 2017, 61 : 266 - 274
  • [7] Entropy-Based Feature Grouping in Machine Learning for Android Malware Classification
    Shim, Hyunseok
    Jung, Souhwan
    INFORMATION SECURITY APPLICATIONS, WISA 2020, 2020, 12583 : 59 - 72
  • [8] Android Malware Detection Based on Machine Learning
    Wang, Qing-Fei
    Fang, Xiang
    2018 4TH ANNUAL INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC 2018), 2018, : 434 - 436
  • [9] Android Malware Family Classification: What Works - API Calls, Permissions or API Packages?
    Kumar, Saurabh
    Mishra, Debadatta
    Shukla, Sandeep Kumar
    2021 14TH INTERNATIONAL CONFERENCE ON SECURITY OF INFORMATION AND NETWORKS (SIN 2021), 2021,
  • [10] Machine learning based Malware Classification for Android Applications using Multimodal Image Representations
    Kumar, Ajit
    Sagar, Pramod K.
    Kuppusamy, K. S.
    Aghila, G.
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,