Towards a fair comparison and realistic evaluation framework of android malware detectors based on static analysis and machine learning

被引:15
|
作者
Molina-Coronado, Borja [1 ]
Mori, Usue [2 ]
Mendiburu, Alexander [1 ]
Miguel-Alonso, Jose [1 ]
机构
[1] Univ Basque Country UPV EHU, Dept Comp Architecture & Technol, Ps Manuel Lardizabal 1, Donostia San Sebastian 20018, Gipuzkoa, Spain
[2] Univ Basque Country UPV EHU, Dept Comp Sci & Artificial Intelligence, Ps Manuel Lardizabal 1, Donostia San Sebastian 20018, Gipuzkoa, Spain
关键词
Android malware detection; Machine learning; Mobile security; Experimental analysis; Static analysis; OBFUSCATION; DISCOVERY; KNOWLEDGE; MODEL;
D O I
10.1016/j.cose.2022.102996
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As in other cybersecurity areas, machine learning (ML) techniques have emerged as a promising solution to detect Android malware. In this sense, many proposals employing a variety of algorithms and feature sets have been presented to date, often reporting impresive detection performances. However, the lack of reproducibility and the absence of a standard evaluation framework make these proposals difficult to compare. In this paper, we perform an analysis of 10 influential research works on Android malware detection using a common evaluation framework. We have identified five factors that, if not taken into account when creating datasets and designing detectors, significantly affect the trained ML models and their performances. In particular, we analyze the effect of (1) the presence of duplicated samples, (2) label (goodware/greyware/malware) attribution, (3) class imbalance, (4) the presence of apps that use evasion techniques and, (5) the evolution of apps. Based on this extensive experimentation, we conclude that the studied ML-based detectors have been evaluated optimistically, which justifies the good published results. Our findings also highlight that it is imperative to generate realistic experimental scenarios, taking into account the aforementioned factors, to foster the rise of better ML-based Android malware detection solutions. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Static and Dynamic Malware Analysis Using Machine Learning
    Ijaz, Muhammad
    Durad, Muhammad Hanif
    Ismail, Maliha
    PROCEEDINGS OF 2019 16TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2019, : 687 - 691
  • [22] Static Malware Analysis Using Machine and Deep Learning
    Singh, Himanshu Kumar
    Singh, Jyoti Prakash
    Tewari, Anand Shanker
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION NETWORKS (ICCCN 2021), 2022, 394 : 437 - 446
  • [23] Efficient and Effective Static Android Malware Detection Using Machine Learning
    Bansal, Vidhi
    Ghosh, Mohona
    Baliyan, Niyati
    INFORMATION SYSTEMS SECURITY, ICISS 2022, 2022, 13784 : 103 - 118
  • [24] A Machine-Learning-Based Framework for Supporting Malware Detection and Analysis
    Cuzzocrea, Alfredo
    Mercaldo, Francesco
    Martinelli, Fabio
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT III, 2021, 12951 : 353 - 365
  • [25] Machine-Learning based analysis and classification of Android malware signatures
    Martin, Ignacio
    Alberto Hernandez, Jose
    de los Santos, Sergio
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 295 - 305
  • [26] AmandaSystem: A new framework for static and dynamic Android malware analysis
    Tang, Jianfei
    Zhao, Hui
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 6575 - 6589
  • [27] Beyond the Hype: An Evaluation of Commercially Available Machine Learning-based Malware Detectors
    Bridges, Robert A.
    Oesch, Sean
    Iannacone, Michael D.
    Huffer, Kelly M. T.
    Jewell, Brian
    Nichols, Jeff A.
    Weber, Brian
    Verma, Miki E.
    Scofield, Daniel
    Miles, Craig
    Plummer, Thomas
    Daniell, Mark
    Tall, Anne M.
    Beaver, Justin M.
    Smith, Jared M.
    DIGITAL THREATS: RESEARCH AND PRACTICE, 2023, 4 (02):
  • [28] Analysis of Machine Learning Solutions to Detect Malware in Android
    Jamil, Qudsia
    Shah, Munam Ali
    2016 SIXTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2016, : 226 - 232
  • [29] An Android Malware Detection System Based on Machine Learning
    Wen, Long
    Yu, Haiyang
    GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I, 2017, 1864
  • [30] Android Malware Detection Using Machine Learning with Feature Selection Based on the Genetic Algorithm
    Lee, Jaehyeong
    Jang, Hyuk
    Ha, Sungmin
    Yoon, Yourim
    MATHEMATICS, 2021, 9 (21)