Effective and Efficient Hybrid Android Malware Classification Using Pseudo-Label Stacked Auto-Encoder

被引:77
作者
Mahdavifar, Samaneh [1 ]
Alhadidi, Dima [2 ]
Ghorbani, Ali. A. [1 ]
机构
[1] Univ New Brunswick, Canadian Inst Cybersecur CIC, Fac Comp Sci, Fredericton, NB, Canada
[2] Univ Windsor, Sch Comp Sci, Windsor, ON, Canada
关键词
Android malware; Category; Classification; Hybrid analysis; Semi-supervised learning; Stacked auto-encoder; Deep learning;
D O I
10.1007/s10922-021-09634-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Android has become the target of attackers because of its popularity. The detection of Android mobile malware has become increasingly important due to its significant threat. Supervised machine learning, which has been used to detect Android malware is far from perfect because it requires a significant amount of labeled data. Since labeled data is expensive and difficult to get while unlabeled data is abundant and cheap in this context, we resort to a semi-supervised learning technique, namely pseudo-label stacked auto-encoder (PLSAE), which involves training using a set of labeled and unlabeled instances. We use a hybrid approach of dynamic analysis and static analysis to craft feature vectors. We evaluate our proposed model on CICMalDroid2020, which includes 17,341 most recent samples of five different Android apps categories. After that, we compare the results with state-of-the-art techniques in terms of accuracy and efficiency. Experimental results show that our proposed framework outperforms other semi-supervised approaches and common machine learning algorithms.
引用
收藏
页数:34
相关论文
共 57 条
  • [1] URLdeepDetect: A Deep Learning Approach for Detecting Malicious URLs Using Semantic Vector Models
    Afzal, Sara
    Asim, Muhammad
    Javed, Abdul Rehman
    Beg, Mirza Omer
    Baker, Thar
    [J]. JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2021, 29 (03)
  • [2] Alrabaee S., 2016, P 11 INT C MAL UNW S, P1
  • [3] FOSSIL: A Resilient and Efficient System for Identifying FOSS Functions in Malware Binaries
    Alrabaee, Saed
    Shirani, Paria
    Wang, Lingyu
    Debbabi, Mourad
    [J]. ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2018, 21 (02)
  • [4] [Anonymous], 2019, CONTAGIO MOBILE MALW
  • [5] [Anonymous], 2016, INTRO SEMISUPERVISED
  • [6] [Anonymous], 2008, Proceedings of the 25th international conference on Machine learning, DOI DOI 10.1145/1390156.1390256
  • [7] Drebin: Effective and Explainable Detection of Android Malware in Your Pocket
    Arp, Daniel
    Spreitzenbarth, Michael
    Huebner, Malte
    Gascon, Hugo
    Rieck, Konrad
    [J]. 21ST ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2014), 2014,
  • [8] Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
  • [9] Blum A, 2004, P 21 INT C MACH LEAR, P13, DOI 10.1145/1015330.1015429
  • [10] DroidCat: Effective Android Malware Detection and Categorization via App-Level Profiling
    Cai, Haipeng
    Meng, Na
    Ryder, Barbara
    Yao, Daphne
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (06) : 1455 - 1470