Effective and Efficient Hybrid Android Malware Classification Using Pseudo-Label Stacked Auto-Encoder

被引:77
作者
Mahdavifar, Samaneh [1 ]
Alhadidi, Dima [2 ]
Ghorbani, Ali. A. [1 ]
机构
[1] Univ New Brunswick, Canadian Inst Cybersecur CIC, Fac Comp Sci, Fredericton, NB, Canada
[2] Univ Windsor, Sch Comp Sci, Windsor, ON, Canada
关键词
Android malware; Category; Classification; Hybrid analysis; Semi-supervised learning; Stacked auto-encoder; Deep learning;
D O I
10.1007/s10922-021-09634-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Android has become the target of attackers because of its popularity. The detection of Android mobile malware has become increasingly important due to its significant threat. Supervised machine learning, which has been used to detect Android malware is far from perfect because it requires a significant amount of labeled data. Since labeled data is expensive and difficult to get while unlabeled data is abundant and cheap in this context, we resort to a semi-supervised learning technique, namely pseudo-label stacked auto-encoder (PLSAE), which involves training using a set of labeled and unlabeled instances. We use a hybrid approach of dynamic analysis and static analysis to craft feature vectors. We evaluate our proposed model on CICMalDroid2020, which includes 17,341 most recent samples of five different Android apps categories. After that, we compare the results with state-of-the-art techniques in terms of accuracy and efficiency. Experimental results show that our proposed framework outperforms other semi-supervised approaches and common machine learning algorithms.
引用
收藏
页数:34
相关论文
共 57 条
  • [51] A Semi-Supervised Deep Network Embedding Approach Based on the Neighborhood Structure
    Wu, Wenmao
    Yu, Zhizhou
    He, Jieyue
    [J]. BIG DATA MINING AND ANALYTICS, 2019, 2 (03) : 205 - 216
  • [52] Android malware detection based on system call sequences and LSTM
    Xiao, Xi
    Zhang, Shaofeng
    Mercaldo, Francesco
    Hu, Guangwu
    Sangaiah, Arun Kumar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (04) : 3979 - 3999
  • [53] Yang WenZhu Yang WenZhu, 2018, Information Processing in Agriculture, V5, P246, DOI 10.1016/j.inpa.2018.01.004
  • [54] An Android mutation malware detection based on deep learning using visualization of importance from codes
    Yen, Yao-Saint
    Sun, Hung-Min
    [J]. MICROELECTRONICS RELIABILITY, 2019, 93 : 109 - 114
  • [55] Yuan ZL, 2014, ACM SIGCOMM COMP COM, V44, P371, DOI [10.1145/2740070.2631434, 10.1145/2619239.2631434]
  • [56] Semantics-Aware Android Malware Classification Using Weighted Contextual API Dependency Graphs
    Zhang, Mu
    Duan, Yue
    Yin, Heng
    Zhao, Zhiruo
    [J]. CCS'14: PROCEEDINGS OF THE 21ST ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2014, : 1105 - 1116
  • [57] Zhu X., 2003, P 20 INT C MACH LEAR, P912