A Review of Android Malware Detection Approaches Based on Machine Learning

被引:163
作者
Liu, Kaijun [1 ,2 ]
Xu, Shengwei [3 ]
Xu, Guoai [1 ,2 ]
Zhang, Miao [1 ,2 ]
Sun, Dawei [4 ]
Liu, Haifeng [5 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Cyberspace Secur, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Natl Engn Lab Mobile Network Secur, Beijing 100876, Peoples R China
[3] Beijing Elect Sci & Technol Inst, Informat Secur Res Inst, Beijing 100070, Peoples R China
[4] Beijing Softsec Technol Co Ltd, Res Ctr Intelligent Software Secur, Beijing 100876, Peoples R China
[5] Beijing Informat Secur Test & Evaluat Ctr, Beijing 100101, Peoples R China
基金
中国国家自然科学基金;
关键词
Android security; malware detection; machine learning; feature extraction; classifier evaluation; STATIC ANALYSIS; PATTERN-RECOGNITION; FEATURE-SELECTION; MODEL; CLASSIFICATION; ACCURACY; FEATURES; USAGE; APPS; RISK;
D O I
10.1109/ACCESS.2020.3006143
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Android applications are developing rapidly across the mobile ecosystem, but Android malware is also emerging in an endless stream. Many researchers have studied the problem of Android malware detection and have put forward theories and methods from different perspectives. Existing research suggests that machine learning is an effective and promising way to detect Android malware. Notwithstanding, there exist reviews that have surveyed different issues related to Android malware detection based on machine learning. We believe our work complements the previous reviews by surveying a wider range of aspects of the topic. This paper presents a comprehensive survey of Android malware detection approaches based on machine learning. We briefly introduce some background on Android applications, including the Android system architecture, security mechanisms, and classification of Android malware. Then, taking machine learning as the focus, we analyze and summarize the research status from key perspectives such as sample acquisition, data preprocessing, feature selection, machine learning models, algorithms, and the evaluation of detection effectiveness. Finally, we assess the future prospects for research into Android malware detection based on machine learning. This review will help academics gain a full picture of Android malware detection based on machine learning. It could then serve as a basis for subsequent researchers to start new work and help to guide research in the field more generally.
引用
收藏
页码:124579 / 124607
页数:29
相关论文
共 291 条
[51]  
Burguera I., 2011, P 1 ACM WORKSHOP SEC, P15
[52]  
Cai H., 2020, ACM T SOFTW ENG METH, V29, P1
[53]   DroidCat: Effective Android Malware Detection and Categorization via App-Level Profiling [J].
Cai, Haipeng ;
Meng, Na ;
Ryder, Barbara ;
Yao, Daphne .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (06) :1455-1470
[54]   Effectiveness of Opcode ngrams for Detection of Multi Family Android Malware [J].
Canfora, Gerardo ;
De Lorenzo, Andrea ;
Medvet, Eric ;
Mercaldo, Francesco ;
Visaggio, Corrado Aaron .
PROCEEDINGS 10TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY ARES 2015, 2015, :333-340
[55]  
Canfora Gerardo, 2015, P 3 INT WORKSH SOFTW, P13
[56]  
Caruana R, 2004, P 10 ACM SIGKDD INT, P69, DOI DOI 10.1145/1014052.1014063
[57]  
Chakraborty A, 2018, ARXIV PREPRINT ARXIV
[58]  
Chakradeo S., 2013, P 6 ACM C SECURITY P, P13, DOI DOI 10.1145/2462096.2462100
[59]  
Chan PPK, 2014, INT CONF MACH LEARN, P82, DOI 10.1109/ICMLC.2014.7009096
[60]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28