Yes, Machine Learning Can Be More Secure! A Case Study on Android Malware Detection

被引:159
作者
Demontis, Ambra [1 ]
Melis, Marco [1 ]
Biggio, Battista [1 ]
Maiorca, Davide [1 ]
Arp, Daniel [2 ]
Rieck, Konrad [2 ]
Corona, Igino [1 ]
Giacinto, Giorgio [1 ]
Roli, Fabio [1 ]
机构
[1] Univ Cagliari, Dept Elect & Elect Engn, Piazza Armi, I-09123 Cagliari, Italy
[2] Tech Univ Carolo Wilhelmina Braunschweig, Inst Syst Secur, Rebenring 56, D-38106 Braunschweig, Germany
关键词
Android malware detection; static analysis; secure machine learning; computer security; FEATURE-SELECTION; CLASSIFIERS; ROBUST;
D O I
10.1109/TDSC.2017.2700270
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To cope with the increasing variability and sophistication of modern attacks, machine learning has been widely adopted as a statistically-sound tool for malware detection. However, its security against well-crafted attacks has not only been recently questioned, but it has been shown that machine learning exhibits inherent vulnerabilities that can be exploited to evade detection at test time. In other words, machine learning itself can be the weakest link in a security system. In this paper, we rely upon a previously-proposed attack framework to categorize potential attack scenarios against learning-based malware detection tools, by modeling attackers with different skills and capabilities. We then define and implement a set of corresponding evasion attacks to thoroughly assess the security of Drebin, an Android malware detector. The main contribution of this work is the proposal of a simple and scalable secure-learning paradigm that mitigates the impact of evasion attacks, while only slightly worsening the detection rate in the absence of attack. We finally argue that our secure-learning approach can also be readily applied to other malware detection tasks.
引用
收藏
页码:711 / 724
页数:14
相关论文
共 50 条
[1]  
Aafer Y, 2013, L N INST COMP SCI SO, V127, P86
[2]  
[Anonymous], 1997, Technical Report
[3]   The weakest link revisited [J].
Arce, Iván .
IEEE Security and Privacy, 2003, 1 (02) :72-76
[4]  
Arp D., 2014, P 21 NDSS, P12
[5]  
Barreno M, 2006, P 2006 ACM S INF COM, P16
[6]   The security of machine learning [J].
Barreno, Marco ;
Nelson, Blaine ;
Joseph, Anthony D. ;
Tygar, J. D. .
MACHINE LEARNING, 2010, 81 (02) :121-148
[7]   A Methodology for Empirical Analysis of Permission-Based Security Models and its Application to Android [J].
Barrera, David ;
Kayacik, H. Guenes ;
van Oorschot, P. C. ;
Somayaji, Anil .
PROCEEDINGS OF THE 17TH ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (CCS'10), 2010, :73-84
[8]  
Biggio B., 2012, 29 INT C MACHINE LEA, P1467
[9]  
Biggio B., 2013, MACHINE LEARNING KNO, P387, DOI [DOI 10.1007/978-3-642-40994, DOI 10.1007/978-3-642-40994-3_25]
[10]   PATTERN RECOGNITION SYSTEMS UNDER ATTACK: DESIGN ISSUES AND RESEARCH CHALLENGES [J].
Biggio, Battista ;
Fumera, Giorgio ;
Roli, Fabio .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2014, 28 (07)