Feature-Space Bayesian Adversarial Learning Improved Malware Detector Robustness

被引:0
作者
Doan, Bao Gia [1 ]
Yang, Shuiqiao [2 ]
Montague, Paul [4 ]
De Vel, Olivier [3 ]
Abraham, Tamas [4 ]
Camtepe, Seyit [3 ]
Kanhere, Salil S. [2 ]
Abbasnejad, Ehsan [1 ]
Ranasinghe, Damith C. [1 ]
机构
[1] Univ Adelaide, Adelaide, SA, Australia
[2] Univ New South Wales, Kensington, NSW, Australia
[3] CSIRO, Data61, Eveleigh, Australia
[4] Def Sci & Technol Grp, Canberra, ACT, Australia
来源
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12 | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new algorithm to train a robust malware detector. Malware is a prolific problem and malware detectors are a front-line defense. Modern detectors rely on machine learning algorithms. Now, the adversarial objective is to devise alterations to the malware code to decrease the chance of being detected whilst preserving the functionality and realism of the malware. Adversarial learning is effective in improving robustness but generating functional and realistic adversarial malware samples is non-trivial. Because: i) in contrast to tasks capable of using gradient-based feedback, adversarial learning in a domain without a differentiable mapping function from the problem space (malware code inputs) to the feature space is hard; and ii) it is difficult to ensure the adversarial malware is realistic and functional. This presents a challenge for developing scalable adversarial machine learning algorithms for large datasets at a production or commercial scale to realize robust malware detectors. We propose an alternative; perform adversarial learning in the feature space in contrast to the problem space. We prove the projection of perturbed, yet valid malware, in the problem space into feature space will always be a subset of adversarials generated in the feature space. Hence, by generating a robust network against feature-space adversarial examples, we inherently achieve robustness against problem-space adversarial examples. We formulate a Bayesian adversarial learning objective that captures the distribution of models for improved robustness. To explain the robustness of the Bayesian adversarial learning algorithm, we prove that our learning method bounds the difference between the adversarial risk and empirical risk and improves robustness. We show that Bayesian neural networks (BNNs) achieve state-of-the-art results; especially in the False Positive Rate (FPR) regime. Adversarially trained BNNs achieve state-of-the-art robustness. Notably, adversarially trained BNNs are robust against stronger attacks with larger attack budgets by a margin of up to 15% on a recent production-scale malware dataset of more than 20 million samples. Importantly, our efforts create a benchmark for future defenses in the malware domain.
引用
收藏
页码:14783 / 14791
页数:9
相关论文
共 48 条
[41]   Image super-resolution reconstruction via improved dictionary learning based on coupled feature space [J].
Zhan S. ;
Fang Q. ;
Yang F.-M. ;
Chang L.-L. ;
Yan T. .
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2016, 44 (05) :1189-1195
[42]   A deep-transfer-learning fault diagnosis method for gearboxes based on discriminative feature extraction and improved domain adversarial neural networks [J].
He, Xiaoliang ;
Zhao, Feng ;
Song, Nianyun ;
Su, Chun ;
Liu, Pengfei .
NONDESTRUCTIVE TESTING AND EVALUATION, 2025,
[43]   One-Shot Fault Diagnosis of Three-Dimensional Printers Through Improved Feature Space Learning [J].
Li, Chuan ;
Cabrera, Diego ;
Sancho, Fernando ;
Sanchez, Rene-Vinicio ;
Cerrada, Mariela ;
de Oliveira, Jose Valente .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (09) :8768-8776
[44]   An improved evolutionary approach-based hybrid algorithm for Bayesian network structure learning in dynamic constrained search space [J].
Jingguo Dai ;
Jia Ren ;
Wencai Du ;
Vladimir Shikhin ;
Jixin Ma .
Neural Computing and Applications, 2020, 32 :1413-1434
[45]   An improved evolutionary approach-based hybrid algorithm for Bayesian network structure learning in dynamic constrained search space [J].
Dai, Jingguo ;
Ren, Jia ;
Du, Wencai ;
Shikhin, Vladimir ;
Ma, Jixin .
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (05) :1413-1434
[46]   Hydrodynamic feature extraction and intelligent identification of flow regimes in vaneless space of a pump turbine using improved empirical wavelet transform and Bayesian optimized convolutional neural network [J].
Zheng, Xianghao ;
Li, Hao ;
Zhang, Suqi ;
Zhang, Yuning ;
Li, Jinwei ;
Zhang, Yuning ;
Zhao, Weiqiang .
ENERGY, 2023, 282
[47]   A sequential feature extraction method based on discrete wavelet transform, phase space reconstruction, and singular value decomposition and an improved extreme learning machine for rolling bearing fault diagnosis [J].
Li, D. Z. ;
Zheng, X. ;
Xie, Q. W. ;
Jin, Q. B. .
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART E-JOURNAL OF PROCESS MECHANICAL ENGINEERING, 2018, 232 (06) :635-649
[48]   Improved robustness for deep learning-based segmentation of multi-center myocardial perfusion cardiovascular MRI datasets using data-adaptive uncertainty-guided space-time analysis [J].
Yalcinkaya, Dilek M. ;
Youssef, Khalid ;
Heydari, Bobak ;
Wei, Janet ;
Merz, C. Noel Bairey ;
Judd, Robert ;
Dharmakumar, Rohan ;
Simonetti, Orlando P. ;
Weinsaft, Jonathan W. ;
Raman, Subha V. ;
Sharif, Behzad .
JOURNAL OF CARDIOVASCULAR MAGNETIC RESONANCE, 2024, 26 (02)