Application of Anomaly Detection Models to Malware Detection in the Presence of Concept Drift

被引：2

作者：

Escudero Garcia, David ^{[1
]}

DeCastro-Garcia, Noemi ^{[2
]}

机构：

[1] Univ Leon, Res Inst Appl Sci Cybersecur, Campus Vegazana S-N, Leon 24071, Spain

[2] Univ Leon, Dept Math, Campus Vegazana S-N, Leon 24071, Spain

来源：

HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2023 | 2023年 / 14001卷

关键词：

Machine learning; Malware detection; Concept drift; Cybersecurity; Anomaly detection; RECALL;

D O I：

10.1007/978-3-031-40725-3_2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Machine learning is one of the main approaches to malware detection in the literature, since machine learning models are more adaptive than signature based solutions. One of the main challenges in the application of machine learning to malware detection is the presence of concept drift, which is a change in the data distribution over time. To tackle drift, online models that can be dynamically updated passively or by actively detecting change are applied. However, these models require new instances to be labelled to update the model. Usually, labels are scarce, cannot be obtained immediately and the presence of imbalance in the data make the construction of an effective model difficult. It has been studied that concept drift has a lower impact on benign instances, so we test the effectiveness of anomaly detection models to detect malware in the presence of concept drift. Anomaly detection models only need benign instances for training, and therefore may be less affected by the scarcity of labelled malicious instances. The results show that anomaly detection models achieve better results than supervised online models in conditions of heavy data imbalance and label scarcity.

引用

页码：15 / 26

页数：12