A malware detection model based on a negative selection algorithm with penalty factor

被引：16

作者：

Zhang PengTao ^{[1
,2
]}

Wang Wei ^{[1
,2
]}

Tan Ying ^{[1
,2
]}

机构：

[1] Peking Univ, Sch Elect Engn & Comp Sci, Dept Machine Intelligence, Beijing 100871, Peoples R China

[2] Peking Univ, Minist Educ, Key Lab Machine Percept, Beijing 100871, Peoples R China

来源：

SCIENCE CHINA-INFORMATION SCIENCES | 2010年 / 53卷 / 12期

基金：

中国国家自然科学基金;

关键词：

penalty factor; negative selection algorithm; signature extraction; artificial immune system; malware detection;

D O I：

10.1007/s11432-010-4123-5

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A malware detection model based on a negative selection algorithm with penalty factor (NSAPF) is proposed in this paper. This model extracts a malware instruction library (MIL), containing instructions that tend to appear in malware, through deep instruction analysis with respect to instruction frequency and file frequency. From the MIL, the proposed model creates a malware candidate signature library (MCSL) and a benign program malware-like signature library (BPMSL) by splitting programs orderly into various short bit strings. Depending on whether a signature matches "self", the NSAPF further divides the MCSL into two malware detection signature libraries (MDSL1 and MDSL2), and uses these as a two-dimensional reference for detecting suspicious programs. The model classifies suspicious programs as malware and benign programs by matching values of the suspicious programs with MDSL1 and MDSL2. Introduction of a penalty factor C in the negative selection algorithm enables this model to overcome the drawback of traditional negative selection algorithms in defining the harmfulness of "self" and "nonself", and focus on the harmfulness of the code, thus greatly improving the effectiveness of the model and also enabling the model to satisfy the different requirements of users in terms of true positive and false positive rates. Experimental results confirm that the proposed model achieves a better true positive rate on completely unknown malware and a better generalization ability while keeping a low false positive rate. The model can balance and adjust the true positive and false positive rates by adjusting the penalty factor C to achieve better performance.

引用

页码：2461 / 2471

页数：11

共 15 条

[1] A framework for evolving multi-shaped detectors in negative selection
Balachandran, Sankalp
Dasgupta, Dipankar
Nino, Fernando
Garrett, Deon
[J]. 2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 401 - +
[2] Deng PS, 2003, 37TH ANNUAL 2003 INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY, PROCEEDINGS, P600
[3] Edge KS, 2006, GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, P103
[4] FORREST S, 1994, 1994 IEEE COMPUTER SOCIETY SYMPOSIUM ON RESEARCH IN SECURITY AND PRIVACY, PROCEEDINGS, P202
[5] A sense of self for unix processes
Forrest, S
Hofmeyr, SA
Somayaji, A
Longstaff, TA
[J]. 1996 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, PROCEEDINGS, 1996, : 120 - 128
[6] Henchiri O, 2006, IEEE DATA MINING, P891
[7] Karnik A, 2007, AMS 2007: FIRST ASIA INTERNATIONAL CONFERENCE ON MODELLING & SIMULATION ASIA MODELLING SYMPOSIUM, PROCEEDINGS, P165
[8] Kim J, 2001, IEEE C EVOL COMPUTAT, P1244, DOI 10.1109/CEC.2001.934333
[9] LEE H, 2004, ICCS 2004, P499
[10] Dynamic detection for computer virus based on immune system
Li Tao
[J]. SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2008, 51 (10): : 1475 - 1486

← 1 2 →