Feature Selection for Malware Detection Based on Reinforcement Learning

被引:30
作者
Fang, Zhiyang [1 ]
Wang, Junfeng [1 ]
Geng, Jiaxuan [1 ]
Kan, Xuan [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; malware detection; deep reinforcement learning; Q-learning; MALICIOUS CODE;
D O I
10.1109/ACCESS.2019.2957429
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning based malware detection has been proved great success in the past few years. Most of the conventional methods are based on supervised learning, which relies on static features with labels. While selecting static features requires both human expertise and labor. New selections, which fix features from a wide range, are handcrafted by careful manual experimentation or modified from existing methods. Despite their success, the static features are still hard to be determined. In this paper, a Deep Q-learning based Feature Selection Architecture (DQFSA) is introduced to cover the deficiencies of traditional methods. The proposed architecture automatically selects a small set of highly differentiated features for malware detection task without human intervention. DQFSA trains an agent through Q-learning to maximize the expected accuracy of the classifiers on a validation dataset by sequentially interacting with the features space. The agent, based on an epsilon-greedy exploration strategy and experience replay, explores a large but finite space of possible actions and iteratively discovers selections with improved performance on the learning task. Actions are a set of reasonable choices, which indicate whether a feature is chosen or not. Extensive experimental results indicate that the proposed DQFSA outperforms existing baseline approaches for feature selection on malware detection with minimum features, improves the generalization performance of the learning model and reduces human intervention. More specifically, the proposed architecture's underlying representation is robust enough for re-calibrating models to other domains of information security.
引用
收藏
页码:176177 / 176187
页数:11
相关论文
共 31 条
[1]  
ANDERSON HS, 2018, ARXIV180108917
[2]  
[Anonymous], 1998, REINFORCEMENT LEARNI
[3]  
[Anonymous], 2015, P INT C LEARN REPR I
[4]   Improving malware detection using multi-view ensemble learning [J].
Bai, Jinrong ;
Wang, Junfeng .
SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (17) :4227-4241
[5]   A Malware Detection Scheme Based on Mining Format Information [J].
Bai, Jinrong ;
Wang, Junfeng ;
Zou, Guozhong .
SCIENTIFIC WORLD JOURNAL, 2014,
[6]  
Baker B., 2017, INT C LEARNING REPRE
[7]  
Bazrafshan Z, 2013, 2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), P113, DOI 10.1109/IKT.2013.6620049
[8]  
Coggan M., 2004, TECH REP
[9]  
El Merabet H, 2019, INT J ADV COMPUT SC, V10, P366
[10]  
Elovici Y, 2007, LECT NOTES ARTIF INT, V4667, P44