Efficient Detection and Classification of Internet-of-Things Malware Based on Byte Sequences from Executable Files

被引:28
作者
Wan, Tzu-Ling [1 ]
Ban, Tao [3 ]
Cheng, Shin-Ming [1 ,2 ]
Lee, Yen-Ting [1 ]
Sun, Bo [4 ]
Isawa, Ryoichi [3 ]
Takahashi, Takeshi [3 ]
Inoue, Daisuke [3 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 106, Taiwan
[3] Natl Inst Informat & Commun Technol, Koganei, Tokyo 1848795, Japan
[4] Saitama Inst Technol, Saitama 3690293, Japan
来源
IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY | 2020年 / 1卷 / 01期
关键词
Computer security; machine learning; binary code; malware analysis; static analysis; SECURITY;
D O I
10.1109/OJCS.2020.3033974
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Simple implementation and autonomous operation features make the Internet-of-Things (IoT) vulnerable to malware attacks. Static analysis of IoT malware executable files is a feasible approach to understanding the behavior of IoT malware for mitigation and prevention. However, current analytic approaches based on opcodes or call graphs typically do not work well with diversity in central processing unit (CPU) architectures and are often resource intensive. In this paper, we propose an efficient method for leveraging machine learning methods to detect and classify IoT malware programs. We show that reliable and efficient detection and classification can be achieved by exploring the essential discriminating information stored in the byte sequences at the entry points of executable programs. We demonstrate the performance of the proposed method using a large-scale dataset consisting of 111K benignware and 111K malware programs from seven CPU architectures. The proposed method achieves near optimal generalization performance for malware detection (99.96% accuracy) and for malware family classification (98.47% accuracy). Moreover, when CPU architecture information is considered in learning, the proposed method combined with support vector machine classifiers can yield even higher generalization performance using fewer bytes from the executable files. The findings in this paper are promising for implementing light-weight malware protection on IoT devices with limited resources.
引用
收藏
页码:262 / 275
页数:14
相关论文
共 29 条
[1]   Analyzing and Detecting Emerging Internet of Things Malware: A Graph-Based Approach [J].
Alasmary, Hisham ;
Khormali, Aminollah ;
Anwar, Afsah ;
Park, Jeman ;
Choi, Jinchun ;
Abusnaina, Ahmed ;
Awad, Amro ;
Nyang, Daehun ;
Mohaisen, Aziz .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05) :8977-8988
[2]   Robust Malware Detection for Internet of (Battlefield) Things Devices Using Deep Eigenspace Learning [J].
Azmoodeh, Amin ;
Dehghantanha, Ali ;
Choo, Kim-Kwang Raymond .
IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2019, 4 (01) :88-95
[3]   Efficient Malware Packer Identification Using Support Vector Machines with Spectrum Kernel [J].
Ban, Tao ;
Isawa, Ryoichi ;
Guo, Shanqing ;
Inoue, Daisuke ;
Nakao, Koji .
2013 EIGHTH ASIA JOINT CONFERENCE ON INFORMATION SECURITY (ASIAJCIS), 2013, :69-76
[4]  
Bohio M. J., 2015, ANAL BACKDOOR BOT MI
[5]   Traffic-Aware Patching for Cyber Security in Mobile IoT [J].
Cheng, Shin-Ming ;
Chen, Pin-Yu ;
Lin, Ching-Chao ;
Hsiao, Hsu-Chun .
IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (07) :29-35
[6]  
Costin A., 2018, BLACKHAT US
[7]   An opcode-based technique for polymorphic Internet of Things malware detection [J].
Darabian, Hamid ;
Dehghantanha, Ali ;
Hashemi, Sattar ;
Homayoun, Sajad ;
Choo, Kim-Kwang Raymond .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (06)
[8]   Malware detection based on deep learning algorithm [J].
Ding Yuxin ;
Zhu Siyi .
NEURAL COMPUTING & APPLICATIONS, 2019, 31 (02) :461-472
[9]  
E. Foundation, 2019, IOT COMM AD SURV 201
[10]  
Edwards L, 2016, ROUTL COMPANION BUS, P16