Scalable malware detection system using big data and distributed machine learning approach

被引:0
作者
Manish Kumar
机构
[1] M. S. Ramaiah Institute of Technology,Department of Master of Computer Applications
来源
Soft Computing | 2022年 / 26卷
关键词
Malware; Big data; Machine learning; Static analysis; Dynamic analysis; Locality-sensitive hashing;
D O I
暂无
中图分类号
学科分类号
摘要
Computer, Internet, and Smartphone have changed our life as never before. Today, we cannot even imagine our life without these technologies. If we look around, we find everything, everywhere connected and controlled by system and software. We find amazing software and mobile applications which have become nerve of our daily life. Our dependency on this software and systems is so and so much that it is scary even to imagine, what if this system fails at any point in time. There is always a threat surrounded by various types of cyber-attacks. Every day cybercriminals are evolving their attacking strategy. Cyber-attacks using ever-more sophisticated malware are the major cause of concern for all types of users. Cyber-world has witnessed rapid changes in malware attacking strategy in the recent past. The volume, velocity, and complexity of malware are posing new challenges for malware detection systems. A scalable malware detection system with the capability to detect complex attacks is the time of need. In this paper, we have proposed a scalable malware detection system using big data and a machine learning approach. The machine learning model proposed in the system is implemented using Apache Spark which supports distributed learning. Locality-sensitive hashing is used for malware detection, which significantly reduces the malware detection time. A five-stage iterative process has been used to carry out the implementation and experimental analysis. The proposed model shown in the paper has achieved 99.8% accuracy. The proposed model has also significantly reduced the learning and malware detection time compared to models proposed by other researchers.
引用
收藏
页码:3987 / 4003
页数:16
相关论文
共 74 条
[1]  
Ali M(2020)scalable malware clustering using multi-stage tree parallelization IEEE Int Conf Intell Secur Informatics (ISI) 2020 1-6
[2]  
Hagen J(2018)Robust malware detection for internet of (Battlefield) things devices using deep eigenspace learning IEEE Trans Sustain Comput 4 88-95
[3]  
Oliver J(2020)Systematic approach to malware analysis (SAMA) Appl Sci 10 1360-410
[4]  
Azmoodeh A(2018)Malware classification using self organising feature maps and machine activity data Comput Secur 73 399-908
[5]  
Dehghantanha A(2021)A learning-based static malware detection system with integrated feature Intell Autom Soft Comput 27 891-377
[6]  
Choo KKR(2016)Malware analysis and classification using sequence alignments Intell Autom Soft Comput 22 371-3196
[7]  
Bermejo Higuera J(2020)Combined kNN classification and hierarchical similarity hash for fast malware detection Appl Sci 10 5173-121
[8]  
Abad Aramburu C(2018)Detection of malicious code variants based on deep learning IEEE Trans Industr Inf 14 3187-18
[9]  
Bermejo Higuera JR(2018)Big data framework for zero-day malware detection Cybern Syst 49 103-3225
[10]  
Sicilia Urban MA(2018)Classification of malware analytics techniques: a systematic literature review Int J Secur Appl 12 9-118