CNN Based Malicious Website Detection by Invalidating Multiple Web Spams

被引：18

作者：

Liu, Dongjie ^{[1
,2
]}

Lee, Jong-Hyouk ^{[3
]}

机构：

[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China

[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100190, Peoples R China

[3] Sejong Univ, Dept Comp & Informat Secur, Seoul 13557, South Korea

来源：

IEEE ACCESS | 2020年 / 8卷

关键词：

Machine learning; Internet; Browsers; Uniform resource locators; Support vector machines; Feature extraction; Crawlers; Convolutional neural network; machine learning; malicious website detection; NEURAL-NETWORK; DEEP CNN;

D O I：

10.1109/ACCESS.2020.2995157

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Although a variety of techniques to detect malicious websites have been proposed, it becomes more and more difficult for those methods to provide a satisfying result nowadays. Many malicious websites can still escape detection with various Web spam techniques. In this paper, we first summarize three types of Web spam techniques used by malicious websites, such as redirection spam, hidden IFrame spam, and content hiding spam. We then present a new detection method that adopts the perspective of users and takes screenshots of malicious webpages to invalidate Web spams. The proposed detection method uses a Convolutional Neural Network, which is a class of deep neural networks, as a classification algorithm. In order to verify the effectiveness of the method, two different experiments have been conducted. First, the proposed method was tested based on a constructed complex dataset. We present comparison results between the proposed method and representative machine learning-based detection algorithms. Second, the proposed method was tested to detect malicious websites in a real-world Web environment for three months. These experimental results illustrate that the proposed method has a better performance and is applicable to a practical Web environment.

引用

页码：97258 / 97266

页数：9

共 50 条

[41] Visualization Feature and CNN Based Homology Classification of Malicious Code
CHU Qianfeng
LIU Gongshen
ZHU Xinyu
ChineseJournalofElectronics, 2020, 29 (01) : 154 - 160
[42] Visualization Feature and CNN Based Homology Classification of Malicious Code
Chu, Qianfeng
Liu, Gongshen
Zhu, Xinyu
CHINESE JOURNAL OF ELECTRONICS, 2020, 29 (01) : 154 - 160
[43] Lightweight method for detecting JavaScript-based malicious Web pages
Ma, Hongliang
Wang, Wei
Han, Zhen
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2014, 42 (11): : 34 - 38
[44] Bearing Fault Detection Using Scalogram and Switchable Normalization-Based CNN (SN-CNN)
Neupane, Dhiraj
Kim, Yunsu
Seok, Jongwon
IEEE ACCESS, 2021, 9 : 88151 - 88166
[45] Network-based detection of Android malicious apps
Shree Garg
Sateesh K. Peddoju
Anil K. Sarje
International Journal of Information Security, 2017, 16 : 385 - 400
[46] Network-based detection of Android malicious apps
Garg, Shree
Peddoju, Sateesh K.
Sarje, Anil K.
INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2017, 16 (04) : 385 - 400
[47] Detection of Malicious Executable Files Based on Clustering of Activities
R. A. Ognev
E. V. Zhukovskii
D. P. Zegzhda
Automatic Control and Computer Sciences, 2021, 55 : 1092 - 1098
[48] Evaluating CNN and LSTM for Web Attack Detection
Wang, Jiabao
Zhou, Zhenji
Chen, Jun
PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING (ICMLC 2018), 2018, : 283 - 287
[49] Malicious PDF document detection based on mixed feature
Du X.
Lin Y.
Sun Y.
Tongxin Xuebao/Journal on Communications, 2019, 40 (02): : 118 - 128
[50] Detection of Malicious Executable Files Based on Clustering of Activities
Ognev, R. A.
Zhukovskii, E., V
Zegzhda, D. P.
AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2021, 55 (08) : 1092 - 1098

← 1 2 3 4 5 →