CNN Based Malicious Website Detection by Invalidating Multiple Web Spams

被引:18
|
作者
Liu, Dongjie [1 ,2 ]
Lee, Jong-Hyouk [3 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100190, Peoples R China
[3] Sejong Univ, Dept Comp & Informat Secur, Seoul 13557, South Korea
关键词
Machine learning; Internet; Browsers; Uniform resource locators; Support vector machines; Feature extraction; Crawlers; Convolutional neural network; machine learning; malicious website detection; NEURAL-NETWORK; DEEP CNN;
D O I
10.1109/ACCESS.2020.2995157
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although a variety of techniques to detect malicious websites have been proposed, it becomes more and more difficult for those methods to provide a satisfying result nowadays. Many malicious websites can still escape detection with various Web spam techniques. In this paper, we first summarize three types of Web spam techniques used by malicious websites, such as redirection spam, hidden IFrame spam, and content hiding spam. We then present a new detection method that adopts the perspective of users and takes screenshots of malicious webpages to invalidate Web spams. The proposed detection method uses a Convolutional Neural Network, which is a class of deep neural networks, as a classification algorithm. In order to verify the effectiveness of the method, two different experiments have been conducted. First, the proposed method was tested based on a constructed complex dataset. We present comparison results between the proposed method and representative machine learning-based detection algorithms. Second, the proposed method was tested to detect malicious websites in a real-world Web environment for three months. These experimental results illustrate that the proposed method has a better performance and is applicable to a practical Web environment.
引用
收藏
页码:97258 / 97266
页数:9
相关论文
共 50 条
  • [1] Learning URL Embedding for Malicious Website Detection
    Yan, Xiaodan
    Xu, Yang
    Cui, Baojiang
    Zhang, Shuhan
    Guo, Taibiao
    Li, Chaoliang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (10) : 6673 - 6681
  • [2] Investigating the Influence of Feature Sources for Malicious Website Detection
    Chaiban, Ahmad
    Sovilj, Dusan
    Soliman, Hazem
    Salmon, Geoff
    Lin, Xiaodong
    APPLIED SCIENCES-BASEL, 2022, 12 (06):
  • [3] Adaptive segmented webpage text based malicious website detection
    Sun, Guoying
    Zhang, Zhaoxin
    Cheng, Yanan
    Chai, Tingting
    COMPUTER NETWORKS, 2022, 216
  • [4] Multi-Modal Features Representation-Based Convolutional Neural Network Model for Malicious Website Detection
    Alsaedi, Mohammed
    Ghaleb, Fuad A.
    Saeed, Faisal
    Ahmad, Jawad
    Alasli, Mohammed
    IEEE ACCESS, 2024, 12 : 7271 - 7284
  • [5] A Deep Learning-Based Framework for Phishing Website Detection
    Tang, Lizhen
    Mahmoud, Qusay H.
    IEEE ACCESS, 2022, 10 : 1509 - 1521
  • [6] Malicious Web Content Detection Using Machine Leaning
    Desai, Anand
    Jatakia, Janvi
    Naik, Rohit
    Raul, Nataasha
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 1432 - 1436
  • [7] CNN-Webshell: Malicious Web Shell Detection with Convolutional Neural Network
    Tian, Yifan
    Wang, Jiabao
    Zhou, Zhenji
    Zhou, Shengli
    PROCEEDINGS OF 2017 VI INTERNATIONAL CONFERENCE ON NETWORK, COMMUNICATION AND COMPUTING (ICNCC 2017), 2017, : 75 - 79
  • [8] Detection of malicious and non-malicious website visitors using unsupervised neural network learning
    Stevanovic, Dusan
    Vlajic, Natalija
    An, Aijun
    APPLIED SOFT COMPUTING, 2013, 13 (01) : 698 - 708
  • [9] Machine Learning & Concept Drift based Approach for Malicious Website Detection
    Singhal, Siddharth
    Chawla, Utkarsh
    Shorey, Rajeev
    2020 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2020,
  • [10] MALICIOUS WEBSITE DETECTION UNDER THE EXPLORATORY ATTACK
    Wang, Manlin
    Zhang, Fei
    Chan, Patrick P. K.
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 565 - 570