CNN Based Malicious Website Detection by Invalidating Multiple Web Spams

被引:18
|
作者
Liu, Dongjie [1 ,2 ]
Lee, Jong-Hyouk [3 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100190, Peoples R China
[3] Sejong Univ, Dept Comp & Informat Secur, Seoul 13557, South Korea
关键词
Machine learning; Internet; Browsers; Uniform resource locators; Support vector machines; Feature extraction; Crawlers; Convolutional neural network; machine learning; malicious website detection; NEURAL-NETWORK; DEEP CNN;
D O I
10.1109/ACCESS.2020.2995157
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although a variety of techniques to detect malicious websites have been proposed, it becomes more and more difficult for those methods to provide a satisfying result nowadays. Many malicious websites can still escape detection with various Web spam techniques. In this paper, we first summarize three types of Web spam techniques used by malicious websites, such as redirection spam, hidden IFrame spam, and content hiding spam. We then present a new detection method that adopts the perspective of users and takes screenshots of malicious webpages to invalidate Web spams. The proposed detection method uses a Convolutional Neural Network, which is a class of deep neural networks, as a classification algorithm. In order to verify the effectiveness of the method, two different experiments have been conducted. First, the proposed method was tested based on a constructed complex dataset. We present comparison results between the proposed method and representative machine learning-based detection algorithms. Second, the proposed method was tested to detect malicious websites in a real-world Web environment for three months. These experimental results illustrate that the proposed method has a better performance and is applicable to a practical Web environment.
引用
收藏
页码:97258 / 97266
页数:9
相关论文
共 50 条
  • [21] Feature optimization and hybrid classification for malicious web page detection
    Deng, Weiping
    Peng, Yan
    Yang, Fan
    Song, Jun
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (16)
  • [22] VPNDroid: Malicious Android VPN Detection Using a CNN-RF Method
    Polatidis, Nikolaos
    Pimenidis, Elias
    Trovati, Marcello
    Iliadis, Lazaros
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 444 - 453
  • [23] Detecting Malicious Behavior in Microservice Based Web Applications
    Ozbek, Mustafa
    Sandikkaya, Mehmet Tahir
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [24] Frequency Domain Feature Based Robust Malicious Traffic Detection
    Fu, Chuanpu
    Li, Qi
    Shen, Meng
    Xu, Ke
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (01) : 452 - 467
  • [25] JABBERWOCK: A Tool for WebAssembly Dataset Generation towards Malicious Website Detection
    Komiya, Chika
    Yanai, Naoto
    Yamashita, Kyosuke
    Okamura, Shingo
    2023 53RD ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOPS, DSN-W, 2023, : 36 - 39
  • [26] Detection of Malicious Requests on Web Logs Using Data Mining Techniques
    Sahin, Mehmet Emin
    Ozdemir, Suat
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 463 - 468
  • [27] EINSPECT: Evolution-Guided Analysis and Detection of Malicious Web Pages
    Eshete, Birhanu
    Villafiorita, Adolfo
    Weldemariam, Komminist
    Zulkernine, Mohammad
    2013 IEEE 37TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2013, : 375 - 380
  • [28] CNN-Based Object Detection via Segmentation Capabilities in Outdoor Natural Scenes
    Naseer, Aysha
    Al Mudawi, Naif
    Abdelhaq, Maha
    Alonazi, Mohammed
    Alazeb, Abdulwahab
    Algarni, Asaad
    Jalal, Ahmad
    IEEE ACCESS, 2024, 12 : 84984 - 85000
  • [29] Malicious Domain Detection Based on Decision Tree
    Thein, Thin Tharaphe
    Shiraishi, Yoshiaki
    Morii, Masakatu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (09) : 1490 - 1494
  • [30] Detection of malicious nodes based on consortium blockchain
    Luo S.
    Lai L.
    Hu T.
    Hu X.
    PeerJ Computer Science, 2024, 10 : 1 - 26