An efficient malicious webpage static detection framework based on optimized Bayesian and hybrid machine learning

被引:0
作者
Yang, Fan [1 ]
Zhu, Chaoqun [1 ]
Xu, Heng [1 ]
Qian, Yongfeng [1 ]
Song, Jun [1 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
feature extraction; machine learning; malicious webpage detection; threat assessment; WEB PAGE; CODE;
D O I
10.1002/cpe.6792
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Malicious webpage detection is a crucial work in both theory and practical environment. In practical applications, static detection methods are usually regarded as a priority choice, which can quickly detect unknown malicious web pages and avoid a costly in-depth analysis. However, existing solution of static detection typically has the following problems. For example, a single static detection may lead to a higher false positive rate, and the integrated detection usually has a lower detection efficiency. In this article, we propose an efficient webpage static detection framework, especially considering both the detection efficiency and the detection accuracy. Then, on the basis of the extended feature sets from URL, HTML, and JavaScript, we introduce an optimized naive Bayesian algorithm, in which a novel amplification factor strategy is proposed. Finally, a webpage threat assessment model oriented to general machine learning is presented to achieve the refined detection. Three main properties are provided: high detection efficiency, high detection accuracy, and better applicability. Furthermore, the comprehensive experimental results and comparative analysis is given to show the advantages of the proposed framework.
引用
收藏
页数:15
相关论文
共 44 条
  • [1] Context-sensitive and keyword density-based supervised machine learning techniques for malicious webpage detection
    Altay, Betul
    Dokeroglu, Tansel
    Cosar, Ahmet
    [J]. SOFT COMPUTING, 2019, 23 (12) : 4177 - 4191
  • [2] Detecting Mobile Malicious Webpages in Real Time
    Amrutkar, Chaitrali
    Kim, Young Seuk
    Traynor, Patrick
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2017, 16 (08) : 2184 - 2197
  • [3] [Anonymous], 2014, INT J RELIABLE INF A
  • [4] [Anonymous], 2020, Digital 2020. Global Digital Overview. Essential Insights into How People Around the World Use the Internet, Mobile Devices, Social Media
  • [5] [Anonymous], 2020, K SECURITY LAB KASPE
  • [6] [Anonymous], 2016, INT J INF SECUR CYBE
  • [7] [Anonymous], 2020, TELEPORT ULTRA
  • [8] Babiker M, 2018, 2018 6TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSIC AND SECURITY (ISDFS), P344
  • [9] An Analysis of Phishing Blacklists: Google Safe Browsing, OpenPhish, and PhishTank
    Bell, Simon
    Komisarczuk, Peter
    [J]. PROCEEDINGS OF THE AUSTRALASIAN COMPUTER SCIENCE WEEK MULTICONFERENCE (ACSW 2020), 2020,
  • [10] A Multicloud-Model-Based Many-Objective Intelligent Algorithm for Efficient Task Scheduling in Internet of Things
    Cai, Xingjuan
    Geng, Shaojin
    Wu, Di
    Cai, Jianghui
    Chen, Jinjun
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12): : 9645 - 9653