Malicious web content detection by machine learning

被引:60
作者
Hou, Yung-Tsung [1 ]
Chang, Yimeng [2 ]
Chen, Tsuhan [2 ]
Laih, Chi-Sung [3 ]
Chen, Chia-Mei [1 ]
机构
[1] Natl Sun Yat Sen Univ, Kaohsiung 80424, Taiwan
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Natl Cheng Kung Univ, Tainan 70101, Taiwan
关键词
Dynamic [!text type='HTML']HTML[!/text; Malicious webpage; Machine learning;
D O I
10.1016/j.eswa.2009.05.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent development of the dynamic HTML gives attackers a new and powerful technique to compromise computer systems. A Malicious dynamic HTML code is usually embedded in a normal webpage. The malicious webpage infects the victim when a user browses it. Furthermore such DHTML code can disguise itself easily through obfuscation or transformation, which makes the detection even harder. Anti-virus software packages commonly use signature-based approaches which might not be able to efficiently identify camouflaged malicious HTML codes. Therefore, our paper proposes a malicious web page detection using the technique of machine learning. Our study analyzes the characteristic of a malicious webpage systematically and presents important features for machine learning. Experimental results demonstrate that our method is resilient to code obfuscations and can correctly determine whether a webpage is malicious or not. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:55 / 60
页数:6
相关论文
共 50 条
  • [31] Lexical features based malicious URL detection using machine learning techniques
    Saleem Raja, A.
    Vinodini, R.
    Kavitha, A.
    MATERIALS TODAY-PROCEEDINGS, 2021, 47 : 163 - 166
  • [32] Employing machine learning based malicious signal detection for cognitive radio networks
    Turkyilmaz, Yasin
    Senturk, Arafat
    Bayrakdar, Muhammed Enes
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (02)
  • [33] Machine Learning for Implanted Malicious Code Detection with Incompletely Specified System Implementations
    Hsu, Yating
    Lee, David
    2011 19TH IEEE INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2011,
  • [34] Machine Learning-Based Malicious X.509 Certificates' Detection
    Li, Jiaxin
    Zhang, Zhaoxin
    Guo, Changyong
    APPLIED SCIENCES-BASEL, 2021, 11 (05): : 1 - 24
  • [35] Machine Learning-Based Detection and Categorization of Malicious Accounts on Social Media
    Bhattacharyya, Ajay
    Kulkarni, Adita
    SOCIAL COMPUTING AND SOCIAL MEDIA, PT I, SCSM 2024, 2024, 14703 : 328 - 337
  • [36] Detection of Malicious Software by Analyzing Distinct Artifacts Using Machine Learning and Deep Learning Algorithms
    Ashik, Mathew
    Jyothish, A.
    Anandaram, S.
    Vinod, P.
    Mercaldo, Francesco
    Martinelli, Fabio
    Santone, Antonella
    ELECTRONICS, 2021, 10 (14)
  • [37] A Review on Machine Learning Approaches for Network Malicious Behavior Detection in Emerging Technologies
    Rabbani, Mahdi
    Wang, Yongli
    Khoshkangini, Reza
    Jelodar, Hamed
    Zhao, Ruxin
    Bagheri Baba Ahmadi, Sajjad
    Ayobi, Seyedvalyallah
    ENTROPY, 2021, 23 (05)
  • [38] Machine learning for encrypted malicious traffic detection: Approaches, datasets and comparative study
    Wang, Zihao
    Fok, Kar Wai
    Thing, Vrizlynn L. L.
    COMPUTERS & SECURITY, 2022, 113
  • [39] Feature mining for encrypted malicious traffic detection with deep learning and other machine learning algorithms
    Wang, Zihao
    Thing, Vrizlynn L. L.
    COMPUTERS & SECURITY, 2023, 128
  • [40] A blockchain and stacked machine learning approach for malicious nodes’ detection in internet of things
    Shakira Musa Baig
    Muhammad Umar Javed
    Ahmad Almogren
    Nadeem Javaid
    Mohsin Jamil
    Peer-to-Peer Networking and Applications, 2023, 16 : 2811 - 2832