Malicious web content detection by machine learning

被引:60
|
作者
Hou, Yung-Tsung [1 ]
Chang, Yimeng [2 ]
Chen, Tsuhan [2 ]
Laih, Chi-Sung [3 ]
Chen, Chia-Mei [1 ]
机构
[1] Natl Sun Yat Sen Univ, Kaohsiung 80424, Taiwan
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Natl Cheng Kung Univ, Tainan 70101, Taiwan
关键词
Dynamic [!text type='HTML']HTML[!/text; Malicious webpage; Machine learning;
D O I
10.1016/j.eswa.2009.05.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent development of the dynamic HTML gives attackers a new and powerful technique to compromise computer systems. A Malicious dynamic HTML code is usually embedded in a normal webpage. The malicious webpage infects the victim when a user browses it. Furthermore such DHTML code can disguise itself easily through obfuscation or transformation, which makes the detection even harder. Anti-virus software packages commonly use signature-based approaches which might not be able to efficiently identify camouflaged malicious HTML codes. Therefore, our paper proposes a malicious web page detection using the technique of machine learning. Our study analyzes the characteristic of a malicious webpage systematically and presents important features for machine learning. Experimental results demonstrate that our method is resilient to code obfuscations and can correctly determine whether a webpage is malicious or not. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:55 / 60
页数:6
相关论文
共 50 条
  • [1] Malicious Web Content Detection Using Machine Leaning
    Desai, Anand
    Jatakia, Janvi
    Naik, Rohit
    Raul, Nataasha
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 1432 - 1436
  • [2] Malicious Webpage Classification Based on Web Content Features using Machine Learning and Deep Learning
    Raja, Saleem A.
    Sundarvadivazhagan, B.
    Vijayarangan, R.
    Veeramani, S.
    2022 INTERNATIONAL CONFERENCE ON GREEN ENERGY, COMPUTING AND SUSTAINABLE TECHNOLOGY (GECOST), 2022, : 314 - 319
  • [3] Popularity-Based Detection of Malicious Content in Facebook Using Machine Learning Approach
    Sahoo, Somya Ranjan
    Gupta, B. B.
    FIRST INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR COMPUTATIONAL INTELLIGENCE, 2020, 1045 : 163 - 176
  • [4] Detection of malicious URLs using machine learning
    Reyes-Dorta, Nuria
    Caballero-Gil, Pino
    Rosa-Remedios, Carlos
    WIRELESS NETWORKS, 2024, 30 (09) : 7543 - 7560
  • [5] Malicious URL Detection Using Machine Learning
    Hani, Dr Raed Bani
    Amoura, Motasem
    Ammourah, Mohammad
    Abu Khalil, Yazeed
    2024 15TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS, ICICS 2024, 2024,
  • [6] Malicious URL Detection based on Machine Learning
    Cho Do Xuan
    Hoa Dinh Nguyen
    Nikolaevich, Tisenko Victor
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (01) : 148 - 153
  • [7] Obfuscated Malicious Java']JavaScript Detection by Machine Learning
    Pan, Jinkun
    Mao, Xiaoguang
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS (AMEII 2016), 2016, 73 : 805 - 810
  • [8] Malicious URL and Intrusion Detection using Machine Learning
    Hamza, Amr
    Hammam, Farah
    Abouzeid, Medhat
    Ahmed, Mohammad Arsalan
    Dhou, Salam
    Aloul, Fadi
    38TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN 2024, 2024, : 795 - 800
  • [9] Exploring and Identifying Malicious Sites in Dark Web Using Machine Learning
    Kawaguchi, Yuki
    Ozawa, Seiichi
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT III, 2019, 11955 : 319 - 327
  • [10] Empirical Study on Malicious URL Detection Using Machine Learning
    Patgiri, Ripon
    Katari, Hemanth
    Kumar, Ronit
    Sharma, Dheeraj
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, ICDCIT 2019, 2019, 11319 : 380 - 388