Malicious web content detection by machine learning

Cited by: 60
Authors
Hou, Yung-Tsung [1]
Chang, Yimeng [2]
Chen, Tsuhan [2]
Laih, Chi-Sung [3]
Chen, Chia-Mei [1]
Affiliations
[1] Natl Sun Yat Sen Univ, Kaohsiung 80424, Taiwan
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Natl Cheng Kung Univ, Tainan 70101, Taiwan
Keywords
Dynamic HTML; Malicious webpage; Machine learning
DOI
10.1016/j.eswa.2009.05.023
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The recent development of dynamic HTML (DHTML) gives attackers a new and powerful technique for compromising computer systems. Malicious DHTML code is usually embedded in an otherwise normal webpage and infects the victim when the user browses that page. Furthermore, such code can easily disguise itself through obfuscation or transformation, which makes detection even harder. Anti-virus software packages commonly use signature-based approaches, which might not be able to efficiently identify camouflaged malicious HTML code. Therefore, our paper proposes a malicious webpage detection method based on machine learning. Our study systematically analyzes the characteristics of malicious webpages and presents important features for machine learning. Experimental results demonstrate that our method is resilient to code obfuscation and can correctly determine whether a webpage is malicious. © 2009 Elsevier Ltd. All rights reserved.
Pages: 55-60
Number of pages: 6
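
The abstract describes extracting features from a webpage and passing them to a machine-learning classifier, but this record does not list the paper's actual features or learning algorithm. The following is only a minimal sketch of that general idea, under stated assumptions: the feature set (counts of `eval`, `unescape`, `document.write`, and `fromCharCode` calls, longest script token, escape-sequence ratio, zero-sized iframes), the nearest-centroid classifier, the two sample pages, and all function names are hypothetical illustrations, not the authors' method.

```python
import re
from html.parser import HTMLParser

# Hypothetical feature set; the record does not list the paper's features.
SUSPICIOUS_CALLS = ("eval", "unescape", "document.write", "fromCharCode")
ESCAPE_RE = re.compile(r"%u[0-9a-fA-F]{4}|%[0-9a-fA-F]{2}|\\x[0-9a-fA-F]{2}")


class PageScanner(HTMLParser):
    """Collects inline script text and counts zero-sized iframes."""

    def __init__(self):
        super().__init__()
        self.in_script = False
        self.script_chunks = []
        self.hidden_iframes = 0

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            self.in_script = True
        elif tag == "iframe":
            attr = dict(attrs)
            if attr.get("width") == "0" or attr.get("height") == "0":
                self.hidden_iframes += 1

    def handle_endtag(self, tag):
        if tag == "script":
            self.in_script = False

    def handle_data(self, data):
        if self.in_script:
            self.script_chunks.append(data)


def extract_features(html):
    """Map a page to a fixed-order list of numeric features.

    Order: one call count per SUSPICIOUS_CALLS entry, longest script token,
    fraction of script characters inside escape sequences, hidden iframes.
    """
    scanner = PageScanner()
    scanner.feed(html)
    scanner.close()
    script = "".join(scanner.script_chunks)
    tokens = script.split()
    escaped = sum(len(m.group()) for m in ESCAPE_RE.finditer(script))
    features = [
        float(len(re.findall(re.escape(name) + r"\s*\(", script)))
        for name in SUSPICIOUS_CALLS
    ]
    features.append(float(max((len(t) for t in tokens), default=0)))
    features.append(escaped / len(script) if script else 0.0)
    features.append(float(scanner.hidden_iframes))
    return features


def centroid(vectors):
    """Component-wise mean of equally sized feature vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def classify(vector, benign_centroid, malicious_centroid):
    """Nearest-centroid rule (a stand-in for whatever learner is used)."""
    def dist2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    closer_to_malicious = dist2(vector, malicious_centroid) < dist2(vector, benign_centroid)
    return "malicious" if closer_to_malicious else "benign"


# Toy training pages, invented for illustration only.
BENIGN_SAMPLE = "<html><body><script>var x = 1; document.title = 'hi';</script></body></html>"
MALICIOUS_SAMPLE = (
    '<html><body><iframe src="http://example.invalid/" width="0" height="0"></iframe>'
    '<script>eval(unescape("%u0041%u0042%u0043%u0044"));</script></body></html>'
)

if __name__ == "__main__":
    benign_c = centroid([extract_features(BENIGN_SAMPLE)])
    malicious_c = centroid([extract_features(MALICIOUS_SAMPLE)])
    page = '<script>document.write(unescape("%3Cscript%20src%3Dx%3E"));</script>'
    print(classify(extract_features(page), benign_c, malicious_c))
```

Because these features describe the shape of the script (call patterns, token length, escape density) rather than exact byte strings, re-encoding a payload changes them less than it changes a fixed signature. A real system would need many labeled pages, feature scaling, and a stronger learner than this two-sample toy.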