Malicious web content detection by machine learning

被引:60
作者
Hou, Yung-Tsung [1 ]
Chang, Yimeng [2 ]
Chen, Tsuhan [2 ]
Laih, Chi-Sung [3 ]
Chen, Chia-Mei [1 ]
机构
[1] Natl Sun Yat Sen Univ, Kaohsiung 80424, Taiwan
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Natl Cheng Kung Univ, Tainan 70101, Taiwan
关键词
Dynamic [!text type='HTML']HTML[!/text; Malicious webpage; Machine learning;
D O I
10.1016/j.eswa.2009.05.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent development of the dynamic HTML gives attackers a new and powerful technique to compromise computer systems. A Malicious dynamic HTML code is usually embedded in a normal webpage. The malicious webpage infects the victim when a user browses it. Furthermore such DHTML code can disguise itself easily through obfuscation or transformation, which makes the detection even harder. Anti-virus software packages commonly use signature-based approaches which might not be able to efficiently identify camouflaged malicious HTML codes. Therefore, our paper proposes a malicious web page detection using the technique of machine learning. Our study analyzes the characteristic of a malicious webpage systematically and presents important features for machine learning. Experimental results demonstrate that our method is resilient to code obfuscations and can correctly determine whether a webpage is malicious or not. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:55 / 60
页数:6
相关论文
共 50 条
  • [41] A machine learning technique for Android malicious attacks detection based on API calls
    AL-Akhrasa, Mousa
    Alghamdib, Saud
    Omarc, Hani
    Alshareefb, Hazzaa
    DECISION SCIENCE LETTERS, 2024, 13 (01) : 29 - 44
  • [42] Identifying Malicious Web Domains Using Machine Learning Techniques with Online Credibility and Performance Data
    Hu, Zhongyi
    Chiong, Raymond
    Pranata, Ilug
    Susilo, Willy
    Bao, Yukun
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 5186 - 5194
  • [43] Machine Learning Based Phishing Web Sites Detection
    Huu Hieu Nguyen
    Duc Thai Nguyen
    AETA 2015: RECENT ADVANCES IN ELECTRICAL ENGINEERING AND RELATED SCIENCES, 2016, 371 : 123 - 131
  • [44] Malicious Network Traffic Detection for DNS over HTTPS using Machine Learning Algorithms
    Casanova, Lionel F. Gonzalez
    Lin, Po-Chiang
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (02)
  • [45] BGP Dataset-Based Malicious User Activity Detection Using Machine Learning
    Park, Hansol
    Kim, Kookjin
    Shin, Dongil
    Shin, Dongkyoo
    INFORMATION, 2023, 14 (09)
  • [46] A Study on Detection of Malicious Behavior Based on Host Process Data Using Machine Learning
    Han, Ryeobin
    Kim, Kookjin
    Choi, Byunghun
    Jeong, Youngsik
    APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [47] Detection of Malicious Threats Exploiting Clock-Gating Hardware Using Machine Learning
    Kose, Nuri Alperen
    Jinad, Razaq
    Rasheed, Amar
    Shashidhar, Narasimha
    Baza, Mohamed
    Alshahrani, Hani
    SENSORS, 2024, 24 (03)
  • [48] Black box attack and network intrusion detection using machine learning for malicious traffic
    Zhu, Yiran
    Cui, Lei
    Ding, Zhenquan
    Li, Lun
    Liu, Yongji
    Hao, Zhiyu
    COMPUTERS & SECURITY, 2022, 123
  • [49] Robust Malicious Executable Detection Using Host-Based Machine Learning Classifier
    Soliman, Khaled
    Sobh, Mohamed
    Bahaa-Eldin, Ayman M.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (01): : 1419 - 1439
  • [50] A Comprehensive Study on Efficient and Accurate Machine Learning-Based Malicious PE Detection
    Barut, Onur
    Zhang, Tong
    Luo, Yan
    Li, Peilong
    2023 IEEE 20TH CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2023,