UTANSA: Static Approach for Multi-language Malicious Web Scripts Detection

被引:2
|
作者
Huang, Weiqing [1 ,2 ]
Jia, Chenggang [1 ,2 ]
Yu, Min [1 ]
Li, Gang [3 ]
Liu, Chao [1 ]
Jiang, Jianguo [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing 100093, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing 100049, Peoples R China
[3] Deakin Univ, Sch Informat Technol, 221 Burwood Higyway, Burwood, Vic 3125, Australia
来源
26TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2021) | 2021年
关键词
Web Security; Malicious Scripts; Abstract Syntax Tree;
D O I
10.1109/ISCC53001.2021.9631400
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to detect malicious web scripts automatically, many detection methods using static features and machine learning are proposed. However, the existing detection methods can only detect web scripts of specific programming languages. This paper proposes the unified text features and abstract syntax tree(AST) node sequence features algorithm(UTANSA) that exploits the text feature classification method and AST node classification method, together with the corresponding unified method to enhance the generalization ability of the model. Through the algorithm, two unified approaches are proposed based on text features and AST node features respectively, so that the detection model can detect multi-language web scripts. We choose scripts written in the JavaScript(JS) and PHP languages for experimentation to evaluate our approach. The results show that the detection model trained with the proposed method has a similar detection effect as trained with only JS samples or PHP samples.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] AMA: Static Code Analysis of Web Page For The Detection of Malicious Scripts
    Seshagiri, Prabhu
    Vazhayil, Anu
    Sriram, Padmamala
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS, 2016, 93 : 768 - 773
  • [2] MERLIN: Multi-Language Web Vulnerability Detection
    Figueiredo, Alexandra
    Lide, Tatjana
    Matos, David
    Correia, Miguel
    2020 IEEE 19TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2020,
  • [3] MERLIN: Multi-Language Web Vulnerability Detection
    Figueiredo, Alexandra
    Lide, Tatjana
    Matos, David
    Correia, Miguel
    2020 IEEE 19th International Symposium on Network Computing and Applications, NCA 2020, 2020,
  • [4] On Multi-language Abstraction Towards a Static Analysis of Multi-language Programs
    Buro, Samuele
    Crole, Roy L.
    Mastroeni, Isabella
    STATIC ANALYSIS (SAS 2020), 2020, 12389 : 310 - 332
  • [5] On multi-language abstraction: Towards a static analysis of multi-language programs
    Buro, Samuele
    Crole, Roy
    Mastroeni, Isabella
    FORMAL METHODS IN SYSTEM DESIGN, 2023, 65 (1) : 4 - 38
  • [6] Towards Multi-Language Static Code Analysis
    Siddiqui, Sanaa
    Metta, Ravindra
    Madhukar, Kumar
    2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS, ISSREW, 2023, : 81 - 82
  • [7] Retrieval method of static code information for multi-language
    Pang, Long
    Wang, Tian-Tian
    Su, Xiao-Hong
    Ma, Pei-Jun
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2011, 43 (03): : 62 - 66
  • [8] HYBRID METHOD TO DESIGN MULTI-LANGUAGE WEB SITES
    Bizoi, Mihai
    Suduc, Ana-Maria
    Gorghiu, Gabriel
    CSEDU 2009: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED EDUCATION, VOL I, 2009, : 427 - 430
  • [9] Reusing Single-Language Analyses for Static Analysis of Multi-language Programs
    Roth, Tobias
    COMPANION PROCEEDINGS OF THE 2023 ACM SIGPLAN INTERNATIONAL CONFERENCE ON SYSTEMS, PROGRAMMING, LANGUAGES, AND APPLICATIONS: SOFTWARE FOR HUMANITY, SPLASH COMPANION 2023, 2023, : 16 - 18
  • [10] Semantic language and multi-language MT approach based on SL
    QingShi Gao
    Yue Hu
    Li Li
    XiaoYu Gao
    Journal of Computer Science and Technology, 2003, 18 : 848 - 852