Identify Vulnerability Types: A Cross-Project Multiclass Vulnerability Classification System Based on Deep Domain Adaptation

被引:0
作者
Du, Gewangzi [1 ,2 ]
Chen, Liwei [1 ,2 ]
Wu, Tongshuai [1 ,2 ]
Zhu, Chenguang [1 ,2 ]
Shi, Gang [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
来源
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT VI | 2024年 / 14452卷
基金
中国国家自然科学基金;
关键词
cyber security; multiclass classification; snippet attention; deep learning; domain adaptation;
D O I
10.1007/978-981-99-8076-5_35
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software Vulnerability Detection(SVD) is a important means to ensure system security due to the ubiquity of software. Deep learning-based approaches achieve state-of-the-art performance in SVD but one of the most crucial issues is coping with the scarcity of labeled data in projects to be detected. One reliable solution is to employ transfer learning skills to leverage labeled data from other software projects. However, existing cross-project approaches only focused on detecting whether the function code is vulnerable or not. The requirement to identify vulnerability types is essential because it offers information to patch the vulnerabilities. Our aim in this paper is to propose the first system for cross-project multiclass vulnerability classification. We detect at the granularity of code snippet, which is finer-grained compare to function and effective to catch inter-procedure vulnerability patterns. After generating code snippets, we define several principles to extract snippet attentions and build a deep model to obtain the fused deep features; We then extend different domain adaptation approaches to reduce feature distributions of different projects. Experimental results indicate that our system outperforms other state-of-the-art systems.
引用
收藏
页码:481 / 499
页数:19
相关论文
共 23 条
  • [1] [Anonymous], 2019, Checkmarx
  • [2] Behera A, 2021, AAAI CONF ARTIF INTE, V35, P929
  • [3] Hawkeye: Towards a Desired Directed Grey-box Fuzzer
    Chen, Hongxu
    Xue, Yinxing
    Li, Yuekang
    Chen, Bihuan
    Xie, Xiaofei
    Wu, Xiuheng
    Liu, Yang
    [J]. PROCEEDINGS OF THE 2018 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (CCS'18), 2018, : 2095 - 2108
  • [4] cve.mitre, CVE
  • [5] cwe.mitre, Common Weakness Enumeration
  • [6] Donahue J, 2014, PR MACH LEARN RES, V32
  • [7] Duan X, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P4665
  • [8] joern, Joern
  • [9] CEREBRO: Context-Aware Adaptive Fuzzing for Effective Vulnerability Detection
    Li, Yuekang
    Xue, Yinxing
    Chen, Hongxu
    Wu, Xiuheng
    Zhang, Cen
    Xie, Xiaofei
    Wang, Haijun
    Liu, Yang
    [J]. ESEC/FSE'2019: PROCEEDINGS OF THE 2019 27TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2019, : 533 - 544
  • [10] SySeVR: A Framework for Using Deep Learning to Detect Software Vulnerabilities
    Li, Zhen
    Zou, Deqing
    Xu, Shouhuai
    Jin, Hai
    Zhu, Yawei
    Chen, Zhaoxuan
    [J]. IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2022, 19 (04) : 2244 - 2258