LineVul: A Transformer-based Line-Level Vulnerability Prediction

被引：168

作者：

Fu, Michael ^{[1
]}

Tantithamthavorn, Chakkrit ^{[1
]}

机构：

[1] Monash Univ, Clayton, Vic, Australia

来源：

2022 MINING SOFTWARE REPOSITORIES CONFERENCE (MSR 2022) | 2022年

基金：

澳大利亚研究理事会;

关键词：

D O I：

10.1145/3524842.3528452

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Software vulnerabilities are prevalent in software systems, causing a variety of problems including deadlock, information loss, or system failures. Thus, early predictions of software vulnerabilities are critically important in safety-critical software systems. Various ML/DL-based approaches have been proposed to predict vulnerabilities at the file/function/method level. Recently, IVDetect (a graph-based neural network) is proposed to predict vulnerabilities at the function level. Yet, the IVDetect approach is still inaccurate and coarse-grained. In this paper, we propose LINEVUL, a Transformer-based line-level vulnerability prediction approach in order to address several limitations of the state-of-the-art IVDetect approach. Through an empirical evaluation of a large-scale real-world dataset with 188k+ C/C++ functions, we show that LINEVUL achieves (1) 160%-379% higher F1-measure for function-level predictions; (2) 12%-25% higher Top-10 Accuracy for line-level predictions; and (3) 29%-53% less Effort@20%Recall than the baseline approaches, highlighting the significant advancement of LINEVUL towards more accurate and more cost-effective line-level vulnerability predictions. Our additional analysis also shows that our LINEVUL is also very accurate (75%-100%) for predicting vulnerable functions affected by the Top-25 most dangerous CWEs, highlighting the potential impact of our LINEVUL in real-world usage scenarios.

引用

页码：608 / 620

页数：13

共 64 条

[1]

Ancona Marco, 2018, 6 INT C LEARNING REP

[2]

[Anonymous], Microsoft Exchange Flaw: Attacks Surge After Code Published

[3]

[Anonymous], COST CYBERCRIME

[4]

[Anonymous], CYBERCRIME COST WORL

[5]

[Anonymous], 2011, BIGLEARN NIPS WORKSH

[6]

[Anonymous], IVDECTECT REPLICATIO

[7]

Chakraborty Saikat, 2021, IEEE T SOFTWARE ENG

[8]

checkmarx, About us

[9]

CppCheck, About us

[10]

cwe, CWE787

← 1 2 3 4 5 6 7 →