Large language models based vulnerability detection: How does it enhance performance?

被引：0

作者：

Xuan, Cho Do ^{[1
]}

Quang, Dat Bui ^{[2
]}

Quang, Vinh Dang ^{[1
]}

机构：

[1] Posts & Telecommun Inst Technol, Dept Informat Secur, Hanoi, Vietnam

[2] Posts & Telecommun Inst Technol, Dept Informat Technol, Hanoi, Vietnam

来源：

INTERNATIONAL JOURNAL OF INFORMATION SECURITY | 2025年 / 24卷 / 01期

关键词：

Software vulnerability detection; Embedding code; Transformer encoder; Large language models; Ensemble learning; FRAMEWORK;

D O I：

10.1007/s10207-025-00983-8

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Detecting vulnerabilities in C/C + + source code has become a critical challenge in information security, especially as the growing number and severity of new vulnerabilities increasingly impact organizations. In this context, Large Language Models (LLMs) have emerged as a promising approach; however, building a model capable of effectively predicting and classifying various types of vulnerabilities from diverse datasets remains a complex problem, demanding innovative and comprehensive solutions. Our research proposes a breakthrough approach by developing the FG-CVD ensemble learning model, an advanced architecture that combines code embedding techniques and Knowledge Argument to enhance feature representation and the ability to learn complex relationships within source code. These improvements are specifically designed on the foundation of code embedding and the Transformer architecture of LLMs to boost the detection and classification of sophisticated vulnerability patterns. To evaluate the model's effectiveness, we conducted extensive experiments on four representative datasets: Reveal, BigVul, RealVul, and FFMQ + QEmu. The experimental results demonstrated FG-CVD's superior performance with an average accuracy of 85%, a prediction precision of 43%, a recall of 65%, and an F1-score of 47%. Notably, the model exhibited flexible adaptability to datasets with different structures and efficiently addressed data imbalance between labels. Moreover, through rigorous cross-dataset testing, the model showcased strong generalization capabilities and high stability, underscoring not only the academic value of the approach but also its practical potential, outperforming traditional approaches across a range of metrics and experimental scenarios.

引用

页数：18

共 60 条

[1] On Hardware Security Bug Code Fixes by Prompting Large Language Models [J].

Ahmad, Baleegh ;

Thakur, Shailja ;

Tan, Benjamin ;

Karri, Ramesh ;

Pearce, Hammond .

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 :4043-4057

[2]

[Anonymous], 2024, Stubborn Weaknesses in the CWE Top 25

[3]

blackduck, Open Source Security and Risk Analysis Report (OSSRA) Synopsys

[4] Intrusion Detection-Based Ensemble Learning and Microservices for Zero Touch Networks [J].

Bugshan, Neda ;

Khalil, Ibrahim ;

Kalapaaking, Aditya Pribadi ;

Atiquzzaman, Mohammed .

IEEE COMMUNICATIONS MAGAZINE, 2023, 61 (06) :86-92

[5]

Bui Van-Cong, 2023, 2023 RIVF International Conference on Computing and Communication Technologies (RIVF), P112, DOI 10.1109/RIVF60135.2023.10471834

[6] SCcheck: A Novel Graph-Driven and Attention- Enabled Smart Contract Vulnerability Detection Framework for Web 3.0 Ecosystem [J].

Cao, Yuanlong ;

Jiang, Fan ;

Xiao, Jianmao ;

Chen, Shaolong ;

Shao, Xun ;

Wu, Celimuge .

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (05) :4007-4019

[7] Revisiting the Performance of Deep Learning-Based Vulnerability Detection on Realistic Datasets [J].

Chakraborty, Partha ;

Arumugam, Krishna Kanth ;

Alfadel, Mahmoud ;

Nagappan, Meiyappan ;

McIntosh, Shane .

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (08) :2163-2177

[8] NatGen: Generative Pre-training by "Naturalizing" Source Code [J].

Chakraborty, Saikat ;

Ahmed, Toufique ;

Ding, Yangruibo ;

Devanbu, Premkumar T. ;

Ray, Baishakhi .

PROCEEDINGS OF THE 30TH ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2022, 2022, :18-30

[9] Deep Learning Based Vulnerability Detection: Are We There Yet? [J].

Chakraborty, Saikat ;

Krishna, Rahul ;

Ding, Yangruibo ;

Ray, Baishakhi .

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (09) :3280-3296

[10] DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection [J].

Chen, Yizheng ;

Ding, Zhoujie ;

Alowain, Lamya ;

Chen, Xinyun ;

Wagner, David .

PROCEEDINGS OF THE 26TH INTERNATIONAL SYMPOSIUM ON RESEARCH IN ATTACKS, INTRUSIONS AND DEFENSES, RAID 2023, 2023, :654-668

← 1 2 3 4 5 6 →