SbrPBert: A BERT-Based Model for Accurate Security Bug Report Prediction

Cited by: 1
Authors
Cao, Xudong [1]
Liu, Tianwei [2]
Zhang, Jianyuan [3]
Feng, Mengyue [1]
Zhang, Xin [4]
Cao, Wanying [1]
Sun, Hongyu [2]
Zhang, Yuqing [1]
Affiliations
[1] Univ Chinese Acad Sci, Natl Comp Network Intrus Protect Ctr, Beijing, Peoples R China
[2] Xidian Univ, Sch Cyber Engn, Xian, Peoples R China
[3] Lanzhou Univ Technol, Sch Comp & Commun, Lanzhou, Peoples R China
[4] Sch Cyberspace Secur, Xian Univ Posts & Telecommun, Xian, Peoples R China
Source
52ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOP VOLUME (DSN-W 2022) | 2022
Funding
National Natural Science Foundation of China
Keywords
deep learning; BERT; security bug report; vulnerability
DOI
10.1109/DSN-W54100.2022.00030
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Bidirectional Encoder Representations from Transformers (BERT) has achieved impressive performance in several Natural Language Processing (NLP) tasks. However, there has been limited investigation of how to adapt it to specialized fields. Here we focus on the software security domain. Early identification of security-related reports among software bug reports is one of the essential means of preventing security incidents. However, the prediction of security bug reports (SBRs) is limited by the scarcity and imbalance of samples in this field and by the complex characteristics of SBRs. Motivated by this, we constructed the largest dataset in this field and proposed a Security Bug Report Prediction Model Based on BERT (SbrPBert). By introducing a layer-based learning rate attenuation strategy and a fine-tuning method that freezes some layers, our model outperforms the baseline model on both our dataset and other small-sample datasets. This demonstrates the practical value of the model for bug tracking systems and for projects that lack samples. Moreover, our model has so far detected 56 hidden vulnerabilities through deployment on the Mozilla and RedHat projects.
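The abstract names two fine-tuning ideas: a layer-based learning rate attenuation strategy and the freezing of some layers. The abstract does not give the paper's actual schedule or hyperparameters, so the following is only a minimal sketch of how such a scheme is commonly set up. The function name, the 12-layer encoder, the base rate of 2e-5, the attenuation factor of 0.95, and the choice to freeze the four lowest layers are all assumptions made for illustration.

```python
def layerwise_learning_rates(num_layers, base_lr, decay, num_frozen):
    """Return one learning rate per encoder layer (index 0 = lowest layer).

    Hypothetical helper, not taken from the paper:
    - the top layer trains at base_lr;
    - each layer below it is attenuated by one more factor of `decay`;
    - the lowest `num_frozen` layers get 0.0, i.e. they are frozen.
    """
    if not 0 <= num_frozen <= num_layers:
        raise ValueError("num_frozen must be between 0 and num_layers")
    rates = []
    for layer in range(num_layers):
        if layer < num_frozen:
            rates.append(0.0)  # frozen layer: no updates
        else:
            depth_from_top = num_layers - 1 - layer
            rates.append(base_lr * decay ** depth_from_top)
    return rates


# Assumed values for illustration only.
rates = layerwise_learning_rates(num_layers=12, base_lr=2e-5, decay=0.95, num_frozen=4)
for index, rate in enumerate(rates):
    status = "frozen" if rate == 0.0 else f"lr={rate:.3e}"
    print(f"layer {index:2d}: {status}")
```

In a deep learning framework, a list like `rates` would typically be turned into per-layer optimizer parameter groups, with the frozen layers excluded from gradient computation. The intuition is that lower layers hold more general language features and need smaller updates than the task-specific upper layers, which matters most when labeled samples are scarce.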
Pages: 129-134
Page count: 6