SbrPBert: A BERT-Based Model for Accurate Security Bug Report Prediction

被引:1
作者
Cao, Xudong [1 ]
liu, Tianwei [2 ]
Zhang, Jianyuan [3 ]
Feng, Mengyue [1 ]
Zhang, Xin [4 ]
Cao, Wanying [1 ]
Sun, Hongyu [2 ]
Zhang, Yuqing [1 ]
机构
[1] Univ Chinese Acad Sci, Natl Comp Network Intrus Protect Ctr, Beijing, Peoples R China
[2] Xidian Univ, Sch Cyber Engn, Xian, Peoples R China
[3] Lanzhou Univ Technol, Sch Comp & Commun, Lanzhou, Peoples R China
[4] Sch Cyberspace Secur, Xian Univ Posts & Telecommun, Xian, Peoples R China
来源
52ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOP VOLUME (DSN-W 2022) | 2022年
基金
中国国家自然科学基金;
关键词
deep learning; Bert; security bug report; vulnerability;
D O I
10.1109/DSN-W54100.2022.00030
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Bidirectional Encoder Representation from Transformers (Bert) has achieved impressive performance in several Natural Language Processing (NLP) tasks. However, there has been limited investigation on its adaptation guidelines in specialized fields. Here we focus on the software security domain. Early identification of security-related reports in software bug reports is one of the essential means to prevent security accidents. However, the prediction of security bug reports (SBRs) is limited by the scarcity and imbalance of samples in this field and the complex characteristics of SBRs. So motivated, we constructed the largest dataset in this field and proposed a Security Bug Report Prediction Model Based on Bert (SbrPBert). By introducing a layer-based learning rate attenuation strategy and a fine-tuning method for freezing some layers, our model outperforms the baseline model on both our dataset and other small-sample datasets. This means the practical value of the model in BUG tracking systems or projects that lack samples. Moreover, our model has detected 56 hidden vulnerabilities through deployment on the Mozilla and RedHat projects so far.
引用
收藏
页码:129 / 134
页数:6
相关论文
共 50 条
  • [41] A Disease-Prediction Protocol Integrating Triage Priority and BERT-Based Transfer Learning for Intelligent Triage
    Wang, Boran
    Gao, Zhuliang
    Lin, Zhikang
    Wang, Rui
    BIOENGINEERING-BASEL, 2023, 10 (04):
  • [42] A BERT-Based Hybrid Short Text Classification Model Incorporating CNN and Attention-Based BiGRU
    Bao, Tong
    Ren, Ni
    Luo, Rui
    Wang, Baojia
    Shen, Gengyu
    Guo, Ting
    JOURNAL OF ORGANIZATIONAL AND END USER COMPUTING, 2021, 33 (06)
  • [43] BERT-Based Joint Model for Aspect Term Extraction and Aspect Polarity Detection in Arabic Text
    Chouikhi, Hasna
    Alsuhaibani, Mohammed
    Jarray, Fethi
    ELECTRONICS, 2023, 12 (03)
  • [44] Enhancing Sentiment Analysis for Chinese Texts Using a BERT-Based Model with a Custom Attention Mechanism
    Ding, Linlin
    Han, Yiming
    Li, Mo
    Li, Dong
    WEB INFORMATION SYSTEMS AND APPLICATIONS, WISA 2024, 2024, 14883 : 172 - 179
  • [45] A UNIVERSAL BERT-BASED FRONT-END MODEL FOR MANDARIN TEXT-TO-SPEECH SYNTHESIS
    Bai, Zilong
    Hu, Beibei
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6074 - 6078
  • [46] On Cognitive Level Classification of Assessment-items Using Pre-trained BERT-based Model
    Dipto, Adnan Saif
    Limon, Md. Mahmudur Rahman
    Tuba, Fatima Tanjum
    Uddin, Md Mohsin
    Khan, M. Saddam Hossain
    Tuhin, Rashedul Amin
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2023, 2023, : 245 - 251
  • [47] A BERT-based Intent Recognition and Slot Filling Joint Model for Air Traffic Control Instruction Understanding
    Deng, Qihan
    Yang, Yang
    Zhang, Xiaoxiao
    Qian, Shengsheng
    Zhang, Minghua
    Cai, Kaiquan
    2023 IEEE/AIAA 42ND DIGITAL AVIONICS SYSTEMS CONFERENCE, DASC, 2023,
  • [48] A BERT-Based Generation Model to Transform Medical Texts to SQL Queries for Electronic Medical Records: Model Development and Validation
    Pan, Youcheng
    Wang, Chenghao
    Hu, Baotian
    Xiang, Yang
    Wang, Xiaolong
    Chen, Qingcai
    Chen, Junjie
    Du, Jingcheng
    JMIR MEDICAL INFORMATICS, 2021, 9 (12)
  • [49] BERT-LSTM network prediction model based on Transformer
    Guo, Jiachen
    Liu, Jun
    Yang, Chenxi
    Dong, Jianguo
    Wang, Zhengyi
    Dong Shijian
    PROCEEDINGS OF THE 36TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC 2024, 2024, : 3098 - 3103
  • [50] BERT-siRNA: siRNA target prediction based on BERT pre-trained interpretable model
    Xu, Jiayu
    Xu, Nan
    Xie, Weixin
    Zhao, Chengkui
    Yu, Lei
    Feng, Weixing
    GENE, 2024, 910