SbrPBert: A BERT-Based Model for Accurate Security Bug Report Prediction

被引:1
|
作者
Cao, Xudong [1 ]
liu, Tianwei [2 ]
Zhang, Jianyuan [3 ]
Feng, Mengyue [1 ]
Zhang, Xin [4 ]
Cao, Wanying [1 ]
Sun, Hongyu [2 ]
Zhang, Yuqing [1 ]
机构
[1] Univ Chinese Acad Sci, Natl Comp Network Intrus Protect Ctr, Beijing, Peoples R China
[2] Xidian Univ, Sch Cyber Engn, Xian, Peoples R China
[3] Lanzhou Univ Technol, Sch Comp & Commun, Lanzhou, Peoples R China
[4] Sch Cyberspace Secur, Xian Univ Posts & Telecommun, Xian, Peoples R China
来源
52ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOP VOLUME (DSN-W 2022) | 2022年
基金
中国国家自然科学基金;
关键词
deep learning; Bert; security bug report; vulnerability;
D O I
10.1109/DSN-W54100.2022.00030
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Bidirectional Encoder Representation from Transformers (Bert) has achieved impressive performance in several Natural Language Processing (NLP) tasks. However, there has been limited investigation on its adaptation guidelines in specialized fields. Here we focus on the software security domain. Early identification of security-related reports in software bug reports is one of the essential means to prevent security accidents. However, the prediction of security bug reports (SBRs) is limited by the scarcity and imbalance of samples in this field and the complex characteristics of SBRs. So motivated, we constructed the largest dataset in this field and proposed a Security Bug Report Prediction Model Based on Bert (SbrPBert). By introducing a layer-based learning rate attenuation strategy and a fine-tuning method for freezing some layers, our model outperforms the baseline model on both our dataset and other small-sample datasets. This means the practical value of the model in BUG tracking systems or projects that lack samples. Moreover, our model has detected 56 hidden vulnerabilities through deployment on the Mozilla and RedHat projects so far.
引用
收藏
页码:129 / 134
页数:6
相关论文
共 50 条
  • [21] BERT-Based Deep Spatial-Temporal Network for Taxi Demand Prediction
    Cao, Dun
    Zeng, Kai
    Wang, Jin
    Sharma, Pradip Kumar
    Ma, Xiaomin
    Liu, Yonghe
    Zhou, Siyuan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 9442 - 9454
  • [22] Assessing a BERT-based model for analyzing subjectivity and classifying academic articles
    Mehmood A.
    Shahid F.
    Khan R.
    Ahmed S.
    Ibrahim M.M.
    Zheng Z.
    Multimedia Tools and Applications, 2024, 83 (42) : 90511 - 90532
  • [23] CASBERT: BERT-based retrieval for compositely annotated biosimulation model entities
    Munarko, Yuda
    Rampadarath, Anand
    Nickerson, David P.
    FRONTIERS IN BIOINFORMATICS, 2023, 3
  • [24] AMP-BERT: Prediction of antimicrobial peptide function based on a BERT model
    Lee, Hansol
    Lee, Songyeon
    Lee, Ingoo
    Nam, Hojung
    PROTEIN SCIENCE, 2023, 32 (01)
  • [25] A BERT-Based Framework for Automated Extraction of Behavioral Indicators of Compromise from Security Incident Reports
    Bekhouche, Mohamed El Amine
    Adi, Kamel
    FOUNDATIONS AND PRACTICE OF SECURITY, PT I, FPS 2023, 2024, 14551 : 219 - 232
  • [26] Rumor detection using BERT-based social circle and interaction network model
    Thirumoorthy, K.
    Britto, J. Jerold John
    Haripriya, P.
    Shreenee, N.
    SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
  • [27] A BERT-Based Model to Analyse Disaster’s Data for Efficient Resource Management
    Sonu Lamba
    Pranav Vidyarthi
    Mudit Aggarwal
    Priyanshi Gangawar
    Snehita Mulapalli
    SN Computer Science, 6 (2)
  • [28] The Impact of Combining Arabic Sarcasm Detection Datasets On The Performance Of BERT-based Model
    Obeidat, Rasha
    Bashayreh, Amjad
    Younis, Lojin Bani
    2022 13TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2022, : 22 - 29
  • [29] BERT-based Transfer Learning Model for COVID-19 Sentiment Analysis on Turkish Instagram Comments
    Karayigit, Habibe
    Akdagli, Ali
    Aci, Cigdem Inan
    INFORMATION TECHNOLOGY AND CONTROL, 2022, 51 (03): : 409 - 428
  • [30] Empirical Studies on Deep-learning-based Security Bug Report Prediction Methods
    Zheng W.
    Chen J.-Z.
    Wu X.-X.
    Chen X.
    Xia X.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (05): : 1294 - 1313