Sentence embedding and fine-tuning to automatically identify duplicate bugs

Authors
Isotani, Haruna [1]
Washizaki, Hironori [1]
Fukazawa, Yoshiaki [1]
Nomoto, Tsutomu [2]
Ouji, Saori [3]
Saito, Shinobu [3]
Affiliations
[1] Waseda Univ, Dept Comp Sci & Engn, Tokyo, Japan
[2] NTT CORP, Software Innovat Ctr, Tokyo, Japan
[3] NTT CORP, Comp & Data Sci Labs, Tokyo, Japan
Source
FRONTIERS IN COMPUTER SCIENCE | 2023 / Vol. 4
Keywords
bug reports; duplicate detection; BERT; sentence embedding; natural language processing; information retrieval;
DOI
10.3389/fcomp.2022.1032452
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Industrial software maintenance is critical but burdensome. Activities such as detecting duplicate bug reports are often performed manually. This paper proposes an automated duplicate bug report detection system to improve maintenance efficiency. The system uses vectorization of report contents and deep learning-based sentence embedding to calculate the similarity of whole reports from the vectors of their individual elements. Specifically, sentence embedding is realized by fine-tuning Sentence-BERT. The system's performance is compared experimentally with baseline methods to validate it. The proposed system detects duplicate bug reports more effectively than existing methods.
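The idea of computing whole-report similarity from the vectors of individual elements can be illustrated with a minimal sketch. This is not the authors' implementation: the field names (`title`, `description`), the equal weights, and the helper names are hypothetical, and a toy bag-of-words vector stands in for the fine-tuned Sentence-BERT encoder so that the example runs with the standard library alone.

```python
import math
from collections import Counter

def embed(text):
    """Toy stand-in for a sentence encoder (assumption): word-count vector."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def report_similarity(r1, r2, weights):
    """Weighted average of per-field similarities (hypothetical scheme)."""
    total = sum(weights.values())
    score = sum(
        w * cosine(embed(r1.get(field, "")), embed(r2.get(field, "")))
        for field, w in weights.items()
    )
    return score / total

# Hypothetical fields and weights, chosen only for illustration.
WEIGHTS = {"title": 0.5, "description": 0.5}

report_a = {"title": "app crashes on login",
            "description": "the app crashes when the user logs in"}
report_b = {"title": "crash on login screen",
            "description": "app crashes after the user logs in"}
report_c = {"title": "typo in settings page",
            "description": "label text is misspelled"}

sim_ab = report_similarity(report_a, report_b, WEIGHTS)
sim_ac = report_similarity(report_a, report_c, WEIGHTS)
print(f"a vs b: {sim_ab:.3f}, a vs c: {sim_ac:.3f}")
```

In a realistic setting, `embed` would be replaced by a sentence encoder returning dense vectors, and a candidate pair would be flagged as a likely duplicate when the combined score exceeds a threshold tuned on labeled data.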
Pages: 13