Sentence embedding and fine-tuning to automatically identify duplicate bugs

被引:0
|
作者
Isotani, Haruna [1 ]
Washizaki, Hironori [1 ]
Fukazawa, Yoshiaki [1 ]
Nomoto, Tsutomu [2 ]
Ouji, Saori [3 ]
Saito, Shinobu [3 ]
机构
[1] Waseda Univ, Dept Comp Sci & Engn, Tokyo, Japan
[2] NTT CORP, Software Innovat Ctr, Tokyo, Japan
[3] NTT CORP, Comp & Data Sci Labs, Tokyo, Japan
来源
FRONTIERS IN COMPUTER SCIENCE | 2023年 / 4卷
关键词
bug reports; duplicate detection; BERT; sentence embedding; natural language processing; information retrieval;
D O I
10.3389/fcomp.2022.1032452
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Industrial software maintenance is critical but burdensome. Activities such as detecting duplicate bug reports are often performed manually. Herein an automated duplicate bug report detection system improves maintenance efficiency using vectorization of the contents and deep learning-based sentence embedding to calculate the similarity of the whole report from vectors of individual elements. Specifically, sentence embedding is realized using Sentence-BERT fine tuning. Additionally, its performance is experimentally compared to baseline methods to validate the proposed system. The proposed system detects duplicate bug reports more effectively than existing methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Duplicate Bug Report Detection by Using Sentence Embedding and Fine-tuning
    Isotani, Haruna
    Washizaki, Hironori
    Fukazawa, Yoshiaki
    Nomoto, Tsutomu
    Ouji, Saori
    Saito, Shinobu
    2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2021), 2021, : 535 - 544
  • [2] Unveiling Key Aspects of Fine-Tuning in Sentence Embeddings: A Representation Rank Analysis
    Jung, Euna
    Kim, Jaeill
    Ko, Jungmin
    Park, Jinwoo
    Rhee, Wonjong
    IEEE ACCESS, 2024, 12 : 159877 - 159888
  • [3] The Impact of Fine-Tuning in Embedding Models on Query-Dependent Retrieval Performance
    Alkan, Berkin
    Ozcan, Onur
    Karatas, Yahya Bahachr
    Unal, Muhammed Cihat
    Kolukisa, Oguz
    Karakaya, Ismail
    Karamanlioglu, Alper
    Demirel, Berkan
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [4] Knowledge-based BERT word embedding fine-tuning for emotion recognition
    Zhu, Zixiao
    Mao, Kezhi
    NEUROCOMPUTING, 2023, 552
  • [5] Fine-tuning ChatGPT for automatic scoring
    Latif E.
    Zhai X.
    Computers and Education: Artificial Intelligence, 2024, 6
  • [6] Fine-Tuning BERT Model for Materials Named Entity Recognition
    Zhao, Xintong
    Greenberg, Jane
    An, Yuan
    Hu, Xiaohua Tony
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3717 - 3720
  • [7] Emerging trends: A gentle introduction to fine-tuning
    Church, Kenneth Ward
    Chen, Zeyu
    Ma, Yanjun
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (06) : 763 - 778
  • [8] Transfer fine-tuning of BERT with phrasal paraphrases
    Arase, Yuki
    Tsujii, Junichi
    COMPUTER SPEECH AND LANGUAGE, 2021, 66
  • [9] SPEECH RECOGNITION BY SIMPLY FINE-TUNING BERT
    Huang, Wen-Chin
    Wu, Chia-Hua
    Luo, Shang-Bao
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Toda, Tomoki
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7343 - 7347
  • [10] Efficient Fine-Tuning of BERT Models on the Edge
    Vucetic, Danilo
    Tayaranian, Mohammadreza
    Ziaeefard, Maryam
    Clark, James J.
    Meyer, Brett H.
    Gross, Warren J.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 1838 - 1842