A Multidocument Summarization Technique for Informative Bug Summaries

被引：0

作者：

Mukhtar, Samal ^{[1
]}

Lee, Seonah ^{[1
,2
]}

Heo, Jueun ^{[1
]}

机构：

[1] Gyeongsang Natl Univ, Dept AI Convergence Engn, Jinju Si 52828, South Korea

[2] Gyeongsang Natl Univ, Dept Software Engn, Jinju Si 52828, South Korea

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

新加坡国家研究基金会;

关键词：

Computer bugs; Support vector machines; Vectors; Semantics; Mathematical models; Transformers; Training data; Tokenization; Source coding; Software development management; Bug report summarization; classification; combination; bug summaries;

D O I：

10.1109/ACCESS.2024.3487443

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To help developers grasp bug information, bug summaries should contain bug descriptions and information on the reproduction steps, environment, and solutions to be informative for developers. However, previously established bug report summarization techniques generate bug summaries mainly by identifying significant sentences, which often miss those bug report attributes. In this paper, we aim to generate informative summaries that cover these specific bug report attributes in a structured form. There are two challenges. First, the relevant information is sometimes scattered over multiple sources. Second, information on the reproduction steps and environment is often filtered out by previous techniques, which identify significant sentences on the basis of their relationships. Therefore, we propose a bug summarization technique that collects information from multiple sources, including duplicates and pull requests, and a classification technique for identifying sentences that provide relevant information on the reproduction steps and environment. Our proposed technique, ClaSum, consists of four steps: preprocessing, classification, sentence ranking, and summarization. We adopted RoBERTa for our classification step, Opinion and Topic association scores for the sentence ranking step, and BART for the summarization step. Our comparative experiments show that our technique outperforms the state-of-the-art technique BugSum in terms of the F1 score by 14%, 8%, and 35% on the SDS, ADS, and DDS datasets, respectively. Additionally, our qualitative investigation shows that our technique generates a more structural summary than two well-known LLMs, Gemini and Claude.

引用

页码：158908 / 158926

页数：19

共 42 条

[1] Anil R., 2023, arXiv, DOI DOI 10.48550/ARXIV.2312.11805
[2] [Anonymous], 2004, ANN M ASS COMPUTATIO
[3] Anthropic, 2024, Claude-3 Model Card, V1, P1
[4] Brown TB, 2020, Arxiv, DOI [arXiv:2005.14165, 10.48550/arXiv.2005.14165, DOI 10.48550/ARXIV.2005.14165]
[5] Carbonell J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P335, DOI 10.1145/290941.291025
[6] A comprehensive survey on support vector machine classification: Applications, challenges and trends
Cervantes, Jair
Garcia-Lamont, Farid
Rodriguez-Mazahua, Lisbeth
Lopez, Asdrubal
[J]. NEUROCOMPUTING, 2020, 408 : 189 - 215
[7] Chowdhery A, 2023, J MACH LEARN RES, V24
[8] Engel D., 2009, P 9 SIAM INT C DAT M, P1
[9] Chaff from the Wheat: Characterizing and Determining Valid Bug Reports
Fan, Yuanrui
Xia, Xin
Lo, David
Hassan, Ahmed E.
[J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2020, 46 (05) : 495 - 525
[10] An approach to generate the bug report summaries using two-level feature extraction
Gupta, Som
Gupta, Sanjai Kumar
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176

← 1 2 3 4 5 →