DENATURE: duplicate detection and type identification in open source bug repositories

Citations: 1
Authors
Chauhan, Ruby [1]
Sharma, Shakshi [2]
Goyal, Anjali [3]
Affiliations
[1] NorthCap Univ, Sect 23 A, Gurugram 122017, Haryana, India
[2] Univ Tartu, Tartu, Estonia
[3] Sharda Univ, Sch Engn & Technol, Dept Comp Sci & Engn, Greater Noida, India
Keywords
Bug tracking system; Bug reports; Duplicate detection; Bug type identification; Similarity measures; Classification; Information retrieval techniques; CLASSIFICATION; MODEL
DOI
10.1007/s13198-023-01855-x
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Software projects rely on bug tracking systems to guide software maintenance activities. Bug reports submitted to bug repositories carry the critical information about the nature of a crash. This information is free-form text submitted by users or developers. Bug repositories accumulate a large number of reports, and many of them merely duplicate existing bugs. Furthermore, not all non-duplicate bugs are reproducible. This paper introduces DENATURE, a two-step framework for detecting duplicate reports and identifying bug type. The proposed framework helps minimize the time and developer effort spent resolving bug reports, which in turn improves overall software quality. Information retrieval techniques are used to find duplicate bugs, and machine learning classification techniques are used to identify the type of a bug report. In experiments, the proposed framework achieved a prediction accuracy of up to 88.81%.
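The two-step flow described in the abstract can be illustrated with a minimal sketch. The abstract does not name the similarity measure, the classifier, or the bug type labels, so everything below is an assumption made for illustration: TF-IDF with cosine similarity stands in for the information retrieval step, a small multinomial Naive Bayes stands in for the classification step, and the `threshold` value, the function names, and the example labels are hypothetical rather than taken from the paper.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a report and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (a dict) per document."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    df = Counter(term for toks in tokenized for term in set(toks))
    # Smoothed IDF so that terms present in every document keep a nonzero weight.
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    return [{t: tf * idf[t] for t, tf in Counter(toks).items()} for toks in tokenized]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def find_duplicate(new_report, existing_reports, threshold=0.5):
    """Step 1 (assumed): index of the most similar existing report, or None."""
    vectors = tfidf_vectors(existing_reports + [new_report])
    query = vectors[-1]
    best_idx, best_sim = None, 0.0
    for i, vec in enumerate(vectors[:-1]):
        sim = cosine(query, vec)
        if sim > best_sim:
            best_idx, best_sim = i, sim
    # The threshold is a hypothetical cut-off, not a value from the paper.
    return best_idx if best_sim >= threshold else None


class NaiveBayes:
    """Step 2 (assumed): multinomial Naive Bayes with add-one smoothing."""

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.class_counts}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {t for wc in self.word_counts.values() for t in wc}
        return self

    def predict(self, text):
        total = sum(self.class_counts.values())
        scores = {}
        for c, n_c in self.class_counts.items():
            denom = sum(self.word_counts[c].values()) + len(self.vocab)
            score = math.log(n_c / total)
            for t in tokenize(text):
                if t in self.vocab:  # ignore words never seen in training
                    score += math.log((self.word_counts[c][t] + 1) / denom)
            scores[c] = score
        return max(scores, key=scores.get)


def triage(new_report, existing_reports, classifier, threshold=0.5):
    """Run both steps: flag a duplicate, otherwise predict the bug type."""
    dup = find_duplicate(new_report, existing_reports, threshold)
    if dup is not None:
        return ("duplicate", dup)
    return ("new", classifier.predict(new_report))
```

In this sketch a new report is first compared against every stored report, and only reports that fall below the similarity cut-off reach the classifier, which mirrors the ordering of the two steps in the abstract. A real system would likely use a tuned threshold and a richer feature set.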
Pages: S275-S292
Page count: 18