New Methodology for Contextual Features Usage in Duplicate Bug Reports Detection

被引:0
|
作者
Neysiani, Behzad Soleimani [1 ]
Babamir, Seyed Morteza [1 ]
机构
[1] Univ Kashan, Fac Comp & Elect Engn, Dept Software Engn, Kashan, Esfahan, Iran
来源
2019 5TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR) | 2019年
关键词
Information Retrieval; Natural Language Processing; Duplicate Detection; Bug Reports; Topic; Feature Expansion;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Duplicate bug report detection is one of the major problems in software triage systems like Bugzilla to deal with end user requests. User request contains some categorical and especially textual fields which need feature extraction for duplicate detection. Contextual and topical features are acquired using calculating cosine similarity between term frequency or inverse document frequency or BM25F technique from a pair of bug reports against some topics. This research proposes the individual Manhattan distance similarity approach instead of cosine distance similarity for every topic in contextual features to expand the feature dimension which can increase the accuracy of the duplicate bug report detection process. The four famous datasets of bug reports have used for evaluation of the proposed method including Android, Eclipse, Mozilla, and Open Office which the experimental results indicate performance improvement for four contextual features including general, cryptography, network, and Java topics.
引用
收藏
页码:178 / 183
页数:6
相关论文
共 44 条
  • [21] Duplicate Bug Report Detection: How Far Are We?
    Zhang, Ting
    Han, Donggyun
    Vinayakarao, Venkatesh
    Irsan, Ivana Clairine
    Xu, Bowen
    Thung, Ferdian
    Lo, David
    Jiang, Lingxiao
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (04)
  • [22] Exploring the Role of Automation in Duplicate Bug Report Detection: An Industrial Case Study
    Gotharsson, Malte
    Stahre, Karl
    Gay, Gregory
    Neto, Francisco Gomes de Oliveira
    PROCEEDINGS OF THE 2024 IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATION OF SOFTWARE TEST, AST 2024, 2024, : 193 - 203
  • [23] An Approach to Detecting Duplicate Bug Reports using Natural Language and Execution Information
    Wang, Xiaoyin
    Zhang, Lu
    Xie, Tao
    Anvik, John
    Sun, Jiasu
    ICSE'08 PROCEEDINGS OF THE THIRTIETH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, 2008, : 461 - 470
  • [24] A comparative study of the performance of IR models on duplicate bug detection
    Kaushik, Nilam
    Tahvildari, Ladan
    2012 16TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR), 2012, : 159 - 168
  • [25] Duplicate Bug Report Detection with a Combination of Information Retrieval and Topic Modeling
    Anh Tuan Nguyen
    Tung Thanh Nguyen
    Nguyen, Tien N.
    Lo, David
    Sun, Chengnian
    2012 PROCEEDINGS OF THE 27TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2012, : 70 - 79
  • [26] Duplicate Bug Report Detection and Classification System Based on Deep Learning Technique
    Kukkar, Ashima
    Mohana, Rajni
    Kumar, Yugal
    Nayyar, Anand
    Bilal, Muhammad
    Kwak, Kyung-Sup
    IEEE ACCESS, 2020, 8 (08): : 200749 - 200763
  • [27] Does Deep Learning improve the performance of duplicate bug report detection? An empirical study?
    Jiang, Yuan
    Su, Xiaohong
    Treude, Christoph
    Shang, Chao
    Wang, Tiantian
    JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 198
  • [28] It Takes Two to TANGO: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports
    Cooper, Nathan
    Bernal-Cardenas, Carlos
    Chaparro, Oscar
    Moran, Kevin
    Poshyvanyk, Denys
    2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2021), 2021, : 957 - 969
  • [29] Automatically Identifying Security Bug Reports via Multitype Features Analysis
    Zou, Deqing
    Deng, Zhijun
    Li, Zhen
    Jin, Hai
    INFORMATION SECURITY AND PRIVACY, 2018, 10946 : 619 - 633
  • [30] New labeled dataset of interconnected lexical typos for automatic correction in the bug reports
    Neysiani, Behzad Soleimani
    Babamir, Seyed Morteza
    SN APPLIED SCIENCES, 2019, 1 (11):