Locating Relevant Source Files for Bug Reports using Textual Analysis

被引:0
|
作者
Gharibi, Reza [1 ]
Rasekh, Amir Hossein [1 ]
Sadreddini, Mohammad Hadi [1 ]
机构
[1] Shiraz Univ, Dept Comp Sci & Engn & IT, Shiraz, Iran
来源
2017 18TH CSI INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING CONFERENCE (CSSE) | 2017年
关键词
bug localization; information retrieval; bug report; classification; textual analysis; LOCALIZATION; RETRIEVAL; CODE;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Bug reports are an important part of software project's life cycle since they help improve the software's quality. However, in well-known systems, the huge number of bug reports make it difficult for the developer team to efficiently locate the bug and then assign it to be fixed. To solve this issue, various bug localization techniques have been proposed to rank all the source files of a project with respect to how likely they are to contain a bug. This makes the source files' search space smaller and helps developers to find relevant source files quicker. In this paper, we propose a three component bug localization approach which leverages different textual properties of bug reports and source files as well as the relations between previously fixed bug reports and a newly received one. Our approach uses information retrieval, textual matching, and multi-label classification to improve the performance of bug localization. We evaluate our approach on two open source software projects (i.e. SWT and ZXing) to examine its performance. Experimental results show that our approach can rank appropriate source files for more than 80% of bugs in top 10 for these projects and also improve the MRR and MAP values compared to two existing bug localization tools, BugLocator and BLUiR.
引用
收藏
页码:67 / 72
页数:6
相关论文
共 50 条
  • [41] Identification of Security related Bug Reports via Text Mining using Supervised and Unsupervised Classification
    Goseva-Popstojanova, Katerina
    Tyo, Jacob
    2018 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2018), 2018, : 344 - 355
  • [42] Fast Detection of Duplicate Bug Reports using LDA-based Topic Modeling and Classification
    Akilan, Thangarajah
    Shah, Dhruvit
    Patel, Nishi
    Mehta, Rinkal
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 1622 - 1629
  • [43] Locating Small Cells Using Geo-located UE Measurement Reports & RF Fingerprinting
    Joyce, Robert
    Zhang, Li
    2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2015, : 3275 - 3280
  • [44] Increasing the utility of performance audit reports: Using textual analytics tools to improve government reporting
    Duan, Huijue Kelly
    Hu, Hanxin
    Yoon, Yangin
    Vasarhelyi, Miklos
    INTELLIGENT SYSTEMS IN ACCOUNTING FINANCE & MANAGEMENT, 2022, 29 (04): : 201 - 218
  • [45] Conciseness, Financial Disclosure, and Market Reaction: A Textual Analysis of Annual Reports in Listed Chinese Companies
    Alduais, Fahd
    Almasria, Nashat Ali
    Samara, Abeer
    Masadeh, Ali
    INTERNATIONAL JOURNAL OF FINANCIAL STUDIES, 2022, 10 (04):
  • [46] Locating Partial Discharge Source Occurring on Transformer Bushing by Using the Improved TDOA Method
    Tian, Ye
    Qi, Bo
    Zhuo, Ran
    Fu, Mingli
    Li, Chengrong
    2016 INTERNATIONAL CONFERENCE ON CONDITION MONITORING AND DIAGNOSIS (CMD), 2016, : 144 - 147
  • [47] A Digital Collection Study and Framework Exploration - Applying Textual Analysis on Source Code Collection
    Charoenwet, Wachiraphan
    2018 3RD DIGITAL HERITAGE INTERNATIONAL CONGRESS (DIGITALHERITAGE) HELD JOINTLY WITH 2018 24TH INTERNATIONAL CONFERENCE ON VIRTUAL SYSTEMS & MULTIMEDIA (VSMM 2018), 2018, : 15 - 22
  • [48] Policy making in the financial industry: A framework for regulatory impact analysis using textual analysis
    Clapham B.
    Bender M.
    Lausen J.
    Gomber P.
    Journal of Business Economics, 2023, 93 (9) : 1463 - 1514
  • [49] Recovering test-to-code traceability using slicing and textual analysis
    Qusef, Abdallah
    Bavota, Gabriele
    Oliveto, Rocco
    De Lucia, Andrea
    Binkley, Dave
    JOURNAL OF SYSTEMS AND SOFTWARE, 2014, 88 : 147 - 168
  • [50] Utilizing source code syntax patterns to detect bug inducing commits using machine learning models
    Md Nadim
    Banani Roy
    Software Quality Journal, 2023, 31 : 775 - 807