Enhancing Bug Localization Using Phase-Based Approach

被引:0
作者
Mohsen, Amr Mansour [1 ]
Hassan, Hesham A. [2 ]
Wassif, Khaled T. [2 ]
Moawad, Ramadan [1 ]
Makady, Soha H. [2 ]
机构
[1] Future Univ Egypt, Fac Comp & Informat Technol, Dept Comp Sci, Cairo 11835, Egypt
[2] Cairo Univ, Fac Comp & Artificial Intelligence, Dept Comp Sci, Cairo 11865, Egypt
关键词
Computer bugs; Source coding; Encoding; Location awareness; Task analysis; Codes; Data mining; Bug localization; bidirectional encoder representations from transformers (BERT);
D O I
10.1109/ACCESS.2023.3265731
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Software bug localization is an important step in the software maintenance process. Automatic bug localization can reduce the time consumed in the process of localization. Some techniques are applied in the bug localization process, but those techniques suffer from limitations in time and accuracy. This paper proposes a phase-based bug localization approach to overcome these limitations. The approach is composed of three main phases which are raw data preparation, package classification, and source code recommendation. The main input to our approach is a bug report and the source code of the past versions for the target system of interest. From the bug report, various information is utilized: the summary, the description, the stack traces, and the fixed source code files. The raw data preparation phase is used to restructure those inputs. The package classification phase aims to locate the package that would include the source code to be modified as a first step, hence reducing the time needed to locate the source code file due to the lexical mismatch between those files and the bug report data. Bidirectional Encoder Representations from Transformers (BERT), which is a sentence embedding technique, is utilized in the package classification and source code recommendation phases. The experimental results show that our approach outperforms existing approaches according to TOP-N rank and Mean Reciprocal Rank (MRR) evaluation metrics.
引用
收藏
页码:35901 / 35913
页数:13
相关论文
共 40 条
  • [1] Bug Localization with Combination of Deep Learning and Information Retrieval
    An Ngoc Lam
    Anh Tuan Nguyen
    Hoan Anh Nguyen
    Nguyen, Tien N.
    [J]. 2017 IEEE/ACM 25TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC), 2017, : 218 - 229
  • [2] Combining Deep Learning with Information Retrieval to Localize Buggy Files for Bug Reports
    An Ngoc Lam
    Anh Tuan Nguyen
    Hoan Anh Nguyen
    Nguyen, Tien N.
    [J]. 2015 30TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2015, : 476 - 481
  • [3] [Anonymous], 2008, P 2008 INT WORKING C
  • [4] Atwan J., 2021, SCI PROGRAMMING-NETH, P1
  • [5] A Similarity Integration Method based Information Retrieval and Word Embedding in Bug Localization
    Cheng, Shasha
    Yan, Xuefeng
    Khan, Arif Ali
    [J]. 2020 IEEE 20TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY (QRS 2020), 2020, : 180 - 187
  • [6] Arabic Sentiment Analysis Using BERT Model
    Chouikhi, Hasna
    Chniter, Hamza
    Jarray, Fethi
    [J]. ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2021), 2021, 1463 : 621 - 632
  • [7] Fast Changeset-based Bug Localization with BERT
    Ciborowska, Agnieszka
    Damevski, Kostadin
    [J]. 2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022), 2022, : 946 - 957
  • [8] Ciborowska Agnieszka, 2022, arXiv
  • [9] Das S., 2021, Social Netw. Comput. Sci., V2, P69, DOI DOI 10.1007/S42979-020-00427-1
  • [10] On the classification of bug reports to improve bug localization
    Fang, Fan
    Wu, John
    Li, Yanyan
    Ye, Xin
    Aljedaani, Wajdi
    Mkaouer, Mohamed Wiem
    [J]. SOFT COMPUTING, 2021, 25 (11) : 7307 - 7323