CGMBL: Combining GAN and Method Name for Bug Localization

被引:0
作者
Chen, Hao [1 ]
Yang, Haiyang [1 ]
Yan, Zilun [1 ]
Kuang, Li [1 ]
Zhang, Lingyan [1 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Hunan, Peoples R China
来源
2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS | 2022年
关键词
Bug Localization; Bug Reports; Adversarial Learning; Deep Learning; NEURAL-NETWORKS; RETRIEVAL;
D O I
10.1109/QRS57517.2022.00033
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Developers often need to locate buggy code files in the software quality maintenance process. Bug localization aims to automatically identify potentially buggy source code files from the project codes for developers based on the bug reports. Up to now, researchers have proposed many methods to advance this task. However, the early studies only focus on the accuracy of capturing text features or the efficiency of calculating relevance scores, which do not consider the semantic gap between bug reports in natural language and codes in programming language. In this paper, we propose a novel adversarial learning model to bridge the semantic gap. Due to the different characteristics of natural language and programming language, we propose two different representation models for bug reports and code files respectively, and regards the two representation models as the generators. Then we construct adversarial learning by adding a discriminator to distinguish the source of representations so that the model can learn the public features of different texts. In addition, method name is the summary of the code function, and the relevant method name often appears in the bug report. We consider the method name information according to whether the method name appears in the report. Our model can dynamically integrate the information to improve the model effect. We evaluate our model on three open-source java project datasets and compare it with four state-of-the-art methods. The experimental results show that our model outperforms the baseline models and has a significant improvement in evaluation metrics. Besides, we conduct ablation experiments to explain each module's contribution to the model.
引用
收藏
页码:231 / 241
页数:11
相关论文
共 30 条
  • [1] Bug Localization with Combination of Deep Learning and Information Retrieval
    An Ngoc Lam
    Anh Tuan Nguyen
    Hoan Anh Nguyen
    Nguyen, Tien N.
    [J]. 2017 IEEE/ACM 25TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC), 2017, : 218 - 229
  • [2] Combining Deep Learning with Information Retrieval to Localize Buggy Files for Bug Reports
    An Ngoc Lam
    Anh Tuan Nguyen
    Hoan Anh Nguyen
    Nguyen, Tien N.
    [J]. 2015 30TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2015, : 476 - 481
  • [3] Arong, 2014, PROCEEDINGS OF 2014 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), P51, DOI 10.1109/PIC.2014.6972294
  • [4] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [5] Davies S., 2012, 2012 19th Working Conference on Reverse Engineering (WCRE), P125, DOI 10.1109/WCRE.2012.22
  • [6] On the classification of bug reports to improve bug localization
    Fang, Fan
    Wu, John
    Li, Yanyan
    Ye, Xin
    Aljedaani, Wajdi
    Mkaouer, Mohamed Wiem
    [J]. SOFT COMPUTING, 2021, 25 (11) : 7307 - 7323
  • [7] Artificial neural networks (the multilayer perceptron) - A review of applications in the atmospheric sciences
    Gardner, MW
    Dorling, SR
    [J]. ATMOSPHERIC ENVIRONMENT, 1998, 32 (14-15) : 2627 - 2636
  • [8] On the Use of Relevance Feedback in IR-Based Concept Location
    Gay, Gregory
    Haiduc, Sonia
    Marcus, Andrian
    Menzies, Tim
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, CONFERENCE PROCEEDINGS, 2009, : 351 - +
  • [9] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
  • [10] Huo X., 2016, IJCAI, P1606