DeepFL: Integrating Multiple Fault Diagnosis Dimensions for Deep Fault Localization

被引:202
作者
Li, Xia [1 ]
Li, Wei [2 ]
Zhang, Yuqun [2 ]
Zhang, Lingming [1 ]
机构
[1] UT Dallas, Richardson, TX 75080 USA
[2] SUSTech, Shenzhen, Guangdong, Peoples R China
来源
PROCEEDINGS OF THE 28TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS (ISSTA '19) | 2019年
基金
美国国家科学基金会;
关键词
Fault localization; Deep learning; Mutation testing; NETWORK; METRICS;
D O I
10.1145/3293882.3330574
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Learning-based fault localization has been intensively studied recently. Prior studies have shown that traditional Learning-to-Rank techniques can help precisely diagnose fault locations using various dimensions of fault-diagnosis features, such as suspiciousness values computed by various off-the-shelf fault localization techniques. However, with the increasing dimensions of features considered by advanced fault localization techniques, it can be quite challenging for the traditional Learning-to-Rank algorithms to automatically identify effective existing/latent features. In this work, we propose DeepFL, a deep learning approach to automatically learn the most effective existing/latent features for precise fault localization. Although the approach is general, in this work, we collect various suspiciousness-value-based, fault-proneness-based and textual-similarity-based features from the fault localization, defect prediction and information retrieval areas, respectively. DeepFL has been studied on 395 real bugs from the widely used Defects4J benchmark. The experimental results show DeepFL can significantly outperform state-of-the-art TraPT/FLUCCS (e.g., localizing 50+ more faults within Top-i). We also investigate the impacts of deep model configurations (e.g., loss functions and epoch settings) and features. Furthermore, DeepFL is also surprisingly effective for cross-project prediction.
引用
收藏
页码:169 / 180
页数:12
相关论文
共 90 条
[1]   On the accuracy of spectrum-based fault localization [J].
Abreu, Rui ;
Zoeteweij, Peter ;
van Gemund, Arjan J. C. .
TAIC PART 2007 - TESTING: ACADEMIC AND INDUSTRIAL CONFERENCE - PRACTICE AND RESEARCH TECHNIQUES, PROCEEDINGS: CO-LOCATED WITH MUTATION 2007, 2007, :89-+
[2]  
Abreu R, 2006, 12TH PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, P39
[3]  
[Anonymous], 2002, EC IMPACTS INADEQUAT
[4]  
[Anonymous], 2018, Asm java bytecode manipulation and analysis framework
[5]  
[Anonymous], 2016, EFFECT SIZE
[6]  
[Anonymous], 2001, Proceedings of ICSE 2001 Workshop on Software Visualization, Toronto, Ontario, Canada
[7]  
[Anonymous], 1980, Order No. 8025191
[8]  
[Anonymous], 2018, Pit mutation testing system
[9]  
[Anonymous], 2018, Java programming language agents
[10]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166