Variable-based Fault Localization via Enhanced Decision Tree

被引:5
作者
Jiang, Jiajun [1 ]
Wang, Yumeng [1 ]
Chen, Junjie [1 ]
Lv, Delin [1 ]
Liu, Mengjiao [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
Fault localization; program debugging; decision tree;
D O I
10.1145/3624741
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Fault localization, aiming at localizing the root cause of the bug under repair, has been a longstanding research topic. Although many approaches have been proposed in past decades, most of the existing studies work at coarse-grained statement or method levels with very limited insights about how to repair the bug (granularity problem), but few studies target the finer-grained fault localization. In this article, we target the granularity problem and propose a novel finer-grained variable-level fault localization technique. Specifically, the basic idea of our approach is that fault-relevant variables may exhibit different values in failed and passed test runs, and variables that have higher discrimination ability have a larger possibility to be the root causes of the failure. Based on this, we propose a program-dependency-enhanced decision tree model to boost the identification of fault-relevant variables via discriminating failed and passed test cases based on the variable values. To evaluate the effectiveness of our approach, we have implemented it in a tool called VarDT and conducted an extensive study over the Defects4J benchmark. The results show that VarDT outperforms the state-of-the-art fault localization approaches with at least 268.4% improvement in terms of bugs located at Top-1, and the average improvement is 351.3%. Besides, to investigate whether our finer-grained fault localization result can further improve the effectiveness of downstream APR techniques, we have adapted VarDT to the application of patch filtering, where we use the variables located by VarDT to filter incorrect patches. The results denote that VarDT outperforms the state-of-the-art PATCH-SIM and BATS by filtering 14.8% and 181.8% more incorrect patches, respectively, demonstrating the effectiveness of our approach. It also provides a new way of thinking for improving automatic program repair techniques.
引用
收藏
页数:32
相关论文
共 92 条
[1]   On the accuracy of spectrum-based fault localization [J].
Abreu, Rui ;
Zoeteweij, Peter ;
van Gemund, Arjan J. C. .
TAIC PART 2007 - TESTING: ACADEMIC AND INDUSTRIAL CONFERENCE - PRACTICE AND RESEARCH TECHNIQUES, PROCEEDINGS: CO-LOCATED WITH MUTATION 2007, 2007, :89-+
[2]  
Abreu R, 2006, 12TH PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, P39
[3]  
AGRAWAL H, 1990, P ACM SIGPLAN 90 C P, P246, DOI DOI 10.1145/93542.93576
[4]  
Aiken A, 2006, P 23 INT C MACH LEAR, P1105
[5]  
[Anonymous], 2008, P INT S SOFTW TEST A
[6]  
Arumuga PiramanayagamNainar., 2010, P 32 ACMIEEE INT C S, P255
[7]  
Baah G.K., 2010, P 19 INT S SOFTWARE, P73, DOI [10.1145/1831708.1831717, DOI 10.1145/1831708.1831717]
[8]  
Bai ZF, 2015, IEEE ICST WORKSHOP
[9]  
Banning J.P., 1979, Proceedings of the 6th ACM SIGACT-SIGPLAN symposium on Principles of programming languages, P29
[10]  
Benesty J., 2009, The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation, V2, P1, DOI [DOI 10.4135/9781506326139, DOI 10.1007/978-3-211-89836-9_1025]