A Replication Study: Just-In-Time Defect Prediction with Ensemble Learning

Authors
Young, Steven [1]
Abdou, Tamer [2]
Bener, Ayse [1]
Affiliations
[1] Ryerson Univ, Data Sci Lab, Toronto, ON, Canada
[2] Ryerson Univ, Arish Univ, Toronto, ON, Canada
Source
2018 IEEE/ACM 6TH INTERNATIONAL WORKSHOP ON REALIZING ARTIFICIAL INTELLIGENCE SYNERGIES IN SOFTWARE ENGINEERING (RAISE) | 2018
Keywords
Deep Learning; Defect Prediction
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Just-in-time defect prediction, also known as change-level defect prediction, can be used to allocate resources efficiently and manage project schedules in the software testing and debugging process. It can reduce the amount of code to review and simplify the assignment of developers to bug fixes. This paper reports a replicated experiment and an extension comparing the prediction of defect-prone changes using traditional machine learning techniques and ensemble learning. Using datasets from six open source projects, namely Bugzilla, Columba, JDT, Platform, Mozilla, and PostgreSQL, we replicate the original approach to verify the results of the original experiment and use them as a baseline for comparing alternative approaches. Our results from the replicated experiment are consistent with the original. The original approach uses a combination of data preprocessing and a two-layer ensemble of decision trees. The first layer uses bagging to form multiple random forests. The second layer stacks the forests together with equal weights. We generalize the approach to allow any set of classifiers in the ensemble, optimize the weights of the classifiers, and allow additional layers; the result is a new deep ensemble approach, called deep super learner, which we apply to test the depth of the original study. The deep super learner achieves statistically significantly better results than the original approach on five of the six projects in predicting defects, as measured by F1 score.
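As a rough illustration of the second-layer combination described in the abstract, the following Python sketch contrasts equal-weight averaging of model outputs with weights chosen by a coarse grid search on F1. It is not the authors' implementation: the probabilities and labels are made-up toy values, the names `combine`, `predict`, and `grid_search_weights` are hypothetical, and grid search is only a stand-in for whatever weight optimization the paper uses. For brevity the weights are tuned and scored on the same toy data; a real experiment would use held-out data.

```python
from itertools import product


def f1_score(y_true, y_pred):
    """F1 for the positive (defect-prone) class, labels are 0 or 1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def combine(probas, weights):
    """Weighted average of per-model defect probabilities for each change."""
    total = sum(weights)
    n_changes = len(probas[0])
    return [
        sum(w * p[i] for w, p in zip(weights, probas)) / total
        for i in range(n_changes)
    ]


def predict(probas, weights, threshold=0.5):
    """Label a change defect-prone when the combined score reaches the threshold."""
    return [1 if s >= threshold else 0 for s in combine(probas, weights)]


def grid_search_weights(probas, y_true, steps=4):
    """Try every integer weight vector on a coarse grid; keep the best F1."""
    best_w, best_f1 = None, -1.0
    for w in product(range(steps + 1), repeat=len(probas)):
        if sum(w) == 0:
            continue  # an all-zero weight vector cannot be normalized
        score = f1_score(y_true, predict(probas, w))
        if score > best_f1:
            best_w, best_f1 = w, score
    return best_w, best_f1


# Toy data (illustrative only): 1 = defect-prone change, 0 = clean change.
y_true = [1, 0, 1, 0, 1, 0]
probas = [
    [0.9, 0.2, 0.8, 0.3, 0.7, 0.1],  # hypothetical model A
    [0.3, 0.9, 0.2, 0.8, 0.1, 0.9],  # hypothetical model B
    [0.6, 0.7, 0.4, 0.3, 0.6, 0.4],  # hypothetical model C
]

# Equal weights, as in the second layer of the original two-layer approach.
equal_f1 = f1_score(y_true, predict(probas, [1] * len(probas)))

# Optimized weights, in the spirit of the generalization in the abstract.
best_w, best_f1 = grid_search_weights(probas, y_true)

print("equal-weight F1:", equal_f1)
print("best weights:", best_w, "F1:", best_f1)
```

Because the equal-weight vector is one of the grid points, the searched weights can never score lower than equal weights on the data used for the search; whether that advantage carries over to unseen changes is an empirical question of the kind the paper evaluates.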
Pages: 42-47
Page count: 6