Which is More Important for Cross-Project Defect Prediction: Instance or Feature?

被引:24
作者
Yu, Qiao [1 ]
Jiang, Shujuan [1 ]
Qian, Junyan [2 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou, Peoples R China
[2] Guilin Univ Elect Technol, Guangxi Key Lab Trusted Software, Guilin, Peoples R China
来源
2016 INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, TESTING AND EVOLUTION (SATE 2016) | 2016年
关键词
software testing; cross-project defect prediction; instance filter; feature selection; METRICS;
D O I
10.1109/SATE.2016.22
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software defect prediction plays an important role in software testing. We can build the prediction model based on historical data. However, for a new project, we cannot be able to build a good prediction model due to lack of historical data. Therefore, researchers have proposed the cross-project defect prediction (CPDP) methods to share the historical data among different projects. In practice, there may be the problems of instance distribution differences and feature redundancy in cross-project datasets. To investigate which is more important for CPDP, instance or feature, we take instance filter and feature selection as examples to show their efficiency for CPDP. Our experiments are conducted on NASA and PROMISE datasets, and the results indicate that feature selection performs better than instance filter in improving the performance of CPDP. We can conclude that feature could be more important than instance for CPDP.
引用
收藏
页码:90 / 95
页数:6
相关论文
共 34 条
  • [1] INSTANCE-BASED LEARNING ALGORITHMS
    AHA, DW
    KIBLER, D
    ALBERT, MK
    [J]. MACHINE LEARNING, 1991, 6 (01) : 37 - 66
  • [2] [Anonymous], ICDL
  • [3] [Anonymous], 1999, Ph.D. Thesis
  • [4] [Anonymous], APPL SOFT C IN PRESS
  • [5] [Anonymous], PATTERN RECOGN LETT
  • [6] [Anonymous], P 20 INT S FDN SOFTW
  • [7] Approximations of functions by a multilayer perceptron: a new approach
    Attali, JG
    Pages, G
    [J]. NEURAL NETWORKS, 1997, 10 (06) : 1069 - 1081
  • [8] Assessing the applicability of fault-proneness models across object-oriented software projects
    Briand, LC
    Melo, WL
    Wüst, J
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (07) : 706 - 720
  • [9] Multi-Objective Cross-Project Defect Prediction
    Canfora, Gerardo
    De Lucia, Andrea
    Di Penta, Massimiliano
    Oliveto, Rocco
    Panichella, Annibale
    Panichella, Sebastiano
    [J]. 2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST 2013), 2013, : 252 - 261
  • [10] A METRICS SUITE FOR OBJECT-ORIENTED DESIGN
    CHIDAMBER, SR
    KEMERER, CF
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1994, 20 (06) : 476 - 493