An effective software cross-project fault prediction model for quality improvement

被引:6
作者
Khatri, Yogita [1 ]
Singh, Sandeep Kumar [1 ]
机构
[1] Jaypee Inst Informat Technol, Dept Comp Sci Engn & Informat Technol, Noida, India
关键词
Software quality; Feature selection; Instance selection; Cross-project fault prediction; Effort-based performance measures; Non-effort-based performance measures; FEATURE-SELECTION METHOD; DEFECT PREDICTION; METRICS; CLASSIFICATION; OPTIMIZATION; VALIDATION;
D O I
10.1016/j.scico.2022.102918
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As a quality assurance activity, cross-project fault prediction (CPFP) involves building a model for predicting the faults in a specific software project (aka target project) facing the shortage of within-project training data, leveraging cross-projects data. However, the quality of training data decides the success of a CPFP model. Existing CPFP approaches mainly focused on instance selection with minimal attention to feature selection. Further, the validation of their models has been performed through non-effort-based performance measures (NEPMs) only, considering the availability of unlimited inspection resources, which is impractical. Addressing these problems, we propose a Hybrid Training Data Selection (HTDS) approach combining feature selection along with instance selection for building a proficient and pragmatic CPFP model by validating its effectiveness in terms of effort-based performance measures (EPMs) along with the NEPMs to confirm its practical applicability. The empirical results on 62 datasets manifested the potency of the proposed approach over all the compared approaches in terms of NEPMs and EPMs collectively. Thus, our proposed HTDS approach results in a more productive and practical CPFP model that can empower practitioners to produce quality software at a lesser cost.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:21
相关论文
共 71 条
[1]   Discriminating features-based cost-sensitive approach for software defect prediction [J].
Ali, Aftab ;
Khan, Naveed ;
Abu-Tair, Mamun ;
Noppen, Joost ;
McClean, Sally ;
McChesney, Ian .
AUTOMATED SOFTWARE ENGINEERING, 2021, 28 (02)
[2]   Improving Cross-Project Defect Prediction Methods with Data Simplification [J].
Amasaki, Sousuke ;
Kawata, Kazuya ;
Yokogawa, Tomoyuki .
PROCEEDINGS 41ST EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS SEAA 2015, 2015, :96-103
[3]  
[Anonymous], 2011, Proceedings of the 2011 International Symposium on Software Testing and Analysis, DOI DOI 10.1145/2001420.2001445
[4]  
[Anonymous], 2007, 3 INT WORKSH PRED MO, DOI [10.1109/PROMISE.2007.10, DOI 10.1109/PROMISE.2007.10]
[5]   Hybrid Filter-Wrapper Feature Selection Method for Sentiment Classification [J].
Ansari, Gunjan ;
Ahmad, Tanvir ;
Doja, Mohammad Najmud .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) :9191-9208
[6]   Software defect prediction using cost-sensitive neural network [J].
Arar, Omer Faruk ;
Ayan, Kursat .
APPLIED SOFT COMPUTING, 2015, 33 :263-277
[7]   A validation of object-oriented design metrics as quality indicators [J].
Basili, VR ;
Briand, LC ;
Melo, WL .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1996, 22 (10) :751-761
[8]   Assessing the applicability of fault-proneness models across object-oriented software projects [J].
Briand, LC ;
Melo, WL ;
Wüst, J .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (07) :706-720
[9]   Defect prediction as a multiobjective optimization problem [J].
Canfora, Gerardo ;
De Lucia, Andrea ;
Di Penta, Massimiliano ;
Oliveto, Rocco ;
Panichella, Annibale ;
Panichella, Sebastiano .
SOFTWARE TESTING VERIFICATION & RELIABILITY, 2015, 25 (04) :426-459
[10]  
Cervante L, 2012, IEEE C EVOL COMPUTAT