A comprehensive evaluation of capture-recapture models for estimating software defect content

Cited by: 76
Authors
Briand, LC
El Emam, K
Freimut, BG
Laitenberger, O
Affiliations
[1] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada
[2] Natl Res Council Canada, Inst Informat Technol, Ottawa, ON K1A 0R6, Canada
[3] Fraunhofer Inst Expt Software Engn, D-67661 Kaiserslautern, Germany
Keywords
inspections; capture-recapture models; robustness; fault content estimation
DOI
10.1109/32.852741
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
An important requirement to control the inspection of software artifacts is to be able to decide, based on more objective information, whether the inspection can stop or whether it should continue to achieve a suitable level of artifact quality. A prediction of the number of remaining defects in an inspected artifact can be used for decision making. Several studies in software engineering have considered capture-recapture models, originally proposed by biologists to estimate animal populations, to make such a prediction. However, few studies compare the actual number of remaining defects to the one predicted by a capture-recapture model on real software engineering artifacts. Thus, there is little work looking at the robustness of capture-recapture models under realistic software engineering conditions, where it is expected that some of their assumptions will be violated. Simulations have been performed, but no definite conclusions can be drawn regarding the degree of accuracy of such models under realistic inspection conditions and the factors affecting this accuracy. Furthermore, the existing studies focused on a subset of the existing capture-recapture models. Thus, a more exhaustive comparison is still missing. In this study, we focus on traditional inspections and estimate, based on actual inspection data, the degree of accuracy of relevant, state-of-the-art capture-recapture models as they have been proposed in biology and for which statistical estimators exist. In order to assess their robustness, we look at the impact of the number of inspectors and the number of actual defects on the estimators' accuracy based on actual inspection data. Our results show that models are strongly affected by the number of inspectors and, therefore, one must consider this factor before using capture-recapture models. When the number of inspectors is too small, no model is sufficiently accurate and underestimation may be substantial. In addition, some models perform better than others in a large number of conditions and plausible reasons are discussed. Based on our analyses, we recommend using a model taking into account that defects have different probabilities of being detected and the corresponding Jackknife Estimator. Furthermore, we attempt to calibrate the prediction models based on their relative error, as previously computed on other inspections. Although this approach is intuitive and straightforward, we identified theoretical limitations to it, which were then confirmed by the data.
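The abstract names the Jackknife Estimator and relative error without giving formulas. As a rough illustration only, the sketch below uses the commonly cited first-order jackknife form, N = D + ((k - 1) / k) * f1, where D is the number of distinct defects found, k the number of inspectors, and f1 the number of defects found by exactly one inspector. The choice of the first-order form, the function names, and the sample data are assumptions made for this example and are not taken from the paper.

```python
from collections import Counter


def jackknife_first_order(detections):
    """Estimate total defect content from per-inspector findings.

    detections: list of sets, one per inspector, holding the ids of the
    defects that inspector found. Assumes the first-order jackknife form;
    the paper may evaluate other orders as well.
    """
    k = len(detections)
    if k == 0:
        raise ValueError("at least one inspector is required")
    # Count how many inspectors found each defect.
    counts = Counter(d for found in detections for d in found)
    distinct = len(counts)
    # f1: defects detected by exactly one inspector.
    f1 = sum(1 for c in counts.values() if c == 1)
    return distinct + (k - 1) * f1 / k


def relative_error(estimate, actual):
    """Signed relative error; negative values mean underestimation."""
    return (estimate - actual) / actual


# Hypothetical inspection: three inspectors, defect ids are arbitrary labels.
inspectors = [{1, 2, 3}, {2, 3, 4}, {3, 5}]
estimate = jackknife_first_order(inspectors)
# Assumed true defect count of 8, purely for illustration.
print(estimate, relative_error(estimate, 8))
```

With these made-up inputs, five distinct defects are found and three of them by a single inspector, so the estimate exceeds the observed count; a negative relative error would indicate the underestimation the abstract warns about when few inspectors take part.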
Pages: 518-540
Page count: 23