Practical assessment of the models for identification of defect-prone classes in object-oriented commercial systems using design metrics

Cited by: 34
Authors
Succi, G [1]
Pedrycz, W [1]
Stefanovic, M [1]
Miller, J [1]
Affiliations
[1] Univ Alberta, Dept Elect & Comp Engn, Elect & Comp Engn Res Facil, Edmonton, AB T6G 2V4, Canada
DOI
10.1016/S0164-1212(02)00024-9
Chinese Library Classification number
TP31 [Computer software]
Subject classification code
081202; 0835
Abstract
The goal of this paper is to investigate and assess the ability of explanatory models based on design metrics to describe and predict defect counts in an object-oriented software system. Specifically, we empirically evaluate the influence of design decisions on the defect behavior of the classes in two products from the commercial software domain. Information provided by these models can help in resource allocation and serve as a basis for assessment and future improvements. We use innovative statistical methods to deal with the peculiarities of software engineering data, such as non-normally distributed count data. To deal with overdispersed data and an excess of zeros in the dependent variable, we use negative binomial (NB) and zero-inflated NB regression in addition to Poisson regression. Furthermore, we form a framework for comparing the models' descriptive and predictive ability. The models' ability to identify the most critical classes in the system early in the software development process can help in allocating resources and foster software quality improvement. In addition to correlation coefficients, we use further statistics to assess a model's ability to explain high variability in the data, and Pareto analysis to assess a model's ability to identify the most critical classes in the system. Results indicate that design aspects related to communication between classes and inheritance can be used as indicators of the most defect-prone classes, which require the majority of resources in the development and testing phases. The zero-inflated negative binomial regression model, designed to explicitly model the occurrence of zero counts in the dataset, provides the best results for this purpose. (C) 2002 Elsevier Science Inc. All rights reserved.
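As a rough illustration of the zero-inflated NB model named in the abstract, the following Python sketch evaluates a zero-inflated negative binomial probability mass function. It is not taken from the paper: the function names, the parameter names (`mu` for the mean, `alpha` for the dispersion, `pi` for the extra-zero probability), and the choice of the common NB2 parameterization (variance `mu + alpha * mu**2`) are all assumptions made for the example.

```python
import math


def nb_pmf(k, mu, alpha):
    """NB2 probability of count k, with mean mu > 0 and dispersion alpha > 0.

    Assumed parameterization: variance = mu + alpha * mu**2.
    """
    r = 1.0 / alpha          # shape parameter
    p = r / (r + mu)         # equals 1 / (1 + alpha * mu)
    log_pmf = (
        math.lgamma(k + r) - math.lgamma(r) - math.lgamma(k + 1)
        + r * math.log(p)
        + k * math.log1p(-p)
    )
    return math.exp(log_pmf)


def zinb_pmf(k, mu, alpha, pi):
    """Zero-inflated NB: with probability pi the count is a structural zero,
    otherwise it is drawn from the NB2 distribution above."""
    base = (1.0 - pi) * nb_pmf(k, mu, alpha)
    return pi + base if k == 0 else base


if __name__ == "__main__":
    # Hypothetical values: mean 2 defects per class, dispersion 0.5,
    # and 30% of classes being structurally defect-free.
    print(zinb_pmf(0, mu=2.0, alpha=0.5, pi=0.3))  # approximately 0.475
```

With these hypothetical values the plain NB model assigns probability 0.25 to a zero count, while the zero-inflated version assigns about 0.475, which shows how the extra mixing weight accommodates an excess of zeros in the dependent variable.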
Pages: 1-12
Page count: 12