Copula-based software metrics aggregation

被引:6
作者
Ulan, Maria [1 ]
Lowe, Welf [1 ]
Ericsson, Morgan [1 ]
Wingkvist, Anna [1 ]
机构
[1] Linnaeus Univ, Ctr Data Intens Sci & Applicat, Data Driven Software & Informat Qual Grp, S-35195 Vaxjo, Sweden
关键词
Quality assessment; Quantitative methods; Software metrics; Aggregation; Multivariate statistical methods; Probabilistic models; Copula; QUALITY; MODEL;
D O I
10.1007/s11219-021-09568-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A quality model is a conceptual decomposition of an abstract notion of quality into relevant, possibly conflicting characteristics and further into measurable metrics. For quality assessment and decision making, metrics values are aggregated to characteristics and ultimately to quality scores. Aggregation has often been problematic as quality models do not provide the semantics of aggregation. This makes it hard to formally reason about metrics, characteristics, and quality. We argue that aggregation needs to be interpretable and mathematically well defined in order to assess, to compare, and to improve quality. To address this challenge, we propose a probabilistic approach to aggregation and define quality scores based on joint distributions of absolute metrics values. To evaluate the proposed approach and its implementation under realistic conditions, we conduct empirical studies on bug prediction of ca. 5000 software classes, maintainability of ca. 15000 open-source software systems, and on the information quality of ca. 100000 real-world technical documents. We found that our approach is feasible, accurate, and scalable in performance.
引用
收藏
页码:863 / 899
页数:37
相关论文
共 66 条
[31]   Predicting fault incidence using software change history [J].
Graves, TL ;
Karr, AF ;
Marron, JS ;
Siy, H .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2000, 26 (07) :653-661
[32]  
Hald A., 2007, HIST PARAMETRIC STAT, V1713, P11
[33]   A practical model for measuring maintainability - a preliminary report [J].
Heitlager, Ilja ;
Kuipers, Tobias ;
Visser, Joost .
QUATIC 2007: 6TH INTERNATIONAL CONFERENCE ON THE QUALITY OF INFORMATION AND COMMUNICATIONS TECHNOLOGY, PROCEEDINGS, 2007, :30-39
[34]  
Institute of Electrical and Electronics Engineers, 1991, IEEE Standard Computer Dictionary: A Compilation of IEEE Standard Computer Glossaries, DOI [10.1109/IEEESTD.1990.101064, DOI 10.1109/IEEESTD.1990.101064, 10.1109/IEEESTD.1991.106963, DOI 10.1109/IEEESTD.1991.106963]
[35]   AutoSpearman: Automatically Mitigating Correlated Software Metrics for Interpreting Defect Models [J].
Jiarpakdee, Jirayus ;
Tantithamthavorn, Chakkrit ;
Treude, Christoph .
PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2018, :92-103
[36]  
Juran J., 1999, Quality handbook, P173
[37]  
Kendall M. G., 1948, Rank correlation methods.
[38]   Software quality: The elusive target [J].
Kitchenham, B ;
Pfleeger, SL .
IEEE SOFTWARE, 1996, 13 (01) :12-&
[39]   TOWARDS A FRAMEWORK FOR SOFTWARE MEASUREMENT VALIDATION [J].
KITCHENHAM, B ;
LAWRENCE, S ;
FENTON, N .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1995, 21 (12) :929-944
[40]  
Letouzey Jean-Louis, 2010, Proceedings 2010 Second International Conference on Advances in System Testing and Validation Lifecycle (VALID), P43, DOI 10.1109/VALID.2010.31