Predicting software project effort: A grey relational analysis based method

被引:66
作者
Song, Qinbao [1 ]
Shepperd, Martin [2 ]
机构
[1] Xi An Jiao Tong Univ, Xian 710049, Shaanxi, Peoples R China
[2] Brunel Univ, Uxbridge UB8 3PH, Middx, England
基金
中国国家自然科学基金;
关键词
Software project estimation; Effort prediction; Feature subset selection; Outlier detection; Grey relational analysis; FEATURE-SELECTION; REGRESSION; ALGORITHM;
D O I
10.1016/j.eswa.2010.12.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The inherent uncertainty of the software development process presents particular challenges for software effort prediction. We need to systematically address missing data values, outlier detection, feature subset selection and the continuous evolution of predictions as the project unfolds, and all of this in the context of data-starvation and noisy data. However, in this paper, we particularly focus on outlier detection, feature subset selection, and effort prediction at an early stage of a project. We propose a novel approach of using grey relational analysis (GRA) from grey system theory (GST), which is a recently developed system engineering theory based on the uncertainty of small samples. In this work we address some of the theoretical challenges in applying GRA to outlier detection, feature subset selection, and effort prediction, and then evaluate our approach on five publicly available industrial data sets using both stepwise regression and Analogy as benchmarks. The results are very encouraging in the sense of being comparable or better than other machine learning techniques and thus indicate that the method has considerable potential. (c) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:7302 / 7316
页数:15
相关论文
共 52 条
[1]   SOFTWARE FUNCTION, SOURCE LINES OF CODE, AND DEVELOPMENT EFFORT PREDICTION - A SOFTWARE SCIENCE VALIDATION [J].
ALBRECHT, AJ ;
GAFFNEY, JE .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1983, 9 (06) :639-648
[2]  
[Anonymous], 1981, Software Engineering Economics
[3]  
[Anonymous], 1997, MACHINE LEARNING, MCGRAW-HILL SCIENCE/ENGINEERING/MATH
[4]   SOFTWARE ENGINEERING ECONOMICS [J].
BOEHM, BW .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1984, 10 (01) :4-21
[5]   When will it be done? Machine learner answers to the 300-billion-dollar question [J].
Boetticher, GD .
IEEE INTELLIGENT SYSTEMS, 2003, 18 (03) :48-50
[6]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[7]  
Briand L. C., 2000, Proceedings of the 2000 International Conference on Software Engineering. ICSE 2000 the New Millennium, P377, DOI 10.1109/ICSE.2000.870428
[8]   A PATTERN-RECOGNITION APPROACH FOR SOFTWARE ENGINEERING DATA-ANALYSIS [J].
BRIAND, LC ;
BASILI, VR ;
THOMAS, WM .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1992, 18 (11) :931-942
[9]  
BRIAND LC, 2002, ENCY SOFTW ENG
[10]   Can genetic programming improve software effort estimation? A comparative evaluation [J].
Burgess, CJ ;
Lefley, M .
INFORMATION AND SOFTWARE TECHNOLOGY, 2001, 43 (14) :863-873