Predicting Software Maintenance Effort by Mining Software Project Reports Using Inter-Version Validation

被引:0
作者
Jindal, Rajni [1 ]
Malhotra, Ruchika [1 ]
Jain, Abha [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci & Engn, Main Bawana Rd, Delhi 110042, India
关键词
Defect reports; text mining; machine learning; software maintenance effort prediction; receiver operating characteristics;
D O I
10.1142/S021853931640009X
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Changes in the software are unavoidable due to an ever changing dynamic and activeenvironment wherein expectations and requirements of the users tend to change rapidly. As a result, software needs to upgrade itself from its previous version to the next version in order to meet expectations of the user. The upgradation of the software is in terms of total number of Lines of Code (LOC) that might have been inserted, deleted or modified in moving from one version of software to the next. These changes are maintained in the change reports which constitute of the defect ID and defect description. Defect description describes the cause of defect which might have occurred in the previous version of the software due to which either new LOC needs to be inserted or existing LOC need to be deleted or modified. A lot of effort is required to correct the defects identified in software at the maintenance phase i.e., when software is delivered at the customers end. Thus, in this paper, we intend to predict maintenance effort by analyzing the defect reports using text mining techniques and thereafter developing the prediction models using suitable machine learning algorithms viz. Multi-Layer Perceptron (MLP), Radial-Basis Function (RBF) network and Decision Tree (DT). We have considered the changes between three successive versions of 'MMS' application package of Android operating system and have performed inter-version validation where the model predicted using the version 'v' is validated on the subsequent version i.e., 'v+1'. The performance of the model was evaluated using Receiver Operating Characteristics (ROC) analysis. The results indicated that the model predicted on 'MMS' 4.0 version using MLP algorithm has shown good results when validated on 'MMS' 4.1 version. On the other hand, the performance of RBF and DT algorithms has been consistently average in predicting the maintenance effort.
引用
收藏
页数:18
相关论文
共 35 条
  • [1] Aggarwal K. K., 2005, Journal of Computer Sciences, V1, P538, DOI 10.3844/jcssp.2005.538.542
  • [2] Aggarwal K. K., 2006, APPL ARTIFICIAL NEUR, V22, P140
  • [3] An integrated measure of software maintainability
    Aggarwal, KK
    Singh, Y
    Chhabra, JK
    [J]. ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 2002 PROCEEDINGS, 2002, : 235 - 241
  • [4] [Anonymous], 2004, P INT C SOFTWARE ENG
  • [5] [Anonymous], 1993, 12191993 IEEE, DOI [10.1109/IEEESTD.1993.11557J, DOI 10.1109/IEEESTD.1993.11557J]
  • [6] [Anonymous], 2007, 4 INT WORKSH MIN SOF
  • [7] A controlled experiment for evaluating quality guidelines on the maintainability of object-oriented designs
    Briand, LC
    Bunse, C
    Daly, JW
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2001, 27 (06) : 513 - 530
  • [8] Broomhead D. S., 1988, Complex Systems, V2, P321
  • [9] Canfora G., 2005, STEP, P99
  • [10] Cong Jin, 2010, Proceedings of the 2010 Second International Conference on Multimedia and Information Technology (MMIT 2010), P24, DOI 10.1109/MMIT.2010.10