Automated change-prone class prediction on unlabeled dataset using unsupervised method

被引:20
作者
Yan, Meng [2 ]
Zhang, Xiaohong [1 ,2 ]
Liu, Chao [2 ]
Xu, Ling [2 ]
Yang, Mengning [2 ]
Yang, Dan [2 ]
机构
[1] Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Sch Software Engn, Chongqing 401331, Peoples R China
基金
中国国家自然科学基金;
关键词
Software maintenance; Change-prone prediction; Unlabeled dataset; Unsupervised prediction; OBJECT-ORIENTED SOFTWARE; OPEN-SOURCE PRODUCTS; MAINTAINABILITY; METRICS; CODE; PROJECT; MODELS; SUITE;
D O I
10.1016/j.infsof.2017.07.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Software change-prone class prediction can enhance software decision making activities during software maintenance (e.g., resource allocating). Researchers have proposed many change-prone class prediction approaches and most are effective on labeled datasets (projects with historical labeled data). These approaches usually build a supervised model by learning from historical labeled data. However, a major challenge is that this typical change-prone prediction setting cannot be used for unlabeled datasets (e.g., new projects or projects with limited historical data). Although the cross-project prediction is a solution on unlabeled dataset, it needs the prior labeled data from other projects and how to select the appropriate training project is a difficult task. Objective: We aim to build a change-prone class prediction model on unlabeled datasets without the need of prior labeled data. Method: We propose to tackle this task by adopting a state-of-art unsupervised method, namely CLAMI. In addition, we propose a novel unsupervised approach CLAMI+ by extending CLAMI. The key idea is to enable change-prone class prediction on unlabeled dataset by learning from itself. Results: The experiments among 14 open source projects show that the unsupervised methods achieve comparable results to the typical supervised within-project and cross-project prediction baselines in average and the proposed CLAMI+ slightly improves the CLAMI method in average. Conclusion: Our method discovers that it is effective for building change-prone class prediction model by using unsupervised method. It is convenient for practical usage in industry, since it does not need prior labeled data. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 46 条
[1]   TEMPORAL SOFTWARE CHANGE PREDICTION USING NEURAL NETWORKS [J].
Amoui, Mehdi ;
Salehie, Mazeiar ;
Tahvildari, Ladan .
INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2009, 19 (07) :995-1014
[2]  
[Anonymous], PROC INT CONF SOFTW
[3]   Assessing the applicability of fault-proneness models across object-oriented software projects [J].
Briand, LC ;
Melo, WL ;
Wüst, J .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (07) :706-720
[4]   A METRICS SUITE FOR OBJECT-ORIENTED DESIGN [J].
CHIDAMBER, SR ;
KEMERER, CF .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1994, 20 (06) :476-493
[5]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[6]   Three empirical studies on predicting software maintainability using ensemble methods [J].
Elish, Mahmoud O. ;
Aljamaan, Hamoud ;
Ahmad, Irfan .
SOFT COMPUTING, 2015, 19 (09) :2511-2524
[7]   A suite of metrics for quantifying historical changes to predict future change-prone classes in object-oriented software [J].
Elish, Mahmoud O. ;
Al-Khiaty, Mojeeb Al-Rahman .
JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2013, 25 (05) :407-437
[8]   Application of TreeNet in Predicting Object-Oriented Software Maintainability: A Comparative Study [J].
Elish, Mahmoud O. ;
Elish, Karim O. .
13TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING: CSMR 2009, PROCEEDINGS, 2009, :69-77
[9]  
Eski S., 2011, 2011 IEEE Fourth International Conference on Software Testing, Verification and Validation Workshops (ICSTW), P566, DOI 10.1109/ICSTW.2011.43
[10]   Automated classification of software change messages by semi-supervised Latent Dirichlet Allocation [J].
Fu, Ying ;
Yan, Meng ;
Zhang, Xiaohong ;
Xu, Ling ;
Yang, Dan ;
Kymer, Jeffrey D. .
INFORMATION AND SOFTWARE TECHNOLOGY, 2015, 57 :369-377