What's a typical commit? A characterization of open source software repositories

Cited by: 90
Authors
Alali, Abdulkareem [1]
Kagdi, Huzefa [1]
Maletic, Jonathan I. [1]
Affiliation
[1] Kent State Univ, Dept Comp Sci, Kent, OH 44242 USA
Source
PROCEEDINGS OF THE 16TH IEEE INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION | 2008
DOI
10.1109/ICPC.2008.24
Chinese Library Classification
TP31 [Computer software];
Discipline classification codes
081202; 0835;
Abstract
This research examines the version histories of nine open source software systems to uncover trends and characteristics of how developers commit source code to version control systems (e.g., Subversion). The goal is to characterize what a typical, or normal, commit looks like with respect to the number of files, number of lines, and number of hunks committed together. Results for these three characteristics are presented, and the commits are categorized from extra small to extra large. The findings show that approximately 75% of commits in the systems examined are quite small along all three characteristics. The commit messages are also examined alongside these characteristics: the most common words are extracted from the messages and correlated with the size categories of the commits. It is observed that the size categories can be indicative of the types of maintenance activities being performed.
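The three size measures named in the abstract (files, lines, and hunks per commit) can be illustrated with a minimal Python sketch that reads a unified diff and bins each measure into a size category. This is not the authors' tooling. The function names `measure_diff` and `categorize`, the choice to count a "line" as any added or deleted line, and above all the numeric category boundaries in `SIZE_BOUNDS` are assumptions made for illustration; the abstract does not state the boundaries the paper uses.

```python
import re

# HYPOTHETICAL inclusive upper bounds per category. The abstract does not
# give the paper's actual boundaries; these values are placeholders.
SIZE_BOUNDS = [("extra-small", 5), ("small", 25), ("medium", 125), ("large", 625)]

# Unified-diff hunk header, e.g. "@@ -10,2 +10,3 @@"; a missing length means 1.
HUNK_RE = re.compile(r"^@@ -\d+(?:,(\d+))? \+\d+(?:,(\d+))? @@")


def measure_diff(diff_text):
    """Count files, hunks, and changed (added + deleted) lines in a unified diff."""
    files = hunks = changed = 0
    old_left = new_left = 0  # lines still expected in the current hunk body
    for line in diff_text.splitlines():
        if old_left > 0 or new_left > 0:
            # Inside a hunk body: classify the line by its first character.
            if line.startswith("+"):
                new_left -= 1
                changed += 1
            elif line.startswith("-"):
                old_left -= 1
                changed += 1
            elif line.startswith("\\"):
                pass  # "\ No newline at end of file" marker
            else:
                old_left -= 1  # context line appears in both versions
                new_left -= 1
            continue
        match = HUNK_RE.match(line)
        if match:
            hunks += 1
            old_left = int(match.group(1)) if match.group(1) is not None else 1
            new_left = int(match.group(2)) if match.group(2) is not None else 1
        elif line.startswith("+++ "):
            files += 1  # one "+++" header per file in the diff
    return {"files": files, "hunks": hunks, "lines": changed}


def categorize(value, bounds=SIZE_BOUNDS):
    """Map a count to a size label using the (assumed) boundaries."""
    for label, upper in bounds:
        if value <= upper:
            return label
    return "extra-large"


if __name__ == "__main__":
    sample = "\n".join([
        "--- a/foo.c",
        "+++ b/foo.c",
        "@@ -1,3 +1,3 @@",
        " int main() {",
        "-  return 1;",
        "+  return 0;",
        " }",
    ])
    sizes = measure_diff(sample)
    print(sizes, {name: categorize(count) for name, count in sizes.items()})
```

Tracking the remaining line counts from each hunk header, rather than matching on `---` and `+++` prefixes alone, avoids miscounting changed lines whose content happens to begin with `--` or `++`.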
Pages: 182-191
Page count: 10