Towards Automatic Generation of Short Summaries of Commits

被引:29
作者
Jiang, Siyuan [1 ]
McMillan, Collin [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
来源
2017 IEEE/ACM 25TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC) | 2017年
关键词
D O I
10.1109/ICPC.2017.12
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Committing to a version control system means submitting a software change to the system. Each commit can have a message to describe the submission. Several approaches have been proposed to automatically generate the content of such messages. However, the quality of the automatically generated messages falls far short of what humans write. In studying the differences between auto-generated and human-written messages, we found that 82% of the human-written messages have only one sentence, while the automatically generated messages often have multiple lines. Furthermore, we found that the commit messages often begin with a verb followed by an direct object. This finding inspired us to use a "verb+object" format in this paper to generate short commit summaries. We split the approach into two parts: verb generation and object generation. As our first try, we trained a classifier to classify a diff to a verb. We are seeking feedback from the community before we continue to work on generating direct objects for the commits.
引用
收藏
页码:320 / 323
页数:4
相关论文
共 13 条
[1]   What's a typical commit? A characterization of open source software repositories [J].
Alali, Abdulkareem ;
Kagdi, Huzefa ;
Maletic, Jonathan I. .
PROCEEDINGS OF THE 16TH IEEE INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, 2008, :182-191
[2]  
Buse R. P. L., 2010, P IEEE ACM INT C AUT, P33
[3]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[4]   Change distilling:: Tree differencing for fine-grained source code change extraction [J].
Fluri, Beat ;
Wuersch, Michael ;
Pinzger, Martin ;
Gall, Harald C. .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2007, 33 (11) :725-743
[5]  
Hattori L. P., 2008 23 ASE WORKSH, P63
[6]   Automatic Classification of Large Changes into Maintenance Categories [J].
Hindle, Abram ;
German, Daniel M. ;
Godfrey, Michael W. ;
Holt, Richard C. .
ICPC: 2009 IEEE 17TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, 2009, :30-+
[7]   RCLinker: Automated Linking of Issue Reports and Commits Leveraging Rich Contextual Information [J].
Le, Tien-Duy B. ;
Linares-Vasquez, Mario ;
Lo, David ;
Poshyvanyk, Denys .
2015 IEEE 23RD INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION ICPC 2015, 2015, :36-47
[8]   ChangeScribe: A Tool for Automatically Generating Commit Messages [J].
Linares-Vasquez, Mario ;
Cortes-Coy, Luis Fernando ;
Aponte, Jairo ;
Poshyvanyk, Denys .
2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, VOL 2, 2015, :709-712
[9]   The Stanford CoreNLP Natural Language Processing Toolkit [J].
Manning, Christopher D. ;
Surdeanu, Mihai ;
Bauer, John ;
Finkel, Jenny ;
Bethard, Steven J. ;
McClosky, David .
PROCEEDINGS OF 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, 2014, :55-60
[10]  
Mauczka A., MSR 15, P490