RCLinker: Automated Linking of Issue Reports and Commits Leveraging Rich Contextual Information

被引:52
作者
Le, Tien-Duy B. [1 ]
Linares-Vasquez, Mario [2 ]
Lo, David [1 ]
Poshyvanyk, Denys [2 ]
机构
[1] Singapore Management Univ, Sch Informat Syst, Singapore 178902, Singapore
[2] Coll William & Mary, Dept Comp Sci, Williamsburg, VA 23187 USA
来源
2015 IEEE 23RD INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION ICPC 2015 | 2015年
关键词
DUPLICATE; SEVERITY;
D O I
10.1109/ICPC.2015.13
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Links between issue reports and their corresponding commits in version control systems are often missing. However, these links are important for measuring the quality of a software system, predicting defects, and many other tasks. Several approaches have been designed to solve this problem by automatically linking bug reports to source code commits via comparison of textual information in commit messages and bug reports. Yet, the effectiveness of these techniques is oftentimes suboptimal when commit messages are empty or contain minimum information; this particular problem makes the process of recovering traceability links between commits and bug reports particularly challenging. In this work, we aim at improving the effectiveness of existing bug linking techniques by utilizing rich contextual information. We rely on a recently proposed approach, namely ChangeScribe, which generates commit messages containing rich contextual information by using code summarization techniques. Our approach then extracts features from these automatically generated commit messages and bug reports, and inputs them into a classification technique that creates a discriminative model used to predict if a link exists between a commit message and a bug report. We compared our approach, coined as RCLinker (Rich Context Linker), to MLink, which is an existing state-of-the-art bug linking approach. Our experiment results on bug reports from six software projects show that RCLinker outperforms MLink in terms of F-measure by 138.66%.
引用
收藏
页码:36 / 47
页数:12
相关论文
共 55 条
[1]  
Nguyen AT, 2012, IEEE INT CONF AUTOM, P70, DOI 10.1145/2351676.2351687
[2]  
[Anonymous], PREDICTING VULNERABL
[3]  
[Anonymous], 2010, P FAST SOFTW ENCR WO
[4]  
[Anonymous], P 37 INT C SOFTW ENG
[5]  
[Anonymous], 37 IEEE ACM INT C SO
[6]  
[Anonymous], 2008, INTRO INFORM RETRIEV, DOI DOI 10.1017/CBO9780511809071
[7]  
[Anonymous], 2011, ASE
[8]  
[Anonymous], 2011, Pei. data mining concepts and techniques, DOI 10.1016/C2009-0-61819-5
[9]  
[Anonymous], MSR
[10]  
[Anonymous], 2008, Proceedings of the 4th international workshop on Predictor models in software engineering