Trustrace: Mining Software Repositories to Improve the Accuracy of Requirement Traceability Links

被引:68
作者
Ali, Nasir [1 ]
Gueheneuc, Yann-Gael [1 ]
Antoniol, Giuliano [1 ]
机构
[1] Ecole Polytech, Dept Genie Informat & Genie Logiciel, Montreal, PQ H3C 3A7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Traceability; requirements; feature; source code; repositories; experts; trust-based model; CODE;
D O I
10.1109/TSE.2012.71
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Traceability is the only means to ensure that the source code of a system is consistent with its requirements and that all and only the specified requirements have been implemented by developers. During software maintenance and evolution, requirement traceability links become obsolete because developers do not/cannot devote effort to updating them. Yet, recovering these traceability links later is a daunting and costly task for developers. Consequently, the literature has proposed methods, techniques, and tools to recover these traceability links semi-automatically or automatically. Among the proposed techniques, the literature showed that information retrieval (IR) techniques can automatically recover traceability links between free-text requirements and source code. However, IR techniques lack accuracy (precision and recall). In this paper, we show that mining software repositories and combining mined results with IR techniques can improve the accuracy (precision and recall) of IR techniques and we propose Trustrace, a trust-based traceability recovery approach. We apply Trustrace on four medium-size open-source systems to compare the accuracy of its traceability links with those recovered using state-of-the-art IR techniques from the literature, based on the Vector Space Model and Jensen-Shannon model. The results of Trustrace are up to 22.7 percent more precise and have 7.66 percent better recall values than those of the other techniques, on average. We thus show that mining software repositories and combining the mined data with existing results from IR techniques improves the precision and recall of requirement traceability links.
引用
收藏
页码:725 / 741
页数:17
相关论文
共 40 条
[1]   A traceability technique for specifications [J].
Abadi, Aharcin ;
Nisenson, Mordechai ;
Simionovici, Yahalomit .
PROCEEDINGS OF THE 16TH IEEE INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, 2008, :103-112
[2]  
Ali N., 2011, 2011 18th Working Conference on Reverse Engineering, P45, DOI 10.1109/WCRE.2011.16
[3]  
Ali N., 2011, FACTORS IMPACTING IN
[4]   Trust-based Requirements Traceability [J].
Ali, Nasir ;
Gueheneuc, Yann-Gael ;
Antoniol, Giuliano .
2011 IEEE 19TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC), 2011, :111-120
[5]  
[Anonymous], 2010, P FAST SOFTW ENCR WO
[6]  
[Anonymous], 1992, Information retrieval: Data structures and algorithms
[7]  
[Anonymous], J SOFTWARE MAINTENAN
[8]   Recovering traceability links between code and documentation [J].
Antoniol, G ;
Canfora, G ;
Casazza, G ;
De Lucia, A ;
Merlo, E .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (10) :970-983
[9]   A survey of trust in computer science and the Semantic Web [J].
Artz, Donovan ;
Gil, Yolanda .
JOURNAL OF WEB SEMANTICS, 2007, 5 (02) :58-71
[10]  
Asuncion H.U., 2010, P 32 INT C SOFTW ENG, P95