Recovering from a decade: a systematic mapping of information retrieval approaches to software traceability

Authors
Markus Borg
Per Runeson
Anders Ardö
Affiliations
[1] Lund University, Department of Computer Science
[2] Lund University, Department of Electrical and Information Technology
Source
Empirical Software Engineering | 2014 / Vol. 19
Keywords
Traceability; Information retrieval; Software artifacts; Systematic mapping study;
DOI
Not available
Abstract
Engineers in large-scale software development have to manage large amounts of information, spread across many artifacts. Several researchers have proposed expressing retrieval of trace links among artifacts, i.e. trace recovery, as an Information Retrieval (IR) problem. The objective of this study is to produce a map of work on IR-based trace recovery, with a particular focus on previous evaluations and strength of evidence. We conducted a systematic mapping of IR-based trace recovery. Of the 79 publications classified, a majority applied algebraic IR models. While a set of studies on students indicates that IR-based trace recovery tools support certain work tasks, most previous studies do not go beyond reporting precision and recall of candidate trace links from evaluations using datasets containing fewer than 500 artifacts. Our review identified a need for industrial case studies. Furthermore, we conclude that the overall quality of reporting should be improved regarding context and tool details, measures reported, and use of IR terminology. Finally, based on our empirical findings, we present suggestions on how to advance research on IR-based trace recovery.
Pages: 1565-1616
Page count: 51