When and How Using Structural Information to Improve IR-based Traceability Recovery

Cited by: 50
Authors
Panichella, Annibale [1]
McMillan, Collin [2]
Moritz, Evan
Palmieri, Davide
Oliveto, Rocco [3]
Poshyvanyk, Denys [4]
De Lucia, Andrea [1]
Affiliations
[1] Univ Salerno, Fisciano, SA, Italy
[2] Notre Dame Univ, Notre Dame, IN USA
[3] Univ Molise, Campobasso, Italy
[4] Coll William & Mary, Williamsburg, VA USA
Source
PROCEEDINGS OF THE 17TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR 2013) | 2013
Keywords
Traceability Link Recovery; Empirical studies; IN-SOURCE CODE; LINKS;
DOI
10.1109/CSMR.2013.29
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Information Retrieval (IR) has been widely accepted as a method for automated traceability recovery based on the textual similarity among the software artifacts. However, a notorious difficulty for IR-based methods is that artifacts may be related even if they are not textually similar. A growing body of work addresses this challenge by combining IR-based methods with structural information from source code. Unfortunately, the accuracy of such methods is highly dependent on the IR methods. If the IR methods perform poorly, the combined approaches may perform even worse. In this paper, we propose to use the feedback provided by the software engineer when classifying candidate links to regulate the effect of using structural information. Specifically, our approach only considers structural information when the traceability links from the IR methods are verified by the software engineer and classified as correct links. An empirical evaluation conducted on three systems suggests that our approach outperforms both a pure IR-based method and a simple approach for combining textual and structural information.
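The feedback-gated idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name, the `structure` map of structurally related classes, the multiplicative `bonus`, and the sample data are all assumptions made for illustration. The only property taken from the abstract is that structural information is applied solely after the software engineer classifies a candidate link as correct.

```python
from typing import Callable, Dict, List, Set, Tuple

Link = Tuple[str, str]  # (requirement id, code class id)


def feedback_gated_ranking(
    ir_scores: Dict[Link, float],
    structure: Dict[str, Set[str]],
    is_correct: Callable[[Link], bool],
    bonus: float = 1.0,
) -> List[Link]:
    """Return candidate links in the order they are shown to the engineer.

    Structural information is used only after a positive verdict: when
    (req, cls) is confirmed, every still-unclassified link (req, neighbour)
    whose class is structurally related to cls has its score raised.
    """
    scores = dict(ir_scores)  # working copy; the input is left untouched
    presented: List[Link] = []
    while scores:
        # Show the top-ranked unclassified candidate (ties broken by link id).
        link = max(scores, key=lambda k: (scores[k], k))
        del scores[link]
        presented.append(link)
        if not is_correct(link):
            continue  # rejected link: structural information is ignored
        req, cls = link
        for neighbour in structure.get(cls, set()):
            candidate = (req, neighbour)
            if candidate in scores:
                # Hypothetical multiplicative bonus, not taken from the paper.
                scores[candidate] *= 1.0 + bonus
    return presented


# Hypothetical data: class A calls class B; R1 truly traces to A and B.
ir_scores = {("R1", "A"): 0.9, ("R1", "B"): 0.4,
             ("R1", "C"): 0.5, ("R2", "B"): 0.6}
structure = {"A": {"B"}}
truth = {("R1", "A"), ("R1", "B")}

order = feedback_gated_ranking(ir_scores, structure, truth.__contains__)
```

In this toy run, confirming (R1, A) lifts (R1, B) above two higher-scoring but incorrect candidates, whereas a rejected link never propagates a bonus to its structural neighbours.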
Pages: 199-208
Number of pages: 10