Semantically Enhanced Software Traceability Using Deep Learning Techniques

被引:186
作者
Guo, Jin [1 ]
Cheng, Jinghui [1 ]
Cleland-Huang, Jane [1 ]
机构
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
来源
2017 IEEE/ACM 39TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE) | 2017年
基金
美国国家科学基金会;
关键词
Traceability; Deep Learning; Recurrent Neural Network; Semantic Representation; REQUIREMENTS; CODE;
D O I
10.1109/ICSE.2017.9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In most safety-critical domains the need for traceability is prescribed by certifying bodies. Trace links are generally created among requirements, design, source code, test cases and other artifacts; however, creating such links manually is time consuming and error prone. Automated solutions use information retrieval and machine learning techniques to generate trace links; however, current techniques fail to understand semantics of the software artifacts or to integrate domain knowledge into the tracing process and therefore tend to deliver imprecise and inaccurate results. In this paper, we present a solution that uses deep learning to incorporate requirements artifact semantics and domain knowledge into the tracing solution. We propose a tracing network architecture that utilizes Word Embedding and Recurrent Neural Network (RNN) models to generate trace links. Word embedding learns word vectors that represent knowledge of the domain corpus and RNN uses these word vectors to learn the sentence semantics of requirements artifacts. We trained 360 different configurations of the tracing network using existing trace links in the Positive Train Control domain and identified the Bidirectional Gated Recurrent Unit (BI-GRU) as the best model for the tracing task. BI-GRU significantly out-performed state-of-the-art tracing methods including the Vector Space Model and Latent Semantic Indexing.
引用
收藏
页码:3 / 14
页数:12
相关论文
共 71 条
[1]   Combining Deep Learning with Information Retrieval to Localize Buggy Files for Bug Reports [J].
An Ngoc Lam ;
Anh Tuan Nguyen ;
Hoan Anh Nguyen ;
Nguyen, Tien N. .
2015 30TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2015, :476-481
[2]  
[Anonymous], 2016, Deep learning
[3]  
[Anonymous], 2013, P 2 INT C LEARN REPR
[4]  
[Anonymous], 2015, Reasoning about entailment with neural attention
[5]  
[Anonymous], 1998, Introduction to expert systems
[6]  
[Anonymous], 2012, Software and Systems Traceability, DOI 10.1007/978-1-4471-2239-5_1
[7]   Recovering traceability links between code and documentation [J].
Antoniol, G ;
Canfora, G ;
Casazza, G ;
De Lucia, A ;
Merlo, E .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (10) :970-983
[8]  
Arora Chetan, 2013, 2013 ACM / IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM), P35, DOI 10.1109/ESEM.2013.13
[9]  
Arora C, 2015, INT REQUIR ENG CONF, P6, DOI 10.1109/RE.2015.7320403
[10]   Ontology-based multiperspective requirements traceability framework [J].
Assawamekin, Namfon ;
Sunetnanta, Thanwadee ;
Pluempitiwiriyawej, Charnyote .
KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 25 (03) :493-522