Exploiting Parts-of-Speech for effective automated requirements traceability

Cited by: 23
Authors
Ali, Nasir [1 ]
Cai, Haipeng [2 ]
Hamou-Lhadj, Abdelwahab [3 ]
Hassine, Jameleddine [4 ]
Affiliations
[1] Univ Memphis, Dept Comp Sci, Memphis, TN 38152 USA
[2] Washington State Univ, Sch Elect Engn & Comp Sci, Pullman, WA 99164 USA
[3] Concordia Univ, Elect & Comp Engn Dept, Montreal, PQ, Canada
[4] King Fahd Univ Petr & Minerals, Dept Informat & Comp Sci, Dhahran, Saudi Arabia
Keywords
Requirements traceability (RT); Parts of Speech (POS); Information retrieval (IR); Trace links; DESIGN-CODE TRACEABILITY; DOCUMENTATION; LINKS;
DOI
10.1016/j.infsof.2018.09.009
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Context: Requirements traceability (RT) is defined as the ability to describe and follow the life of a requirement. RT helps developers ensure that relevant requirements are implemented and that the source code is consistent with its requirements with respect to a set of traceability links called trace links. Previous work leverages Parts of Speech (POS) tagging of software artifacts to recover trace links among them. These studies work on the premise that discarding one or more POS tags improves the accuracy of Information Retrieval (IR) techniques.
Objective: First, we show empirically that excluding one or more POS tags can negatively impact the accuracy of existing IR-based traceability approaches, namely the Vector Space Model (VSM) and the Jensen-Shannon Model (JSM). Second, we propose a method that improves the accuracy of IR-based traceability approaches.
Method: We developed an approach, called ConPOS, to recover trace links using constraint-based pruning. ConPOS uses major POS categories and applies constraints to the recovered trace links, pruning them as a filtering step to significantly improve the effectiveness of IR-based techniques. We conducted an experiment to provide evidence that removing POS tags does not improve the accuracy of IR techniques. Furthermore, we conducted two empirical studies to evaluate the effectiveness of ConPOS in recovering trace links compared to existing peer RT approaches.
Results: The results of the first empirical study show that removing one or more POS tags negatively impacts the accuracy of VSM and JSM. Furthermore, the results from the other empirical studies show that ConPOS provides 11%-107%, 8%-64%, and 15%-170% higher precision, recall, and mean average precision (MAP) than VSM and JSM.
Conclusion: We showed that ConPOS outperforms existing IR-based RT approaches that discard some POS tags from the input documents.
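The abstract names two IR baselines (VSM and JSM) and the MAP metric without spelling them out. The following is a minimal sketch of how such baselines are commonly computed, not the authors' implementation and not ConPOS itself, whose constraints the abstract does not detail. The whitespace tokenizer, the smoothed IDF variant, and all function names (`vsm_scores`, `jsm_scores`, `average_precision`) are assumptions made for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: plain lowercase whitespace tokenization, no stemming or POS tagging.
    return text.lower().split()


def vsm_scores(query, docs):
    """Cosine similarity between TF-IDF vectors of the query and each document."""
    doc_tokens = [tokenize(d) for d in docs]
    n = len(docs)
    df = Counter(t for toks in doc_tokens for t in set(toks))
    # Assumption: log(n/df) + 1 is only one of several common IDF variants.
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}

    def vec(tokens):
        tf = Counter(tokens)
        return {t: tf[t] * idf[t] for t in tf if t in idf}

    q = vec(tokenize(query))
    q_norm = math.sqrt(sum(v * v for v in q.values()))
    scores = []
    for toks in doc_tokens:
        d = vec(toks)
        d_norm = math.sqrt(sum(v * v for v in d.values()))
        dot = sum(w * d.get(t, 0.0) for t, w in q.items())
        scores.append(dot / (q_norm * d_norm) if q_norm and d_norm else 0.0)
    return scores


def jsm_scores(query, docs):
    """Similarity as 1 minus the base-2 Jensen-Shannon divergence of term distributions."""
    def dist(tokens):
        tf = Counter(tokens)
        total = sum(tf.values())
        return {t: c / total for t, c in tf.items()}

    def entropy(p):
        return -sum(v * math.log2(v) for v in p.values() if v > 0)

    q = dist(tokenize(query))
    scores = []
    for text in docs:
        d = dist(tokenize(text))
        # Mixture distribution M = (P + Q) / 2 over the union vocabulary.
        m = {t: 0.5 * (q.get(t, 0.0) + d.get(t, 0.0)) for t in set(q) | set(d)}
        jsd = entropy(m) - 0.5 * (entropy(q) + entropy(d))
        scores.append(1.0 - jsd)
    return scores


def average_precision(ranked, relevant):
    """Average precision of one ranked list; MAP is the mean of this over all queries."""
    hits, total = 0, 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0
```

In this sketch a requirement plays the role of the query and each source file is a document; candidate trace links are the documents ranked by score. A POS-based variant would filter the tokens by tag before scoring, which is the step the paper reports can hurt accuracy when tags are discarded.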
Pages: 126-141
Page count: 16