Enhancing Unsupervised Requirements Traceability with Sequential Semantics

被引:11
作者
Chen, Lei [1 ]
Wang, Dandan [1 ]
Wang, Junjie [1 ]
Wang, Qing [1 ]
机构
[1] Chinese Acad Sci, Lab Internet Software Technol, Inst Software, Beijing, Peoples R China
来源
2019 26TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC) | 2019年
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
requirements traceability; sequential patterns; sequential semantics; document embedding representation; LINKS;
D O I
10.1109/APSEC48747.2019.00013
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Requirements traceability provides important support throughout all software life cycle; however, creating such links manually is time-consuming and error-prone. Supervised automated solutions use machine learning or deep learning techniques to generate trace links, but require large labeled dataset to train an effective model. Unsupervised solutions as word embedding approaches can generate links by capturing the semantic meaning of artifacts and are gaining more attention. Despite that, our observation revealed that, besides the semantic information, the sequential information of terms in the artifacts would provide additional assistance for building the accurate links. This paper proposes an unsupervised requirements traceability approach (named S2Trace) which learns the Sequential Semantics of software artifacts to generate the trace links. Its core idea is to mine the sequential patterns and use them to learn the document embedding representation. Evaluation is conducted on five public datasets, and results show that our approach outperforms three typical baselines. The modeling of sequential information in this paper provides new insights into the unsupervised traceability solutions, and the improvement in the traceability accuracy further proves the usefulness of the sequential information.
引用
收藏
页码:23 / 30
页数:8
相关论文
共 34 条
[1]  
[Anonymous], 2010, 2010 ACM IEEE 32 INT
[2]  
Arora S., 2017, P C LEARN REPR ICLR
[3]   Ontology-based multiperspective requirements traceability framework [J].
Assawamekin, Namfon ;
Sunetnanta, Thanwadee ;
Pluempitiwiriyawej, Charnyote .
KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 25 (03) :493-522
[4]   Improving IR-based traceability recovery via noun-based indexing of software artifacts [J].
Capobianco, Giovanni ;
De Lucia, Andrea ;
Oliveto, Rocco ;
Panichella, Annibale ;
Panichella, Sebastiano .
JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2013, 25 (07) :743-762
[5]  
Dietrich T, 2013, IEEE INT CONF AUTOM, P586, DOI 10.1109/ASE.2013.6693117
[6]   Estimating the number of remaining links in traceability recovery [J].
Falessi, Davide ;
Di Penta, Massimiliano ;
Canfora, Gerardo ;
Cantone, Giovanni .
EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (03) :996-1027
[7]   A Subsequence Interleaving Model for Sequential Pattern Mining [J].
Fowkes, Jaroslav ;
Sutton, Charles .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :835-844
[8]  
Gibiec M., 2010, Proceedings of the IEEE/ACM International Conference on Automated Software Engineering, ASE '10, P245
[9]  
Gotel O. C. Z., 1994, Proceedings of the First International Conference on Requirements Engineering (Cat. No.94TH0613-0), P94, DOI 10.1109/ICRE.1994.292398
[10]   node2vec: Scalable Feature Learning for Networks [J].
Grover, Aditya ;
Leskovec, Jure .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :855-864