Software Requirement Traceability Analysis Using Text Mining Methods

被引:0
作者
Hatipoglu, Poyraz Umut [1 ]
Atvar, Anil [1 ]
Artan, Yusuf Oguzhan [1 ]
Sereflisan, Oguzhan [1 ]
Demir, Ali [1 ]
机构
[1] HAVELSAN AS, Siber Guvenlik & Bilisim Teknol Dept, Mustafa Kemal Mahallesi 2120 Cad 39 Cankaya, Ankara, Turkey
来源
2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU) | 2017年
关键词
LSI; LDA; word2vec; tf-idf; Requirement Traceability Matrix (RTM);
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this study, text mining based methods are proposed for requirement traceability analysis which is one of the most essential steps in the software life cycle. It is aimed to automate the requirements traceability process of the software architecture, which is conducted by a data analyst manually, with the proposed methods. For this purpose, besides the tf-idf and Latent Semantic Analysis (LSI/LSA) based approaches which are commonly used in the literature, requirement and design matching activities are realized by using Latent Dirichlet Allocation (LDA) title modelling technique and word2vec models. While the tf-idf based LSI approach achieve the highest classification accuracy, the LDA based approach produces relatively lower classification accuracy than LSI models. The word2vec + tf-idf method which has better classification accuracy than both of the word2vec + BOW and BOW alone models is the method producing the third highest performance.
引用
收藏
页数:4
相关论文
共 12 条
  • [1] [Anonymous], 2007, THESIS U TECHNOLOGY
  • [2] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [3] DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
  • [4] 2-9
  • [5] Fix E., 1951, JOSEPH
  • [6] Advancing candidate link generation for requirements tracing: The study of methods
    Hayes, JH
    Dekhtyar, A
    Sundaram, SK
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2006, 32 (01) : 4 - 19
  • [7] Introduction to Information Retrieval
    Larson, Ray R.
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (04): : 852 - 853
  • [8] Recovering documentation-to-source-code traceability links using latent semantic indexing
    Marcus, A
    Maletic, JI
    [J]. 25TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, PROCEEDINGS, 2003, : 125 - 135
  • [9] Mikolov T., 2013, ADV NEURAL INFORM PR, P3111
  • [10] Mikolov T., 2013, P INT C LEARN REPR I, V2013, P3781, DOI [10.48550/ARXIV.1301.3781, DOI 10.48550/ARXIV.1301.3781]